My name is Harshada Phadol, a data science enthusiast with a background in software engineering. I am experienced in machine learning, supervised and unsupervised algorithms and use data visualization techniques to present the results. I love to read, sketch and photography. Lastly, I love learning. Every day I push myself to learn something new, whether that be about machine learning, software engineering, or miscellaneous facts about the universe.
This project demonstrates the usage of Hadoop, MapReduce, and Hive on big data. The dataset files for comments comprise of over 2 million comments in total with 34 features. This data will help the purpose of understanding and analyzing the public reading interests, analyzing behaviors.
This repository consists of a Jupyter/Python notebook for the COVID19 prediction and analysis using SEIR epidemiological model and Genetic Algorithm mainly focused on the infection rate calculation.
Implementation of Decision Tree Classification, Random Forest Classification and Naïve Bayes Classification in Python.
Analysis and visualization of beer review dataset based on aroma, taste, appearance, palette and many other features using Python.