Python & Machine Learning Projects



EDA - Amazon Products and Discounts 2023

Conducted Exploratory Data Analysis on a sizable dataset of Amazon products and discounts from 2023. Through data cleaning, uncovered patterns and trends and examined relationships between variables. These insights enable businesses to make data-driven decisions on the Amazon platform.


KKBox Music Recommendations

Created a Predictive model for KKBOX that accurately predicts users' repetitive song listening. Used a Decision Tree model for binary classification and evaluated performance with metrics like classification report, confusion matrix, accuracy and cross-validation scores.


Cardiovascular (Heart attack) Disease Prediction

Worldwide, cardiovascular illnesses are the primary cause of death. The goal of this study is to forecast cardiovascular disease by identifying the major risk factors for heart attacks and creating a reliable prediction system.


Mercedes Benz Greener Manufacturing Project

Mercedes-Benz Greener Manufacturing project aims to optimize the testing system for their premium cars. This is achieved through the reduction of testing time by implementing a robust algorithm capable of predicting testing durations based on various feature combinations.
Implemented label encoding, dimensionality reduction and used XGBoost for accurate predictions. This resulted in faster testing and lower carbon dioxide emissions while maintaining Mercedes-Benz's high standards.


311 NYC Customer Request Analysis

The goal of this project was to provide an in-depth analysis of the NYC311 service request data that was gathered in New York City. The project concentrated on improving abilities in Data Wrangling, Exploratory Analysis and Data Visualization by utilizing Data Science methodologies.
The objective of the investigation was to derive significant insights into the customer service trends and complaint patterns that are common in New York City's variable environment.


Bank Term Deposit Prediction

Built a Decision Tree classifier to predict whether a client will subscribe to a Term Deposit based on their demographic and behavioral data. This predictive model can help Banks tailor their marketing strategies for maximum efficiency.


Movielens Case Study

Conducted Exploratory Data Analysis on Movielens dataset and built predictive models using XGBoost, Random Forest and Decision Tree to forecast movie ratings. Engineered features, identified key factors influencing ratings and applied Data Analysis and Machine Learning to generate actionable recommendations.


Diabetes Prediction

Project aims to improve Healthcare diagnostic capabilities by creating a predictive model that can reliably identify a patient's risk of Diabetes based on particular diagnostic parameters. The source of dataset is National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)



Portfolio Home