Skip to content
View anujoshi3390's full-sized avatar

Block or report anujoshi3390

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
anujoshi3390/README.md

πŸ‘‹ Hi Welcome to My Data Portfolio!

Hi, I’m Anu Joshi, a Data Scientist passionate about transforming data into actionable insights. With a strong background in machine learning, deep learning, geospatial analysis, and data visualization, I specialize in applying Python, Power BI, and GIS to solve real-world problems across multiple industries.

I am currently pursuing my Master of Science in Data Science at the University of Boston and completed MIT’s Applied Data Science Program, where I’ve gained hands-on experience in AI, ML, and recommendation systems. My work spans climate science, business analytics, healthcare and marketing, leveraging data-driven strategies to optimize decision-making.

πŸ“Œ Featured Projects

β—ΎBoston House Price Prediction – Repository Leveraged machine learning and regression modeling to forecast housing prices, uncover key price drivers, and enhance investment insights.

β—Ύ Automotive Pricing Analysis – Repository Leveraged regression and tree-based models to identify key customer segments

β—Ύ Business Data Analysis: Restaurant Demand Forecasting – Repository Optimized restaurant performance using SQL and Python to identify customer demand trends.

β—Ύ Vaccine Marketing Strategy : N1H1 - Repository Optimized target market on who should be approached to receive the N1H1 vaccine and how resources should be allocated.

β—Ύ Pneumonia X-ray Detection – Repository Built a convolutional neural network (CNN) for medical image classification

More Projects Coming Soon!

Skills & Technologies

βœ” Programming & Data Analysis: Python (Pandas, NumPy, Scikit-Learn), SQL, βœ” Machine Learning & AI: Deep Learning, Time Series Forecasting, Recommendation Systems βœ” Data Visualization: Power BI, Matplotlib, Seaborn βœ” Big Data & Cloud: AWS, βœ” Geospatial Analysis: GIS, ArcGIS.

Algorithms I Have Experience With

I have worked with a wide range of machine learning, deep learning, and statistical algorithms, applying them to real-world datasets across marketing, healthcare, business analytics, and climate science. Below is a categorized list of the algorithms I have experience with:

πŸ”Ή Supervised Learning Algorithms βœ” Linear Regression – Predicting continuous outcomes (e.g., sales forecasting, house prices) βœ” Logistic Regression – Binary classification problems (e.g., fraud detection, churn prediction) βœ” Decision Trees – Intuitive classification and regression models βœ” Random Forest – Ensemble learning for improved accuracy and stability βœ” Gradient Boosting (XGBoost, LightGBM, CatBoost) – High-performance predictive modeling βœ” Support Vector Machines (SVM) – Complex classification problems βœ” k-Nearest Neighbors (k-NN) – Instance-based learning for pattern recognition

πŸ”Ή Unsupervised Learning Algorithms βœ” K-Means Clustering – Customer segmentation, market analysis βœ” Hierarchical Clustering – Identifying natural groupings in data βœ” DBSCAN – Density-based clustering for anomaly detection βœ” Principal Component Analysis (PCA) – Dimensionality reduction for feature selection βœ” t-SNE & UMAP – High-dimensional data visualization

πŸ”Ή Time Series & Forecasting Models βœ” ARIMA (AutoRegressive Integrated Moving Average) – Time series forecasting βœ” SARIMA & Prophet – Seasonal time series forecasting βœ” LSTMs (Long Short-Term Memory Networks) – Deep learning for sequential data

πŸ”Ή Deep Learning & Neural Networks βœ” Feedforward Neural Networks (FNNs) – Basic neural network architectures βœ” Convolutional Neural Networks (CNNs) – Image classification and feature extraction βœ” Recurrent Neural Networks (RNNs) – Sequential data analysis βœ” Transformers (BERT, GPT, T5) – Natural Language Processing (NLP) βœ” Autoencoders – Anomaly detection, feature engineering βœ” GANs (Generative Adversarial Networks) – Synthetic data generation

πŸ”Ή Recommendation Systems βœ” Collaborative Filtering (User-Based & Item-Based) – Personalized recommendations βœ” Matrix Factorization (SVD, ALS) – Latent factor modeling βœ” Hybrid Recommendation Systems – Combining multiple techniques

πŸ”Ή Anomaly Detection & Optimization βœ” Isolation Forest – Detecting rare events and anomalies βœ” One-Class SVM – Outlier detection in high-dimensional data βœ” Genetic Algorithms – Optimization and search problems

I have applied these algorithms using Python (Scikit-Learn, TensorFlow, PyTorch, XGBoost, StatsModels,) and have optimized models for business intelligence, climate data science.

πŸš€ Let’s Connect!

πŸ“§ Email:anujoshi3390@gmail.com LinkedIn: linkedin.com/in/anuradhasjoshi

Looking for exciting data science collaborations or career opportunities? Feel free to reach outβ€”I’d love to chat!

Pinned Loading

  1. Pneumonia-Chest-X-Rays-Detection Pneumonia-Chest-X-Rays-Detection Public

    Repository files navigation README Chest X-Ray Abnormality Detection Multi-Label Convolutional Neural Network (CNN) Image Classification Model

    Jupyter Notebook

  2. Data-Science-Project-Portfolio Data-Science-Project-Portfolio Public

    Regression and classification models, NLP sentiment analysis, neural networks, time series forecasting, exploratory data analysis

    Jupyter Notebook