Hi, Iβm Anu Joshi, a Data Scientist passionate about transforming data into actionable insights. With a strong background in machine learning, deep learning, geospatial analysis, and data visualization, I specialize in applying Python, Power BI, and GIS to solve real-world problems across multiple industries.
I am currently pursuing my Master of Science in Data Science at the University of Boston and completed MITβs Applied Data Science Program, where Iβve gained hands-on experience in AI, ML, and recommendation systems. My work spans climate science, business analytics, healthcare and marketing, leveraging data-driven strategies to optimize decision-making.
βΎBoston House Price Prediction β Repository Leveraged machine learning and regression modeling to forecast housing prices, uncover key price drivers, and enhance investment insights.
βΎ Automotive Pricing Analysis β Repository Leveraged regression and tree-based models to identify key customer segments
βΎ Business Data Analysis: Restaurant Demand Forecasting β Repository Optimized restaurant performance using SQL and Python to identify customer demand trends.
βΎ Vaccine Marketing Strategy : N1H1 - Repository Optimized target market on who should be approached to receive the N1H1 vaccine and how resources should be allocated.
βΎ Pneumonia X-ray Detection β Repository Built a convolutional neural network (CNN) for medical image classification
More Projects Coming Soon!
β Programming & Data Analysis: Python (Pandas, NumPy, Scikit-Learn), SQL, β Machine Learning & AI: Deep Learning, Time Series Forecasting, Recommendation Systems β Data Visualization: Power BI, Matplotlib, Seaborn β Big Data & Cloud: AWS, β Geospatial Analysis: GIS, ArcGIS.
Algorithms I Have Experience With
I have worked with a wide range of machine learning, deep learning, and statistical algorithms, applying them to real-world datasets across marketing, healthcare, business analytics, and climate science. Below is a categorized list of the algorithms I have experience with:
πΉ Supervised Learning Algorithms β Linear Regression β Predicting continuous outcomes (e.g., sales forecasting, house prices) β Logistic Regression β Binary classification problems (e.g., fraud detection, churn prediction) β Decision Trees β Intuitive classification and regression models β Random Forest β Ensemble learning for improved accuracy and stability β Gradient Boosting (XGBoost, LightGBM, CatBoost) β High-performance predictive modeling β Support Vector Machines (SVM) β Complex classification problems β k-Nearest Neighbors (k-NN) β Instance-based learning for pattern recognition
πΉ Unsupervised Learning Algorithms β K-Means Clustering β Customer segmentation, market analysis β Hierarchical Clustering β Identifying natural groupings in data β DBSCAN β Density-based clustering for anomaly detection β Principal Component Analysis (PCA) β Dimensionality reduction for feature selection β t-SNE & UMAP β High-dimensional data visualization
πΉ Time Series & Forecasting Models β ARIMA (AutoRegressive Integrated Moving Average) β Time series forecasting β SARIMA & Prophet β Seasonal time series forecasting β LSTMs (Long Short-Term Memory Networks) β Deep learning for sequential data
πΉ Deep Learning & Neural Networks β Feedforward Neural Networks (FNNs) β Basic neural network architectures β Convolutional Neural Networks (CNNs) β Image classification and feature extraction β Recurrent Neural Networks (RNNs) β Sequential data analysis β Transformers (BERT, GPT, T5) β Natural Language Processing (NLP) β Autoencoders β Anomaly detection, feature engineering β GANs (Generative Adversarial Networks) β Synthetic data generation
πΉ Recommendation Systems β Collaborative Filtering (User-Based & Item-Based) β Personalized recommendations β Matrix Factorization (SVD, ALS) β Latent factor modeling β Hybrid Recommendation Systems β Combining multiple techniques
πΉ Anomaly Detection & Optimization β Isolation Forest β Detecting rare events and anomalies β One-Class SVM β Outlier detection in high-dimensional data β Genetic Algorithms β Optimization and search problems
I have applied these algorithms using Python (Scikit-Learn, TensorFlow, PyTorch, XGBoost, StatsModels,) and have optimized models for business intelligence, climate data science.
π§ Email:anujoshi3390@gmail.com LinkedIn: linkedin.com/in/anuradhasjoshi
Looking for exciting data science collaborations or career opportunities? Feel free to reach outβIβd love to chat!