π Currently working on: Recommender Systems with PySpark.
- βοΈ Content-Based Recommender Systems: Jaccard and Cosine Similarities, TF-IDF, Word2Vec + LSH and SVD.
- βοΈ User-Based Recommender Systems: Cosine Similarity and ALS (Alternating Least Squares).
- π² Cold Start Problem.
- π² Streaming Updates: update models in real time.
- π² Handling cache for real time responses (Spark Structured Streaming or Kafka, not sure yet).
π Also working on: Building Machine Learning Models From Scratch.
- βοΈ Simple Linear Regression Model.
- βοΈ Multivariate Regression Model.
- βοΈ Logistic Regression: multivariate, but only 2 classes in target variable.
- βοΈ Multinomial Logistic Regression: multivariate and multiple classes in target variable.
- βοΈ Regularization Techniques: Ridge, Lasso and Elastic-Net.
- βοΈ Gradient Descent Optimization: batch size modifications, momentum and adaptive methods.
- π² Support Vector Machine (SVM).
- π² Neural Network.
π Side Projects:
- Simulations with Python (SimPy) and R (simmer), and visualization (animated plots and PyGame).
- AI bot to summarize Data Science news.
- Image-to-Text business notes taken by hand for further processing.
