Skip to content
View shetty-shithil's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Champaign, United States

Block or report shetty-shithil

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shetty-shithil/README.md

πŸ‘‹πŸ» Hi, I'm Shithil Sudarshan Shetty

Data Scientist | Data Engineer | ML Engineer

🎯 Data Scientist & Data Engineer with 3 years of industry experience building ML models, Generative AI applications, scalable data pipelines, and cloud-native systems on AWS, now pursuing an MS in Information Management (Data Science & Analytics) at UIUC.

I love solving real-world problems at the intersection of Machine Learning,Generative AI, LLMs, and Data Engineering, and turning messy data into impactful insights.


πŸš€ About Me

  • πŸ”­ Currently pursuing: MS in Information Management @ UIUC
  • 🧠 Passionate about: LLMs, Generative AI, Data Engineering, Applied ML
  • πŸ’Ό Previously: Senior Data Engineer (Business Intelligence), Piramal Finance
  • 🀝 Open to: Collaborations in AI, ML, LLMOps, and Data Platform Engineering
  • 🌍 Location: Champaign, IL (open to relocate)
  • ⚑ Fun fact: I enjoy exploring system design + trying out new ML frameworks!

πŸ’Ό Experience Summary

Senior Data Engineer - Piramal Finance (2022–2025)

  • Built an LLM-powered SQL optimization agent β†’ reduced query latency 60% & compute cost 25%
  • Designed an automated Kafka–Airflow loan pipeline β†’ enabled real-time MIS, driving 60% business growth
  • Developed a GenAI-powered Data Catalog agent using GPT + Snowflake metadata β†’ cut onboarding time 40%
  • Built a 250+ column Snowflake data mart with PySpark, AWS Lambda, S3 β†’ reduced manual prep time 70%

Data Science Intern - MRPL (2022)

  • Improved data reliability 25% through automated quality checks
  • Reduced manual reporting work by 5+ hrs/week with ML-driven ETL automation

πŸ“š Publications

3D Model Rendering Using Three.js for Campus Visualization
IEEE ICAST, 2022
Link: https://ieeexplore.ieee.org/document/10039553


πŸ› οΈ Tech Stack

Languages & Machine Learning

Python R NumPy Pandas PyTorch TensorFlow

Data Engineering & Cloud

AWS Snowflake Airflow Spark Kafka

GenAI & NLP

HuggingFace OpenAI

Visualization & Analytics

Tableau Matplotlib

Databases & Tools

MySQL PostgreSQL Git GitHub Jira JavaScript

πŸ”— Connect With Me

🌐 LinkedIn: https://www.linkedin.com/in/shithil-shetty
πŸ“§ Email: shetty7@illinois.edu

Pinned Loading

  1. ctr-prediction-engagement-modeling ctr-prediction-engagement-modeling Public

    CTR prediction and engagement modeling for social media ads using exploratory analysis, feature engineering, and machine learning.

    Jupyter Notebook

  2. time-series-sales-forecasting time-series-sales-forecasting Public

    Sales forecasting and business performance analysis using Tableau dashboards and XGBoost-based machine learning.

    Jupyter Notebook

  3. Pyspark Pyspark Public

    Jupyter Notebook

  4. PranavCR01/tiktok-misinformation-tool PranavCR01/tiktok-misinformation-tool Public

    Python 3 4