Data Analyst • Data Engineer • ML Practitioner
I build pipelines, dashboards, and machine learning models that help businesses move faster and smarter.
I’m a Data Analyst & Data Engineer who loves building things that scale,
from ETL pipelines and analytics layers to ML forecasting models and interactive dashboards.
I enjoy the full journey:
Data → Insights → Decisions → Business Value.
- Modular ETL/ELT workflows
- dbt-style SQL modeling (staging → marts)
- Databricks, Delta Lake, PySpark
- Workflow automation & data quality checks
- KPI engineering, cohort behavior, funnel analysis
- Dashboard design in Tableau & Looker Studio
- Business analytics for e-commerce, inventory, and operations
- Forecasting (energy, sales, demand)
- Classification & regression (XGBoost, LSTM)
- MLflow tracking, evaluation metrics
- A/B testing
- Uplift modeling
- ROI & causal insights
Languages / DBs
SQL • Python • PostgreSQL • BigQuery
ML / AI
XGBoost • Random Forest • LSTM • TensorFlow/Keras
Data Engineering
dbt-style modeling • ETL/ELT • PySpark • Delta Lake • Databricks
BI / Analytics Tools
Tableau • Looker Studio • Excel • GA4
Ops & Productivity
GitHub • Virtual Environments • Workflow automation
📌 Databricks • PySpark • Delta Lake • Tableau • SQL
End-to-end Lakehouse pipeline on Databricks & Spark that processes incremental e-commerce events. It implements Landing → Bronze → Silver → Gold layers with daily job orchestration and produces business-ready funnel & product metrics for Tableau dashboards.
🔗 Repo: https://github.com/Sajithpemarathna/ecommerce-product-funnel-analytics-databricks
📌 SQL • Python • ETL design • Tableau
- Built a full analytics stack: raw → staging → dimension models
- Delivered insights on revenue, customer behavior & delivery delays
🔗 Repo: https://github.com/Sajithpemarathna/olist-ecommerce-analytics
📌 SQL • Python • KPI engineering • Dashboarding
- Identified aging stock, pricing gaps & margin leakage
- Documented 54% unsold stock & 346+ days of stock aging for follow-up action
🔗 Repo: https://github.com/Sajithpemarathna/Inventory-business-case
📌 ML (RF/XGBoost/LSTM) • SQL • Time-series analysis
- Forecasts up to 2030 to support sustainability & policy planning
- Advanced dbt patterns
- Workflow orchestration (Airflow-style)
- Scalable forecasting pipelines
- Behavioral analytics & uplift modeling
- I love creating clean, story-driven dashboards
- I enjoy projects where analytics directly changes business outcomes
- I like simplifying complex datasets into insights people actually use
- I'm a big fan of ML models that solve practical, real-world problems
📍 Berlin, Germany
🔗 LinkedIn: https://www.linkedin.com/in/sajith-pemarathna
📬 Email: sajiths.pemarathna@gmail.com

