Skip to content
View Sajithpemarathna's full-sized avatar

Block or report Sajithpemarathna

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sajithpemarathna/README.md

✨ Hi, I'm Sajith — Turning Raw Data into Real Impact ✨

Data Analyst • Data Engineer • ML Practitioner
I build pipelines, dashboards, and machine learning models that help businesses move faster and smarter.


🚀 About Me

I’m a Data Analyst & Data Engineer who loves building things that scale —
from ETL pipelines and analytics layers, to ML forecasting models and interactive dashboards.

I enjoy the full journey:
Data → Insights → Decisions → Business Value.


🧠 What I Do Best

🔧 Data Engineering

  • Modular ETL/ELT workflows
  • dbt-style SQL modeling (staging → marts)
  • Databricks, Delta Lake, PySpark
  • Workflow automation & data quality checks

📊 Analytics & BI

  • KPI engineering, cohort behavior, funnel analysis
  • Dashboard design in Tableau & Looker Studio
  • Business analytics for e-commerce, inventory, and operations

🤖 Machine Learning

  • Forecasting (energy, sales, demand)
  • Classification & regression (XGBoost, LSTM)
  • MLflow tracking, evaluation metrics

🎯 Experimentation

  • A/B testing
  • Uplift modeling
  • ROI & causal insights

🛠 Tech Toolbox

Languages / DBs
SQLPythonPostgreSQLBigQuery

ML / AI
XGBoostRandom ForestLSTMTensorFlow/Keras

Data Engineering
dbt-style modelingETL/ELTPySparkDelta LakeDatabricks

BI / Analytics Tools
TableauLooker StudioExcelGA4

Ops & Productivity
GitHubVirtual EnvironmentsWorkflow automation


🚀 Featured Work

🛒 Ecommerce Product Funnel Analytics (Databricks)

📌 Databricks • PySpark • Delta Lake • Tableau • SQL

End-to-end Lakehouse pipeline using Databricks & Spark to process incremental ecommerce events. Implements Landing → Bronze → Silver → Gold layers with daily job orchestration and business-ready funnel & product metrics for Tableau dashboards.

🔗 Repo: https://github.com/Sajithpemarathna/ecommerce-product-funnel-analytics-databricks


🛒 Olist E-Commerce Analytics Pipeline

📌 SQL • Python • ETL design • Tableau


🚗 Used-Car Inventory & Pricing Analytics

📌 SQL • Python • KPI engineering • Dashboarding


Energy Forecasting for Germany (MSc Thesis)

📌 ML (RF/XGBoost/LSTM) • SQL • Time-series analysis

  • Forecasts up to 2030 supporting sustainability & policy planning

🌱 Currently Learning

  • Advanced dbt patterns
  • Workflow orchestration (Airflow-style)
  • Scalable forecasting pipelines
  • Behavioral analytics & uplift modeling

🎨 Fun Facts About Me

  • I love creating clean, story-driven dashboards
  • I enjoy projects where analytics directly changes business outcomes
  • I like simplifying complex datasets into insights people actually use
  • Big fan of ML models that solve practical, real-world problems

🤝 Let’s Connect!

📍 Berlin, Germany
🔗 LinkedIn: https://www.linkedin.com/in/sajith-pemarathna
📬 Email: sajiths.pemarathna@gmail.com


✨ Thanks for visiting — let's build something amazing with data! ✨

Pinned Loading

  1. olist-ecommerce-analytics olist-ecommerce-analytics Public

    End-to-end data analytics project using Python, SQL, PostgreSQL, and Tableau. Includes full pipeline (RAW → STAGING → DIM/FACT) and two interactive dashboards.

    Python 1

  2. Inventory-business-case Inventory-business-case Public

    Inventory analytics case study for an online used-car marketplace – from raw CSV to KPIs, Python, Excel, SQL, and Tableau dashboards.

    Jupyter Notebook 1

  3. Energy-Consumption-Forecasting-in-Germany-Using-Machine-Learning Energy-Consumption-Forecasting-in-Germany-Using-Machine-Learning Public

    Forecasted sector-wise energy use in Germany using Python, SQL, and ML models (Random Forest, XGBoost, LSTM). Delivered insights via Tableau dashboards to support data-driven energy policy.

    Jupyter Notebook 1