Welcome to my portfolio of data engineering and analytics projects, with a primary focus on Microsoft Fabric. This repository showcases my ability to build end-to-end data solutions, from raw data ingestion to advanced analytics and business intelligence.
Each project folder contains a detailed README that breaks down the project's architecture, methodology, and key components.
- Project Name: Bing-News
- Description: An intelligent and scalable data pipeline that fetches and analyzes news data from the Bing News API. This project integrates machine learning for sentiment analysis and uses Data Activator for real-time alerting, providing a complete solution for monitoring media trends.
- Link to Project: Bing News Pipeline
- Key Skills: API Integration, Machine Learning (NLP), Delta Lake, Power BI, Data Activator.
- Project Name: Earthquake-Data-Project
- Description: A comprehensive data engineering solution that ingests, processes, and analyzes global earthquake data in real time from a public API. It demonstrates a robust pipeline built on Microsoft Fabric's Lakehouse and PySpark, with Power BI for visualization.
- Link to Project: Earthquake Data Pipeline
- Key Skills: Data Ingestion, PySpark Transformations, Lakehouse Architecture (Bronze/Silver/Gold), Power BI Reporting.
- Project Name: COVID-19-Data-Project
- Description: An end-to-end data solution that ingests, transforms, and analyzes COVID-19 data from the ECDC. The project features a dynamic pipeline for automated data ingestion, performance-optimized dataflows for transformation, and a multi-page Power BI report for clear, actionable insights.
- Link to Project: COVID-19 Data Project
- Key Skills: Dynamic Pipelines, Dataflows, Lakehouse/Warehouse Architecture, Data Transformation, Power BI Reporting.
I am a passionate data professional specializing in building scalable and reliable data solutions. My expertise lies in leveraging modern cloud platforms to transform raw data into actionable insights and strategic business assets.