Emerging Data Engineer | Building reliable data pipelines & AI-powered analytics | Cloud & ETL Enthusiast
I'm a data engineer passionate about data architecture, automation, and applied AI.
Currently pursuing my M.S. in Business Analytics at Fordham University, I serve as a Graduate Assistant in Database Management & Artificial Intelligence, helping students and faculty explore topics like SQL, ETL, cloud computing, and LLMs.
Previously, I built data validation pipelines, OLAP data warehouses, and scalable ETL workflows during my internship at Oeson Global.
My goal is to create governance-driven, production-grade data systems that empower organizations to make data-informed decisions.
Programming: Python | SQL | R | Java (basic)
Data Engineering: Apache Airflow | PySpark | Databricks | Great Expectations | Dataiku
Cloud & Infrastructure: AWS (Glue, S3) | Docker | Kafka (intro)
Data Warehousing: Snowflake (intro) | Delta Lake | MySQL | SQLite | Oracle
ML & AI: Scikit-learn | TensorFlow | NLP | LLMs (Gemini, OpenAI APIs)
Visualization: Power BI | Tableau | Matplotlib | Excel
Certifications:
IBM Data Engineering (in progress) | AWS Data Engineering | Google Data Analytics | Databricks Lakehouse Fundamentals | IBM Granite Code Optimization | NVIDIA Prompt Engineering (in progress)
Fordham University - Graduate Assistant, AI & Database Management
- Supported AI and database courses covering SQL, data modeling, ETL, big data, and applied ML.
- Validated and improved student projects using Scikit-learn, TensorFlow, and Generative AI tools.
Oeson Global - Data Engineering Intern
- Built reusable data validation pipelines using Great Expectations (improving accuracy by 25%).
- Designed a Databricks-based OLAP data warehouse using PySpark and a star schema.
Terminaux Vraquiers du Sénégal - Operations Manager
- Optimized reporting and performance dashboards for 100+ employees.
- Improved process efficiency by 30% through data-driven insights.
- Global Banks Market Cap ETL - Automated ETL pipeline using Python, Pandas & SQLite.
- Spotify Artist Analytics (Airflow + MySQL) - Real-time ETL & visualization in Power BI.
- Amazon Review Fraud Detection - NLP-based sentiment classification using Scikit-learn.
- Weather Data Pipeline - Containerized Airflow DAGs for weather data analytics.
- AI-Powered Health Guidance System - LLM-based intelligent triage using Google Gemini API.
Education:
- M.S. Business Analytics - Fordham University (Expected Dec 2025)
- M.S. Auditing & Control - BEM Management School
- M.S. Quality, Hygiene, Security, Environment Management - ISM Dakar
Languages: Fluent English | Native French
Interests: Data lineage, real-time streaming, open-source data quality tools
Bronx, New York, USA
LinkedIn
gaelmayanza@gmail.com
"Turning data into actionable intelligence through scalable and ethical engineering."