Skip to content
View gbareas's full-sized avatar

Block or report gbareas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gbareas/README.md

Hi there, I'm Guillem Barea 👋

R&D Data Scientist | Physics-Informed AI | HPC

I am a PhD Researcher transitioning into industry. I specialize in analyzing high-dimensional, chaotic time-series data (Fluid Dynamics) and translating complex physical systems into actionable predictive models.

  • 🔭 Currently working on: Forecasting chaotic turbulence using LSTM Neural Networks.
  • 🎓 PhD Focus: Wall-bounded Supercritical Fluid Turbulence & Modal Analysis (POD/SPOD) at UPC.
  • Superpower: processing Terabyte-scale datasets (HDF5) efficiently using High-Performance Computing (HPC) clusters.
  • 👯 Looking to collaborate on: Physics-Informed Machine Learning (PINNs) & Time-Series Forecasting.

🛠️ Technical Stack

Languages & Core: Python Bash C++

Data Science & AI: NumPy TensorFlow Pandas Scikit-Learn

HPC & Tools: Linux Git HDF5


🚀 Featured Projects

Using Deep Learning to predict chaotic systems.

  • Objective: Predict the temporal evolution of turbulent velocity modes.
  • Method: Trained a Long Short-Term Memory (LSTM) network on temporal coefficients extracted from POD analysis.
  • Result: Successfully forecasted non-linear dynamics with low MSE, demonstrating the bridge between Fluid Dynamics and Sequence Modeling.
  • Tech: Python, Keras/TensorFlow, NumPy.

The core computational engine behind my PhD thesis, designed to process TB-scale DNS datasets.

This pipeline is divided into two specialized modules for spatial and spectral feature extraction:

  • Module A: Proper Orthogonal Decomposition (POD)

    • The Data Science: Unsupervised Learning / PCA (Principal Component Analysis).
    • Function: Performs Singular Value Decomposition (SVD) on massive snapshot matrices to extract dominant spatial features.
    • Method: Implemented parallelized POD (PCA) algorithms to reduce dimensionality and identify energy-containing vortices.
  • Module B: Spectral POD (SPOD)

    • The Data Science: Advanced Signal Processing / FFT (Fourier Transforms).
    • Function: Combines spectral estimation (Welch’s method) with dimensionality reduction to isolate coherent structures in the frequency domain.
    • Method: Implemented parallelized SPOD algorithms to reduce dimensionality and identify energy-containing vortices.

📊 The "Why" (My Philosophy)

I believe the hardest problems in industry—whether in Tech, Finance, or Engineering—are multivariate, stochastic, and noisy. My background isn't just "coding"; it's rigorous mathematical modeling. I don't just import libraries; I understand the linear algebra and statistics that power them.


📫 Connect with Me

Popular repositories Loading

  1. gbareas gbareas Public

    Config files for my GitHub profile.

  2. RHEA_to_SPOD RHEA_to_SPOD Public

    Python

  3. python-repo-template python-repo-template Public template

    Makefile

  4. predictive-maintenance-api predictive-maintenance-api Public

    Makefile

  5. RHEA_to_POD RHEA_to_POD Public

    This repository allows users to obtain POD modes, eigenvalues and temporal coefficients out of DNS data

    Python

  6. HP_thermodynamics_tools HP_thermodynamics_tools Public

    Python