Skip to content
View sipemu's full-sized avatar

Highlights

  • Pro

Block or report sipemu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sipemu/README.md

Simon Müller

Systems Architect | Rust Engineer | Ph.D. Mathematician

I specialise in High-Performance Computing (HPC) and ML Infrastructure. My focus is replacing GIL-bound Python bottlenecks with highly optimised Rust and C++ kernels, leveraging SIMD, zero-copy data transfer (Apache Arrow), and in-process OLAP engines (DuckDB).

LinkedInCrates.io


🔬 Data Science Focus

  • Advanced Forecasting: Large-scale Hierarchical Time Series (HTS), Probabilistic Forecasting, and Handling Intermittent Demand (Sparse Data) for Supply Chains.
  • Rigorous Statistics: Bayesian Inference (Stan), Design of Experiments (DoE), Functional Data Analysis (FDA).
  • Industrial AI: Anomaly Detection in manufacturing processes, Predictive Quality, and Root Cause Analysis using Causal Inference.

⚡ Engineering Focus

  • Systems Programming: Porting interpretability-heavy Python logic to Rust/C++ (WASM/Native).
  • GenAI Infrastructure: Building Model Context Protocol (MCP) servers and dependency-free inference engines for Foundation Models.
  • Data Engineering: Designing zero-copy ETL pipelines using DuckDB, Polars, and Apache Arrow.

🛠 Selected R&D

🦀 [Rust] Chronos-2 Inference Engine

  • Architecture: Full re-implementation of the Chronos-2 time-series foundation model in pure Rust.
  • Objective: Remove heavy PyTorch/Python dependencies for edge and high-throughput environments.
  • Tech: Candle / Burn, WASM, Tokio.

🚀 [Rust/C++] AnoFox Forecasting Engine

  • Performance: Achieved 2,900x speedup vs. statsmodels/pandas loops by moving logic to C++.
  • Design: Hybrid architecture using DuckDB for parallelized data shuffling and Rust for vectorized statistical kernels.
  • Tech: Rust, DuckDB C-API, OpenMP.

🤖 [Rust] MCP (Model Context Protocol) Servers

  • Implementation: Custom Rust-based servers implementing the MCP standard to inject dynamic context (DB schemas, API specs) into AI coding agents.
  • Tech: axum, serde, async-trait.

📦 Open Source & Crates

Crate / Repo Description Stack
sipemu 4+ Utility crates for statistical computing. Rust
AnoFox-Statistics High-performance statistical extension for DuckDB. Rust, DuckDB
Polars-Statistics FFI bindings for high-speed statistics on Polars DataFrames. Python, Rust, Polars

💻 Tech Stack

Core: Rust Python R

Data & ML: DuckDB Polars

Infrastructure: Docker AWS GitHub Actions


Pinned Loading

  1. DataZooDE/anofox-statistics DataZooDE/anofox-statistics Public

    A DuckDB extension for statistical regression analysis, providing OLS, Ridge, WLS, and time-series regression capabilities with full diagnostics and inference directly in SQL.

    Rust 5 1

  2. DataZooDE/anofox-forecast DataZooDE/anofox-forecast Public

    Statistical timeseries forecasting in DuckDB

    Rust 23 2

  3. anofox-statistics-rs anofox-statistics-rs Public

    Statistical tests in Rust

    Rust 2

  4. anofox-regression anofox-regression Public

    A rust library for regression analysis.

    Rust 1

  5. anofox-forecast anofox-forecast Public

    Timeseries forecasting in Rust

    Rust 1

  6. fdars fdars Public

    Functional Data Analysis in R and Rust - High-performance FDA algorithms including depth measures, metrics, clustering, smoothing, and regression

    Rust