
Hi, I'm Michael Christian Morgan.

Independent Machine Learning Researcher

📍 South Korea (Open to Remote Roles)

I am a researcher bridging the gap between Linguistics, Complex Systems Biology, and Artificial Intelligence.

With an academic foundation in East Asian Linguistics and Articulatory Phonetics from the University of Georgia, I approach Large Language Models not just as statistical predictors, but as dynamic systems akin to biological evolution and physical processes. My work focuses on escaping local minima in architectural design by applying principles from nature, physics, and "weird math" to deep learning.


🔬 Research Focus & Technical Stack

I specialize in Python and PyTorch. My work often requires going deeper than standard libraries, including writing custom C++/CUDA kernels for optimizing novel architectures like my Neuromodulatory Control Networks.

My current research interests include:

  • Physics-Inspired Architectures: Energy-based models, Hamiltonian dynamics, and thermodynamic approaches to loss landscapes.
  • Biologically Plausible AI: Neuromodulation, hypernetworks, and synaptic plasticity simulations.
  • Evolutionary Computing: Genetic algorithms applied to Transformer hyperparameter optimization and Artificial Life simulations.
  • Linguistics: Viewing tokenization through the lens of articulatory phonetics and continuous dynamic systems.

⚡ Featured Projects

Neural-Speciation: a massive-scale evolutionary simulator for hyperparameter optimization and artificial life.

[evolution_lab screenshot]

This project is a comprehensive framework that pits populations of "MicroTransformers" (~10k parameters) against each other in a survival-of-the-fittest environment scored by validation loss; a minimal sketch of the core loop appears after the feature list below.

  • The Genome: Agents possess 17 "genes" representing initialization hyperparameters.
  • Lifecycle: Agents fight, breed, mutate, clone, and evolve over generations.
  • Gene Forge: A tool for manual gene editing and Horizontal Gene Transfer (stealing genes from other successful agents).
  • Stress Chamber: A rigorous testing module for collecting performance data on specific genomes across random seeds.
  • Speciation Dashboard: Uses PCA to map 17-dimensional gene data onto a 3D interactive graph. It utilizes clustering to automatically categorize agents into taxonomic ranks (Kingdom, Phylum, Class, etc.), visualizing how distinct survival strategies evolve divergently.
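
As a rough illustration of the lifecycle above, here is a minimal sketch of an evolutionary loop over a 17-gene genome. It is not the Neural-Speciation implementation: names such as `make_genome`, `fitness`, `crossover`, and `mutate` are hypothetical, and the fitness function is a stand-in for training a MicroTransformer and measuring its validation loss.

```python
import random

NUM_GENES = 17        # each agent carries 17 initialization-hyperparameter "genes"
POP_SIZE = 64
MUTATION_STD = 0.05

def make_genome():
    # Hypothetical encoding: genes normalized to [0, 1]; the real project maps
    # them onto concrete settings (init scales, gains, etc.).
    return [random.random() for _ in range(NUM_GENES)]

def fitness(genome):
    # Stand-in for "train a ~10k-parameter MicroTransformer with this genome
    # and return its validation loss" (lower is better).
    return sum((g - 0.5) ** 2 for g in genome)

def crossover(a, b):
    # Uniform crossover: each gene is inherited from either parent.
    return [random.choice(pair) for pair in zip(a, b)]

def mutate(genome):
    # Gaussian mutation, clipped back into [0, 1].
    return [min(1.0, max(0.0, g + random.gauss(0.0, MUTATION_STD))) for g in genome]

population = [make_genome() for _ in range(POP_SIZE)]
for generation in range(20):
    ranked = sorted(population, key=fitness)              # survival of the fittest
    survivors = ranked[: POP_SIZE // 4]                   # top quartile breeds
    children = [
        mutate(crossover(random.choice(survivors), random.choice(survivors)))
        for _ in range(POP_SIZE - len(survivors))
    ]
    population = survivors + children
    print(f"gen {generation:02d}  best loss {fitness(ranked[0]):.4f}")
```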

Neuromodulatory Control Networks (NCN): a novel LLM architecture inspired by the human brain's neuromodulatory systems.

Successfully trained an 18M-parameter model on one epoch of TinyStories, achieving 4.5 PPL.

(Fig 1: "Neuromodulation") Token-Level Neuromodulatory Dynamics. Heatmaps display the NCN output values for Layer Gain (top), Attention Precision (middle), and FFN Gating (bottom); the X-axis is the token sequence, the Y-axis is layer depth (0-5).

(Fig 2: "perplexity_graph") Training Convergence Analysis. The NCN model demonstrates rapid perplexity reduction, stabilizing significantly below standard baselines for this parameter class. The dark blue line represents the macro-trend (span=118), showing a smooth descent without the volatility typically associated with hypernetwork training.

(Fig 3: "analysis_grammar_95") Grammar/Syntax Convergence (95%). The dashed blue line indicates a steep power-law fit, suggesting rapid acquisition of surface-level statistics.

(Fig 4: "analysis_intellectual_99") Intellectual Convergence (99%). The "Active Phase" (red) tightly hugs the power law, indicating sustained learning of higher-order logic without premature plateaus.

This architecture moves beyond standard Transformers by implementing a global modulation mechanism; a minimal sketch follows the list below.

  • Mechanism: Functions similarly to a hypernetwork but outputs modulation signals that dynamically adjust the Temperature, Gain, and FFN Gating of the main network blocks.
  • Implicit Learning: The network learns to contextually identify which modulation signals minimize loss without explicit supervision.
  • Optimization: Includes custom C++/CUDA kernels to handle the specific computational requirements of the modulation layers.
  • (Paper currently in pre-print phase, includes preliminary empirical results).
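
For intuition, the sketch below shows the general shape of the mechanism in plain PyTorch, with hypothetical names (`Neuromodulator`, `ModulatedBlock`); it is a simplified stand-in, not the NCN repository's code or its custom kernels. A small controller reads a pooled summary of the hidden states and emits gain, attention-temperature, and FFN-gate signals that scale the main block's computation.

```python
import torch
import torch.nn as nn

class Neuromodulator(nn.Module):
    """Hypothetical controller: maps a pooled context vector to three
    modulation signals (layer gain, attention temperature, FFN gate)."""
    def __init__(self, d_model: int):
        super().__init__()
        self.proj = nn.Linear(d_model, 3)

    def forward(self, hidden):                       # hidden: (batch, seq, d_model)
        context = hidden.mean(dim=1)                 # cheap global summary
        gain, temp, gate = self.proj(context).chunk(3, dim=-1)
        # Keep signals in friendly ranges: gain in (0, 2), temperature > 0.5, gate in (0, 1).
        return 2 * torch.sigmoid(gain), nn.functional.softplus(temp) + 0.5, torch.sigmoid(gate)

class ModulatedBlock(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.mod = Neuromodulator(d_model)

    def forward(self, x):                            # x: (batch, seq, d_model)
        gain, temp, gate = self.mod(x)
        q = x / temp.unsqueeze(1)                    # attention "temperature": rescale queries
        attn_out, _ = self.attn(q, x, x)
        x = x + gain.unsqueeze(1) * attn_out         # layer gain on the residual branch
        x = x + gate.unsqueeze(1) * self.ffn(x)      # FFN gating
        return x

block = ModulatedBlock(d_model=64, n_heads=4)
print(block(torch.randn(2, 16, 64)).shape)           # torch.Size([2, 16, 64])
```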

Phase-Slip Sampler: available on PyPI.

| Method | Diversity | Perplexity (Lower is Better) | Speed |
| --- | --- | --- | --- |
| Greedy Decoding | 0.09 ± 0.01 | 1.29 ± 0.02 (Robotic) | 20.4 T/s |
| Standard Sampling | 0.37 ± 0.14 | 4.49 ± 1.83 (Unstable) | 18.6 T/s |
| Phase-Slip (v1.0) | 0.32 ± 0.15 | 3.66 ± 1.65 (Coherent) | 6.8 T/s |

A research-grade inference architecture designed to solve the "Stability vs. Creativity" dilemma in LLMs. Unlike standard sampling, which adds noise to the output logits, Phase-Slip operates on the model's internal memory (the KV cache).

  • Orthonormal Vector Rotation: Instead of adding destructive noise, the sampler geometrically rotates the Key-Value (KV) vectors in high-dimensional space. This shifts the semantic perspective ("The Muse") while preserving the magnitude of the signal (Confidence); see the sketch after this list.
  • Dual-Path Logit Anchoring: The system performs two forward passes per token: one "Clean" (Grammar) and one "Phantom" (Creative). It mathematically fuses these outputs using a dynamic confidence gate, ensuring the model never hallucinates wildly.
  • Automatic Head Calibration: Includes a scanning utility to identify and target specific "creative" attention heads while leaving critical "structural" heads untouched.
  • Trade-off: This is a heavy-duty sampling method. It achieves ~18.5% better logical coherence (Perplexity) than Standard Sampling, with lower standard deviation, but at the cost of running at ~35% of the speed due to the dual-path computation. Best suited for creative writing where quality supersedes latency.
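
The sketch below illustrates the two core operations under stated assumptions: a norm-preserving rotation applied to cached key/value vectors, and a confidence-gated fusion of the clean and phantom logits. The function names (`random_rotation`, `rotate_kv`, `fuse_logits`) and the specific gating rule are illustrative, not the package's actual API.

```python
import torch

def random_rotation(dim: int, angle: float) -> torch.Tensor:
    """Build an orthonormal matrix that rotates by `angle` in one random 2-D plane
    of a `dim`-dimensional space (norm-preserving, so vector magnitudes survive)."""
    q, _ = torch.linalg.qr(torch.randn(dim, dim))    # random orthonormal basis
    rot = torch.eye(dim)
    c, s = torch.cos(torch.tensor(angle)), torch.sin(torch.tensor(angle))
    rot[0, 0], rot[0, 1], rot[1, 0], rot[1, 1] = c, -s, s, c
    return q @ rot @ q.T                             # rotate within the chosen plane

def rotate_kv(kv: torch.Tensor, angle: float = 0.15) -> torch.Tensor:
    """kv: (..., head_dim). Geometric perturbation of cached keys/values."""
    return kv @ random_rotation(kv.shape[-1], angle).T

def fuse_logits(clean: torch.Tensor, phantom: torch.Tensor) -> torch.Tensor:
    """Blend toward the creative 'phantom' path where the clean path is confident,
    and fall back to the clean logits where it is not (illustrative gate only)."""
    probs = torch.softmax(clean, dim=-1)
    confidence = probs.max(dim=-1, keepdim=True).values   # peakedness of clean distribution
    return (1 - confidence) * clean + confidence * phantom

# Norm-preservation check: rotation changes direction, not magnitude.
kv = torch.randn(8, 64)                              # e.g. 8 cached vectors, head_dim 64
rotated = rotate_kv(kv)
print(torch.allclose(kv.norm(dim=-1), rotated.norm(dim=-1), atol=1e-4))  # True

fused = fuse_logits(torch.randn(1, 32000), torch.randn(1, 32000))
print(fused.shape)                                   # torch.Size([1, 32000])
```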

📜 Background & Experience

  • Linguistics & Phonetics: My background gives me a unique perspective on NLP. I view language processing as a constraint system rooted in physical production (articulatory phonetics) rather than purely symbolic manipulation.
  • Patent Translation: Extensive experience translating technical patents (English/Korean/Japanese), requiring extreme attention to detail and technical literacy.
  • Global Business: Experience in Duty Free Trade, connecting international suppliers to the Korean market.

📫 Connect

I am currently open to job offers and research collaborations (Remote preferred).

Pinned Repositories

  1. Neuromodulatory-Control-Networks: Neuromodulatory Control Networks (NCNs), a novel LLM architectural modification inspired by the neuromodulatory systems in the vertebrate brain. (Python)

  2. Neural-Speciation: An evolutionary engine for micro-transformers that optimizes initialization hyperparameters. Uses unsupervised clustering to discover distinct "species" of learning strategies that convergently evo…

  3. Phase-Slip-Sampler: An entropy-aware sampling strategy that injects nondestructive noise into the KV Cache to break repetition loops and force semantic state transitions in LLMs. (Python)