Hi, I’m Turner Jabbour. I’ve been a software engineer for ~6 years, primarily working in Node and React. Around September 2025, I became deeply interested in GPU programming, ROCm, and the broader world of low-level performance engineering.
This repository is my space to learn in public as I delve into GPU kernel engineering and inference systems work.
There are three important directories:
- kernels - I explore different kernels and include a writeup of what I learned and how it relates to inference.
- papers - I summarize and discuss different papers.
- topics - I dive deep into a specific topic.
My long-term goal is to build strong competency in HIP, Triton, and AMD’s GPU software stack, with a focus on high-performance inference.
I'm currently revisiting my previous kernels, profiling them, and creating profiling writeups; right now I'm working on my block-level reduction.
I just finished the writeup for my halving reduction.
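Since these reductions come up repeatedly here, a minimal sketch of the halving (tree-reduction) pattern in HIP may help. This is illustrative of the technique, not the exact kernel from this repo; the kernel name, the 256-thread block size, and the float element type are assumptions for the example.

```cpp
#include <hip/hip_runtime.h>

// Halving (tree) reduction within one block: on each iteration, the lower
// half of the active threads adds the upper half's partials in LDS, so the
// active count shrinks 256 -> 128 -> ... -> 1.
// Assumes the kernel is launched with 256 threads per block.
__global__ void block_reduce_sum(const float* in, float* out, int n) {
    __shared__ float lds[256];               // one partial per thread
    const int tid = threadIdx.x;
    const int gid = blockIdx.x * blockDim.x + tid;

    lds[tid] = (gid < n) ? in[gid] : 0.0f;   // pad the tail with zeros
    __syncthreads();

    for (int stride = blockDim.x / 2; stride > 0; stride >>= 1) {
        if (tid < stride)
            lds[tid] += lds[tid + stride];
        __syncthreads();                     // every thread hits the barrier
    }

    if (tid == 0)
        out[blockIdx.x] = lds[0];            // one partial sum per block
}
```

Each launch leaves one partial per block in `out`, so a full sum needs a second pass (or atomics) over those partials.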
Here is everything I've worked on so far, in order:
- RCCL - AMD’s collectives library for multi-GPU communication (AllReduce, AllGather, ReduceScatter, etc.), used heavily in distributed inference (see the AllReduce sketch after this list).
- Triton - a higher-level DSL for writing high-performance kernels, increasingly used in modern inference work (FlashAttention, fused ops, reductions).
- GPU architecture - wavefronts, SIMDs, LDS, VGPRs, vectorized memory access, latency hiding, and the AMDGCN compiler toolchain (a vectorized-load sketch also follows the list).
- Inference systems - PagedAttention, KV-cache management, continuous batching, speculative decoding, and multi-GPU parallelism.
- Profiling - rocprof, occupancy analysis, register pressure.
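For the RCCL entry above, here's a hedged sketch of a single-process AllReduce across two GPUs. RCCL keeps NCCL's API names, but the header path, device count, and buffer size here are assumptions, and error checking is omitted for brevity.

```cpp
#include <hip/hip_runtime.h>
#include <rccl/rccl.h>   // header path may be <rccl.h> depending on install

int main() {
    const int nDev = 2;              // assumes two visible GPUs
    const size_t count = 1 << 20;    // elements per device (arbitrary)
    int devs[nDev] = {0, 1};

    ncclComm_t comms[nDev];
    float* sendbuff[nDev];
    float* recvbuff[nDev];
    hipStream_t streams[nDev];

    // One communicator per device, all owned by this process.
    ncclCommInitAll(comms, nDev, devs);

    for (int i = 0; i < nDev; ++i) {
        hipSetDevice(i);
        hipMalloc(&sendbuff[i], count * sizeof(float));  // left uninitialized;
        hipMalloc(&recvbuff[i], count * sizeof(float));  // a real program fills sendbuff
        hipStreamCreate(&streams[i]);
    }

    // Group the calls so RCCL can launch them together without deadlocking.
    ncclGroupStart();
    for (int i = 0; i < nDev; ++i)
        ncclAllReduce(sendbuff[i], recvbuff[i], count,
                      ncclFloat, ncclSum, comms[i], streams[i]);
    ncclGroupEnd();

    for (int i = 0; i < nDev; ++i) {
        hipSetDevice(i);
        hipStreamSynchronize(streams[i]);  // wait for the reduction
        hipFree(sendbuff[i]);
        hipFree(recvbuff[i]);
        hipStreamDestroy(streams[i]);
        ncclCommDestroy(comms[i]);
    }
    return 0;
}
```

And for the vectorized-memory-access point under GPU architecture, a tiny sketch of a `float4` copy; the kernel name is made up, and it assumes the element count is a multiple of four.

```cpp
#include <hip/hip_runtime.h>

// Each thread moves 16 bytes at a time via float4, which the AMDGCN
// backend can lower to single dwordx4 loads/stores.
// Assumes n is a multiple of 4 (n4 = n / 4); a real kernel handles the tail.
__global__ void copy_vec4(const float4* __restrict__ in,
                          float4* __restrict__ out, int n4) {
    const int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n4)
        out[i] = in[i];
}
```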