This demonstration shows how Strix Halo maintains system responsiveness and prevents CPU memory starvation when running intensive AI workloads alongside CPU-intensive tasks.
```bash
./scripts/install_prerequisites.sh
```

This installs Ollama, Python packages, CMake, and the required LLM model.
```bash
./scripts/memory_qos_demo.sh --duration 600
```

This runs a 10-minute demonstration (600 seconds). Adjust the duration as needed.
```bash
LATEST_CSV=$(ls -t logs/memory_qos_metrics_*.csv | head -1)
python3 scripts/visualize_memory_qos.py --metrics-file "$LATEST_CSV"
```

This generates a visualization showing all metrics across the four phases.
The resulting dashboard summarizes memory QoS effectiveness:
Memory QoS Protection: CPU stays responsive and memory availability is maintained even under heavy AI loads.
The demonstration runs through four phases:
- Baseline: Memory-intensive workload only (CMake builds + Python memory operations)
- Transition: Workload + LLM starting (QoS should activate immediately)
- Contended: Both workloads fully active (QoS should maintain memory availability)
- Recovery: LLM stopped, workload continues (system should recover gracefully)
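The four phases above can be sketched as a simple scheduling plan. This is illustrative only: the equal four-way split and the `plan_phases` helper are assumptions, not the demo script's actual logic.

```python
# Hypothetical sketch of the demo's phase schedule (assumes equal phase lengths).
PHASES = ["baseline", "transition", "contended", "recovery"]

def plan_phases(total_seconds):
    """Split the total run into four equal phases, returning (name, seconds) pairs."""
    per_phase = total_seconds / len(PHASES)
    return [(name, per_phase) for name in PHASES]

# A 600-second run gives four 150-second phases.
for name, secs in plan_phases(600):
    print(f"{name}: {secs:.0f}s")  # the real script starts/stops the LLM and samples metrics here
```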
The demo measures and visualizes:
- **Memory Availability (Primary QoS Metric)**
  - Target: should remain stable throughout all phases
  - Higher is better
  - Demonstrates memory QoS protection
- **Memory Retention by Phase**
  - Target: ≥95% (Excellent), ≥85% (Good)
  - Shows how much memory availability is retained compared to baseline
  - Higher is better
- **Swap Usage**
  - Target: 0% (no swap usage indicates no memory pressure)
  - Lower is better
  - Zero swap usage is a strong indicator of effective QoS
- **CPU Usage**
  - Shows CPU utilization across phases
  - Stability matters more than the absolute percentage
- **System Load Average**
  - Shows system load over 1-, 5-, and 15-minute windows
  - Lower is better
  - Demonstrates system responsiveness
- **I/O Bandwidth**
  - Shows fair sharing of I/O resources
  - Higher is better
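The retention metric can be computed from the metrics CSV roughly as below. The column names `phase` and `mem_available_mb` are assumptions for illustration; the actual schema is defined by the demo script.

```python
import csv
from collections import defaultdict
from io import StringIO

def retention_by_phase(rows):
    """Average per-phase memory availability as a percentage of the baseline average."""
    totals, counts = defaultdict(float), defaultdict(int)
    for row in rows:
        totals[row["phase"]] += float(row["mem_available_mb"])  # assumed column names
        counts[row["phase"]] += 1
    averages = {p: totals[p] / counts[p] for p in totals}
    baseline = averages["baseline"]
    return {p: round(100.0 * v / baseline, 1) for p, v in averages.items()}

# Toy data standing in for logs/memory_qos_metrics_*.csv
sample = StringIO(
    "phase,mem_available_mb\n"
    "baseline,1000\nbaseline,1000\n"
    "contended,900\nrecovery,980\n"
)
retention = retention_by_phase(csv.DictReader(sample))
print(retention)  # {'baseline': 100.0, 'contended': 90.0, 'recovery': 98.0}
```

By this measure, the toy run's contended phase retains 90% of baseline availability, which would rate "Good" against the ≥85% threshold above.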
Strix Halo's memory QoS ensures that:
- CPU traffic gets guaranteed bandwidth allocation
- Memory latency floors are enforced for CPU access
- Foreground work remains interactive even under heavy AI load
- Memory availability is maintained (no starvation)
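Two of these guarantees (memory availability, absence of swap pressure) can be spot-checked from userspace by sampling `/proc/meminfo` on Linux. This is a monitoring sketch under that assumption, not part of the demo itself:

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style lines ("Key:   value kB") into a dict of kB values."""
    info = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        if rest.strip():
            info[key.strip()] = int(rest.split()[0])
    return info

def swap_used_pct(info):
    """Percentage of swap in use; 0% is the demo's target."""
    total = info["SwapTotal"]
    return 0.0 if total == 0 else 100.0 * (total - info["SwapFree"]) / total

# In practice: info = parse_meminfo(open("/proc/meminfo").read())
sample = (
    "MemTotal:       1000 kB\n"
    "MemAvailable:    600 kB\n"
    "SwapTotal:       500 kB\n"
    "SwapFree:        400 kB\n"
)
info = parse_meminfo(sample)
print(info["MemAvailable"], swap_used_pct(info))  # 600 20.0
```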
This is what enables the "always-on AI PC" experience: AI assistants you don't have to shut down when you need to do real work.
- Memory QoS Demo Documentation: Complete guide to the demo, workloads, and metrics
- Installation Guide: Detailed installation instructions
- Ollama not found: Ensure Ollama is installed and in PATH
- Model not available: Run `ollama pull codellama:7b` first
- Python packages missing: Run `pip install -r requirements.txt`
- CMake not found: Install build tools: `sudo apt-get install cmake build-essential`
- Project path not found: Ensure `test-projects/json` exists, or use `--project-path` to specify a different path
See Troubleshooting Guide for more details.
Contributions, issues, and feature requests are welcome! Feel free to check the existing issues or open a new one.
This demo shows absolute performance characteristics of the developer's Strix Halo hardware. All measurements are taken on Strix Halo systems only. This suite makes no direct comparisons or competitive claims.
This project is licensed under the MIT License - see the LICENSE file for details.
