@@ -13,6 +13,7 @@ A GPU-driven system framework for scalable AI applications.
1313| Unit Tests (ROCm) | [ ![ Unit Tests (ROCm)] ( https://github.com/microsoft/ark/actions/workflows/ut-rocm.yml/badge.svg?branch=main )] ( https://github.com/microsoft/ark/actions/workflows/ut-rocm.yml ) |
1414
1515* NOTE (Nov 2023): ROCm unit tests will be replaced into an Azure pipeline in the future.*
16+
1617* NOTE (Dec 2023): ROCm unit tests are failing due to the nodes' issue. This will be fixed soon.*
1718
1819See [ Quick Start] ( docs/quickstart.md ) to quickly get started.
@@ -29,18 +30,20 @@ ARK provides a set of APIs for users to express their distributed deep learning
2930
3031ARK is under active development and a part of its features will be added in a future release. The following describes key features of each version.
3132
32- ### New in ARK v0.4 (Latest Release)
33+ ### New in ARK v0.5 (Latest Release)
3334
34- * Support AMD GPUs (CDNA2, single-GPU only)
35- * Add high-performance AllReduce & AllGather algorithms with MSCCL++
36- * Fix major bugs in the scheduler
35+ * Integrate with [ MSCCL++] ( https://github.com/microsoft/mscclpp )
36+ * Removed dependency on ` gpudma `
37+ * Add AMD CDNA3 architecture support
38+ * Support communication for AMD GPUs
39+ * Optimize OpGraph scheduling
40+ * Add a multi-GPU Llama2 example
3741
38- See details from https://github.com/microsoft/ark/issues/137 .
42+ See details from https://github.com/microsoft/ark/issues/168 .
3943
40- ### ARK v0.5 (TBU, Dec. 2023 )
44+ ### ARK v0.6 (TBU, Jan. 2024 )
4145
42- * Multi-GPU support for AMD GPUs
43- * Add multi-GPU LLM examples
46+ * Overall performance optimization
4447* Improve Python unit tests & code coverage
4548
4649## Contributing
0 commit comments