Skip to content

Commit 1762798

Browse files
authored
v0.5.0 (#183)
1 parent 97cd329 commit 1762798

File tree

11 files changed

+39
-33
lines changed

11 files changed

+39
-33
lines changed

.vscode/c_cpp_properties.json

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
{
2+
"configurations": [
3+
{
4+
"name": "Linux",
5+
"includePath": [
6+
"${workspaceFolder}/**",
7+
"/usr/local/cuda/include",
8+
"/opt/rocm/include"
9+
]
10+
}
11+
],
12+
"version": 4
13+
}

CITATION.cff

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
cff-version: 1.2.0
22
title: "ARK: A GPU-driven system framework for scalable AI applications"
3-
version: 0.4.1
3+
version: 0.5.0
44
message: >-
55
If you use this project in your research, please cite it as below.
66
authors:

CMakeLists.txt

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,8 @@
22
# Licensed under the MIT license.
33

44
set(ARK_MAJOR "0")
5-
set(ARK_MINOR "4")
6-
set(ARK_PATCH "1")
5+
set(ARK_MINOR "5")
6+
set(ARK_PATCH "0")
77

88
set(ARK_VERSION "${ARK_MAJOR}.${ARK_MINOR}.${ARK_PATCH}")
99
set(ARK_SOVERSION "${ARK_MAJOR}.${ARK_MINOR}")

README.md

Lines changed: 11 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ A GPU-driven system framework for scalable AI applications.
1313
| Unit Tests (ROCm) | [![Unit Tests (ROCm)](https://github.com/microsoft/ark/actions/workflows/ut-rocm.yml/badge.svg?branch=main)](https://github.com/microsoft/ark/actions/workflows/ut-rocm.yml) |
1414

1515
*NOTE (Nov 2023): ROCm unit tests will be replaced into an Azure pipeline in the future.*
16+
1617
*NOTE (Dec 2023): ROCm unit tests are failing due to the nodes' issue. This will be fixed soon.*
1718

1819
See [Quick Start](docs/quickstart.md) to quickly get started.
@@ -29,18 +30,20 @@ ARK provides a set of APIs for users to express their distributed deep learning
2930

3031
ARK is under active development and a part of its features will be added in a future release. The following describes key features of each version.
3132

32-
### New in ARK v0.4 (Latest Release)
33+
### New in ARK v0.5 (Latest Release)
3334

34-
* Support AMD GPUs (CDNA2, single-GPU only)
35-
* Add high-performance AllReduce & AllGather algorithms with MSCCL++
36-
* Fix major bugs in the scheduler
35+
* Integrate with [MSCCL++](https://github.com/microsoft/mscclpp)
36+
* Removed dependency on `gpudma`
37+
* Add AMD CDNA3 architecture support
38+
* Support communication for AMD GPUs
39+
* Optimize OpGraph scheduling
40+
* Add a multi-GPU Llama2 example
3741

38-
See details from https://github.com/microsoft/ark/issues/137.
42+
See details from https://github.com/microsoft/ark/issues/168.
3943

40-
### ARK v0.5 (TBU, Dec. 2023)
44+
### ARK v0.6 (TBU, Jan. 2024)
4145

42-
* Multi-GPU support for AMD GPUs
43-
* Add multi-GPU LLM examples
46+
* Overall performance optimization
4447
* Improve Python unit tests & code coverage
4548

4649
## Contributing

ark/gpu/gpu_mem.cc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ GpuMem::GpuMem(size_t bytes) { this->init(bytes); }
2525
GpuMem::GpuMem(const GpuMem::Info &info) { this->init(info); }
2626

2727
//
28-
void GpuMem::init(size_t bytes, bool expose) {
28+
void GpuMem::init(size_t bytes, [[maybe_unused]] bool expose) {
2929
if (bytes == 0) {
3030
ERR(InvalidUsageError, "Tried to allocate zero byte.");
3131
}

ark/include/ark.h

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,8 +10,8 @@
1010
#include <vector>
1111

1212
#define ARK_MAJOR 0
13-
#define ARK_MINOR 4
14-
#define ARK_PATCH 1
13+
#define ARK_MINOR 5
14+
#define ARK_PATCH 0
1515
#define ARK_VERSION (ARK_MAJOR * 10000 + ARK_MINOR * 100 + ARK_PATCH)
1616

1717
namespace ark {

cmake/Utils.cmake

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,10 +24,10 @@ endif()
2424
find_program(BLACK black)
2525
if(BLACK)
2626
add_custom_target(pylint
27-
COMMAND python3.8 -m black --check --config ${PROJECT_SOURCE_DIR}/pyproject.toml ${PROJECT_SOURCE_DIR}
27+
COMMAND python3 -m black --check --config ${PROJECT_SOURCE_DIR}/pyproject.toml ${PROJECT_SOURCE_DIR}
2828
)
2929
add_custom_target(pylint-autofix
30-
COMMAND python3.8 -m black --config ${PROJECT_SOURCE_DIR}/pyproject.toml ${PROJECT_SOURCE_DIR}
30+
COMMAND python3 -m black --config ${PROJECT_SOURCE_DIR}/pyproject.toml ${PROJECT_SOURCE_DIR}
3131
)
3232
else()
3333
message(STATUS "black not found.")

docs/install.md

Lines changed: 3 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -2,22 +2,12 @@
22

33
## Prerequisites
44

5-
* Linux kernel >= 4.15.0
6-
7-
- If you have a lower version, you can upgrade it via:
8-
```bash
9-
sudo apt-get update
10-
sudo apt-get install -y linux-image-4.15.0-13-generic linux-header-4.15.0-13-generic
11-
```
12-
135
* CMake >= 3.25.0 and Python >= 3.8
146

157
* Supported GPUs
168
- NVIDIA GPUs: Volta (CUDA >= 11.1) / Ampere (CUDA >= 11.1) / Hopper (CUDA >= 12.0)
179
- Hopper support will be added in the future.
18-
- AMD GPUs: CDNA2 (ROCm >= 5.0) / CDNA3
19-
- Multi-GPU execution is not yet supported for AMD GPUs and will be supported by a future release.
20-
- CDNA3 support will be added in the future.
10+
- AMD GPUs: CDNA2 (ROCm >= 5.7) / CDNA3 (ROCm >= 5.7)
2111

2212
* Mellanox OFED
2313

@@ -28,9 +18,9 @@ We currently provide only *base images* for ARK, which contain all the dependenc
2818
You can pull a base image as follows.
2919
```
3020
# For NVIDIA GPUs
31-
docker pull ghcr.io/microsoft/ark/ark:base-dev-cuda12.1
21+
docker pull ghcr.io/microsoft/ark/ark:base-dev-cuda12.2
3222
# For AMD GPUs
33-
docker pull ghcr.io/microsoft/ark/ark:base-dev-rocm5.6
23+
docker pull ghcr.io/microsoft/ark/ark:base-dev-rocm5.7
3424
```
3525

3626
Check [ARK containers](https://github.com/microsoft/ark/pkgs/container/ark%2Fark) for all available Docker images.

docs/sphinx/source/conf.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,8 +20,8 @@
2020
project = "ARK"
2121
copyright = "2023, ARK Team"
2222
author = "ARK Team"
23-
version = "0.4.1"
24-
release = "0.4.1"
23+
version = "0.5.0"
24+
release = "0.5.0"
2525

2626
# -- General configuration ---------------------------------------------------
2727

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "scikit_build_core.build"
44

55
[project]
66
name = "ark"
7-
version = "0.4.1"
7+
version = "0.5.0"
88

99
[tool.scikit-build]
1010
cmake.minimum-version = "3.25"

0 commit comments

Comments
 (0)