Fix shared memory format and add CUDA matrix multiplication example #10

Alessandro624 · 2025-12-30T15:33:20Z

Description

This PR fixes an issue in the device properties output related to shared memory formatting and introduces a complete CUDA matrix multiplication example, including build, execution, and profiling support.

Key changes

Fixed the shared memory output format in the device properties display to improve correctness and readability.
Added a new matrix_multiplication/ module featuring:
- matrixMul.cu: CUDA implementation of matrix multiplication.
- Makefile to simplify compilation.
- run.sh for easy execution.
- profile_nvprof.sh to collect performance metrics via NVIDIA profiling tools.
- README documenting usage, build steps, and profiling workflow.

Impact

This PR combines a small but important correctness fix with a practical CUDA example that can be used as a benchmark or learning reference. It strengthens the project’s focus on GPU performance analysis by pairing executable code with reproducible profiling scripts.

Alessandro624 added 6 commits December 30, 2025 16:23

Fix shared memory output format in device properties display

8aa12d4

Add matrix_multiplication/Makefile

82de151

Add matrix_multiplication/README

18af729

Add matrix_multiplication/matrixMul.cu

894aef1

Add matrix_multiplication/run.sh

6f9eb63

Add matrix_multiplication/profile_nvprof.sh

79a1eb2

Alessandro624 self-assigned this Dec 30, 2025

Alessandro624 added documentation Improvements or additions to documentation enhancement New feature or request labels Dec 30, 2025

Alessandro624 merged commit 05d73a3 into dev Dec 30, 2025
1 check passed

Alessandro624 deleted the matrix-mul branch December 30, 2025 15:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix shared memory format and add CUDA matrix multiplication example #10

Fix shared memory format and add CUDA matrix multiplication example #10

Uh oh!

Alessandro624 commented Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Fix shared memory format and add CUDA matrix multiplication example #10

Fix shared memory format and add CUDA matrix multiplication example #10

Uh oh!

Conversation

Alessandro624 commented Dec 30, 2025

Description

Key changes

Impact

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant