@Alessandro624
Owner

Overview:

This PR adds two pieces to the CUDA section of the project: a device introspection module (deviceSpec) and a full matrix–vector multiplication example, complete with build, execution, and profiling workflows. Together they are meant to improve observability, portability, and developer onboarding.

Key Highlights:

  • Added dedicated README.md files for deviceSpec and matrix–vector multiplication to streamline onboarding.
  • Introduced Makefiles for reproducible, configuration-free builds.
  • Added run.sh scripts and a profiling script to support automated execution and quick performance analysis.
  • Cleaned up the documentation, including fixing the example links.
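
For readers unfamiliar with the two additions, here is a minimal sketch of what device introspection and a matrix–vector multiplication kernel can look like in CUDA. It is not the code merged in this PR: the function names (`print_device_specs`, `matvec`, `matvec_matches_reference`), the `CUDA_CHECK` macro, the row-major layout, and the one-thread-per-row mapping are all assumptions made for illustration. Only the CUDA runtime calls themselves (`cudaGetDeviceCount`, `cudaGetDeviceProperties`, `cudaMalloc`, `cudaMemcpy`, `cudaFree`) are standard API.

```cuda
#include <cmath>
#include <cstdio>
#include <cstdlib>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: abort with a message if a CUDA runtime call fails.
#define CUDA_CHECK(call)                                                \
    do {                                                                \
        cudaError_t err_ = (call);                                      \
        if (err_ != cudaSuccess) {                                      \
            std::fprintf(stderr, "CUDA error: %s (%s:%d)\n",            \
                         cudaGetErrorString(err_), __FILE__, __LINE__); \
            std::exit(1);                                               \
        }                                                               \
    } while (0)

// Device introspection: print a few properties of every visible GPU.
void print_device_specs() {
    int count = 0;
    CUDA_CHECK(cudaGetDeviceCount(&count));
    for (int d = 0; d < count; ++d) {
        cudaDeviceProp p;
        CUDA_CHECK(cudaGetDeviceProperties(&p, d));
        std::printf("Device %d: %s, compute capability %d.%d, %d SMs, "
                    "%.1f GiB global memory, max %d threads per block\n",
                    d, p.name, p.major, p.minor, p.multiProcessorCount,
                    p.totalGlobalMem / (1024.0 * 1024.0 * 1024.0),
                    p.maxThreadsPerBlock);
    }
}

// y = A * x, with A stored row-major (rows x cols). One thread per row.
__global__ void matvec(const float* A, const float* x, float* y,
                       int rows, int cols) {
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < rows) {
        float acc = 0.0f;
        for (int j = 0; j < cols; ++j) {
            acc += A[row * cols + j] * x[j];
        }
        y[row] = acc;
    }
}

// Run the kernel on small test data and compare against a CPU loop.
bool matvec_matches_reference(int rows, int cols) {
    std::vector<float> A(static_cast<size_t>(rows) * cols), x(cols), y(rows);
    for (size_t i = 0; i < A.size(); ++i) A[i] = (i % 7) * 0.5f;
    for (int j = 0; j < cols; ++j) x[j] = (j % 5) * 0.25f;

    float *dA = nullptr, *dx = nullptr, *dy = nullptr;
    CUDA_CHECK(cudaMalloc(&dA, A.size() * sizeof(float)));
    CUDA_CHECK(cudaMalloc(&dx, x.size() * sizeof(float)));
    CUDA_CHECK(cudaMalloc(&dy, y.size() * sizeof(float)));
    CUDA_CHECK(cudaMemcpy(dA, A.data(), A.size() * sizeof(float),
                          cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemcpy(dx, x.data(), x.size() * sizeof(float),
                          cudaMemcpyHostToDevice));

    const int threads = 256;
    const int blocks = (rows + threads - 1) / threads;  // ceil(rows / threads)
    matvec<<<blocks, threads>>>(dA, dx, dy, rows, cols);
    CUDA_CHECK(cudaGetLastError());
    // This blocking copy also waits for the kernel to finish.
    CUDA_CHECK(cudaMemcpy(y.data(), dy, y.size() * sizeof(float),
                          cudaMemcpyDeviceToHost));

    bool ok = true;
    for (int i = 0; i < rows; ++i) {
        float ref = 0.0f;
        for (int j = 0; j < cols; ++j) {
            ref += A[static_cast<size_t>(i) * cols + j] * x[j];
        }
        if (std::fabs(ref - y[i]) > 1e-4f * (1.0f + std::fabs(ref))) ok = false;
    }
    CUDA_CHECK(cudaFree(dA));
    CUDA_CHECK(cudaFree(dx));
    CUDA_CHECK(cudaFree(dy));
    return ok;
}

int main() {
    print_device_specs();
    bool ok = matvec_matches_reference(1000, 256);
    std::printf("matvec check: %s\n", ok ? "passed" : "FAILED");
    return ok ? 0 : 1;
}
```

A file like this can be compiled with `nvcc` (for example `nvcc sketch.cu -o sketch`, where the file name is a placeholder). The one-thread-per-row mapping is the simplest correct layout; it is not necessarily the fastest, and the example in this PR may be organized differently.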

@Alessandro624 Alessandro624 self-assigned this Dec 2, 2025
@Alessandro624 Alessandro624 added the documentation and enhancement labels Dec 2, 2025
@Alessandro624 Alessandro624 merged commit cd0aa0b into main Dec 2, 2025
2 checks passed