@seberg seberg commented Feb 12, 2025

This is a work in progress: as of opening, it does not work yet and will definitely need polishing after discussion.

The idea is to use legate image constraints for shuffling. This is likely not faster, but it simplifies things and may make it easier to add steps that offload to the CPU (with legate handling all data movement for that as well).

EDIT: Current legate has issues with mixing 2-D and 1-D stores in constraints, but this approach requires it. So this cannot work until that is fixed (since there is a broadcast constraint, one can argue that the relevant dimensionality here is actually 1-D throughout).
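
For readers unfamiliar with the concept, here is a rough pure-Python sketch of what an image-style partition means for a shuffle. It does not use the legate API and is not the code in this PR; the function names (`image_partitions`, `shuffle_by_image`) and the sample data are hypothetical. The assumption is that each destination chunk holds row indices into the source, and the "image" of a chunk is the set of source rows it points at, which is what the runtime would need to move.

```python
# Illustrative only: plain Python, not legate. All names here are hypothetical.

def image_partitions(index_chunks):
    """For each chunk of row indices, return the sorted set of source rows it references.

    This mimics the idea of an image partition: the partition of the source
    is derived from the indices stored in each destination chunk.
    """
    return [sorted(set(chunk)) for chunk in index_chunks]


def shuffle_by_image(source_rows, index_chunks):
    """Gather, for each destination chunk, the source rows it asks for, in order."""
    return [[source_rows[i] for i in chunk] for chunk in index_chunks]


source = ["a", "b", "c", "d", "e", "f"]
# Two destination chunks, each listing the source rows it wants.
index_chunks = [[4, 0, 2], [1, 5, 3]]

needed = image_partitions(index_chunks)            # rows each chunk depends on
shuffled = shuffle_by_image(source, index_chunks)  # the shuffled result per chunk
```

In this toy version the gather is explicit; the point of the approach described above is that the runtime would derive the data movement from the index store instead.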

Signed-off-by: Sebastian Berg <sebastianb@nvidia.com>