Skip to content

Conversation

@rkarim2
Copy link
Contributor

@rkarim2 rkarim2 commented Sep 1, 2022

Replaces the thrust call with a work efficient scan implementation.

…ient parallel scan) to optimize performance especially in scan along axis for ND-arrays.

Implementation not finished, likely buggy and also need to add NAN handling variant.
The optimized scan builds and runs without errors.
nancumsum and nancumprod still are still thrust based and need to be updated.
…n can be slightly faster on 1D)

Minor bugfixes and adding error checks.
@rkarim2 rkarim2 added the category:improvement PR introduces an improvement and will be classified as such in release notes label Sep 1, 2022
@rkarim2 rkarim2 requested a review from magnatelee September 1, 2022 23:40
@manopapad manopapad changed the base branch from branch-22.10 to branch-22.12 September 30, 2022 21:57
@marcinz marcinz changed the base branch from branch-22.12 to branch-23.03 January 26, 2023 01:00
@marcinz marcinz changed the base branch from branch-23.03 to branch-23.05 March 6, 2023 20:47
@marcinz marcinz changed the base branch from branch-23.05 to branch-23.07 May 18, 2023 20:30
@marcinz marcinz changed the base branch from branch-23.07 to branch-23.09 July 18, 2023 15:44
@marcinz marcinz changed the base branch from branch-23.09 to branch-23.11 September 26, 2023 00:38
@marcinz marcinz changed the base branch from branch-23.11 to branch-24.01 November 9, 2023 17:14
@marcinz marcinz changed the base branch from branch-24.01 to branch-24.03 February 22, 2024 01:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category:improvement PR introduces an improvement and will be classified as such in release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant