-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider support for segmented reductions and sorts specified by count-value representation #676
Comments
IMO a "RangeOfRanges" interface is preferable to counts because:
It's arguable that RangeOfRanges is more prone to error than a count interface, but I think that can be solved by providing convenience wrappers for some index to RoR transformations, such as the one from scanned counts to RoR. Note that there is some precedent for the scanned count representation in the context of sparse matrices. The "compressed sparse row" (CSR) format uses an array of N + 1 integers to describe the beginning and end of each sparse row. |
I agree with Nathan, I think the more flexible solution is preferable. |
It's true that a range-of-ranges formulation would be more general, but I think that this is a case where Worse is Better. For the purposes of discussion, let's assume that the range-of-ranges proposal looks like
Where One of my objections to
For any other backend,
I'm not sure how this algorithm adds value in any other context. It's true that it would eliminate quite a bit of verbosity, but so would
In general I prefer an orthogonal interface, i.e. a small set of composable primitives (rather than the cross product thereof). I guess my primary reasons for preferring
|
Consider this API:
Why a count-value structure?
and for the backend to target.
Why not a pairs of offsets, or range of ranges, or similar structure?
Because it forces the user to specify redundant data in the common case (continguous, non-overlapping values). It's easy to mess this up. There's no possibility of this kind of error with a count-value format (unless the user just gets the counts wrong). Indirection could be achieved as normal with a
permutation_iterator
.Doesn't the count-value format imply that the implementation of each algorithm first has to perform an exclusive scan?
Under what conditions would an initial scan be expensive compared to the operation being performed?
If we preferred a different representation more convenient for the backend, would that simply force the user to perform the scan himself?
The text was updated successfully, but these errors were encountered: