-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider support for vectorized binary search with sorted needles #691
Comments
The above trick won't work because both the haystack and the needles need to be sorted wrt to the same compare (like, say, |
It seems difficult to expose this functionality without inventing a new name for it.
D's standard library has no binary search functions analogous to C++ (or functions for doing many searches in bulk like Thrust). Instead, it has a single
It doesn't seem to have a bulk "find these needles in this haystack" algorithm. |
The structure of this problem is very similar to the set operations: both consume two sorted lists and produce a single result. In particular, this operation is close to Moreover, it seems likely that this algorithm often would be applied in contexts where the data would also be consumed by the set algorithms. How about
In this formulation, the On the other hand, if the result was used to actually do the hypothetical insertion, it would not necessarily be equivalent to the output of any of the standard set algorithms. For example, the insertion would not produce the result of |
There's also
I don't feel like this name really captures what's going on here. |
Here are some ideas from Michael:
|
|
My code uses lower_bound and upper_bound with needles sorted all the time, so I'm definitely excited to hear that there might be further performance to be gained. For what it's worth, I think that lower_merge_rank/upper_merge_rank make sense given the explanation you quoted above. [off-topic] I really like the search functions using arguments named using 'needles' and 'haystack' Are you considering using those names in the Thrust source code? |
I came across the "needles" and "haystack" nomenclature in The D Programming Language. Dunno where it originated. We'd probably use it if we add this algorithm. |
…cent_difference Port adjacent difference into CUB
When the needles are sorted, we can significantly accelerate the search.
To indicate that the needles are sorted, we can receive an additional
Compare
parameter:The text was updated successfully, but these errors were encountered: