Remove query-time usage of ByteSequence::slice in PQVectors to reduce object allocations #403

michaeljmarshall · 2025-03-20T20:21:51Z

With #370, I introduced the ByteSequence::slice method and started using it on the query path. The PR replaces long lived pq vector ByteSequences with short lived slices of larger ByteSequences created at query time. However, profiling reveals that these objects get created very often for certain workloads, especially brute force workloads in Cassandra. After analyzing the code, it looks trivial to migrate to a solution where we do not create a slice object per access to a pq vector. The main downside is the addition of an offset and a length argument to many methods and the fact that these changes feel brittle, if we want to maintain the slice() construct, which I think we do for the simplicity of non-query path code. However, given the performance implications and the fact that the interface changes are all internal to jvector, I think this change is worth considering.

…ations

michaeljmarshall · 2025-03-20T20:35:48Z

I keep seeing failures for:

Test2DThreshold.testThreshold20k:44->testThreshold:74 visited 75.38348% of the vectors, which is more than 75.0%

This doesn't seem related and feels potentially flaky since the tests pass on retry.

jkni · 2025-03-21T18:12:39Z

Looks reasonable to me -- I agree that this is a bit brittle and suggests that we might want to revisit the API in the future, but I wouldn't call it disproportionately awkward/brittle compared to other performance-based sacrifices we've made around these code paths.

marianotepper · 2025-03-21T18:26:40Z

I agree the comments above and the PR looks good to me. I suggest we open an issue about the interface so that we do not forget.

Based on datastax/jvector#403

Remove query-time usage of ByteSequence::slice to reduce object alloc…

46fc0d7

…ations

michaeljmarshall requested review from jbellis, jkni and marianotepper March 20, 2025 20:21

michaeljmarshall self-assigned this Mar 20, 2025

marianotepper requested review from jkni and removed request for jbellis and jkni March 20, 2025 21:14

jkni approved these changes Mar 21, 2025

View reviewed changes

marianotepper approved these changes Mar 21, 2025

View reviewed changes

michaeljmarshall added a commit to datastax/cassandra that referenced this pull request Mar 21, 2025

Bump jvector for ByteSequence optimization

d88a0f8

Based on datastax/jvector#403

marianotepper merged commit 0962ddb into datastax:main Mar 21, 2025
6 checks passed

michaeljmarshall deleted the reduce-bytesequence-slice-usage-on-query-path branch March 21, 2025 21:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove query-time usage of ByteSequence::slice in PQVectors to reduce object allocations #403

Remove query-time usage of ByteSequence::slice in PQVectors to reduce object allocations #403

michaeljmarshall commented Mar 20, 2025

michaeljmarshall commented Mar 20, 2025

jkni commented Mar 21, 2025

marianotepper commented Mar 21, 2025

Remove query-time usage of ByteSequence::slice in PQVectors to reduce object allocations #403

Remove query-time usage of ByteSequence::slice in PQVectors to reduce object allocations #403

Conversation

michaeljmarshall commented Mar 20, 2025

michaeljmarshall commented Mar 20, 2025

jkni commented Mar 21, 2025

marianotepper commented Mar 21, 2025