-
-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refactor(rust): Add updated multiscan pipeline #21925
Conversation
@@ -75,8 +77,10 @@ impl ApplyExtraOps { | |||
scan_source_idx, | |||
hive_parts, | |||
} => { | |||
// This should always be pushed to the reader, or otherwise handled separately. | |||
assert!(pre_slice.is_none()); | |||
// Negative slice should have been resolved earlier. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added support for slice in post-apply, as external readers may not support ReaderCapabilities::PRE_SLICE
.
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #21925 +/- ##
==========================================
- Coverage 80.83% 80.46% -0.37%
==========================================
Files 1629 1635 +6
Lines 235097 236228 +1131
Branches 2693 2693
==========================================
+ Hits 190039 190080 +41
- Misses 44424 45515 +1091
+ Partials 634 633 -1 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
422ee66
to
0396e3c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks good. Bit difficult to say much more when it is not connected yet.
PR contains an updated multiscan pipeline implementation. This is not hooked up yet - it is currently missing the IR lowering and refactoring of the individual readers to use the updated pipeline.
Some improvements include:
<T: MultiScanable>
withdyn FileReader
ReaderCapabilities
(slice/predicate)row_position_on_end
The effects of these optimizations differ across file types - I will have more concrete benchmark numbers for them per file type in follow-up PRs.
@coastalwhite