Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement : -files0-from (#378) #509

Merged
merged 39 commits into from
Apr 2, 2025
Merged

Implement : -files0-from (#378) #509

merged 39 commits into from
Apr 2, 2025

Conversation

Crypto-Darth
Copy link
Contributor

Implements : #378

Huge Thanks to @hanbings for the implementation guide.

@Crypto-Darth
Copy link
Contributor Author

I will add more tests as soon as possible.

Copy link

codecov bot commented Mar 22, 2025

Codecov Report

Attention: Patch coverage is 78.94737% with 12 lines in your changes missing coverage. Please review.

Project coverage is 87.47%. Comparing base (cb83b6d) to head (f3ec65c).
Report is 1 commits behind head on main.

Files with missing lines Patch % Lines
src/find/mod.rs 41.66% 5 Missing and 2 partials ⚠️
src/find/matchers/mod.rs 88.88% 0 Missing and 5 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #509      +/-   ##
==========================================
- Coverage   87.55%   87.47%   -0.08%     
==========================================
  Files          31       31              
  Lines        6715     6770      +55     
  Branches      300      305       +5     
==========================================
+ Hits         5879     5922      +43     
- Misses        623      628       +5     
- Partials      213      220       +7     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@hanbings hanbings linked an issue Mar 23, 2025 that may be closed by this pull request
@Crypto-Darth Crypto-Darth marked this pull request as ready for review March 23, 2025 12:52
@sylvestre sylvestre requested a review from Copilot March 23, 2025 13:27
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR implements support for the "-files0-from" argument, allowing the program to read starting points from either stdin or a specified file, in line with GNU findutils behavior. Key changes include:

  • Adding the "from_file" field to the configuration structure.
  • Integrating logic in "parse_args" to handle "-files0-from" for both piped input and file-based input.
  • Adding tests in both the core module and integration tests along with an updated Cargo.toml dependency.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File Description
src/find/mod.rs Added support for "-files0-from" argument and file/stdi management.
tests/find_cmd_tests.rs Introduced new tests validating "-files0-from" behavior under various conditions.
Cargo.toml Added the "atty" dependency required for checking the STDIN state.
Comments suppressed due to low confidence (1)

src/find/mod.rs:168

  • [nitpick] The error message for an empty starting point in the file branch includes an extra phrase ("OR File is empty") compared to the STDIN branch. Consider unifying the messages to improve consistency.
return Err("Empty starting point detected in -files0-from input OR File is empty".into());

@Crypto-Darth
Copy link
Contributor Author

I have migrated the tests so as to avoid using panic! ,works as intended.
Please review @sylvestre .

Thank you.

@Crypto-Darth Crypto-Darth requested a review from sylvestre March 24, 2025 06:11
@Crypto-Darth Crypto-Darth requested a review from sylvestre March 25, 2025 16:27
@Crypto-Darth Crypto-Darth requested a review from tavianator March 25, 2025 19:53
@Crypto-Darth
Copy link
Contributor Author

Crypto-Darth commented Apr 1, 2025

@tavianator I'm still clueless on how should I go on to fix testcase 2 , my solution above won't work
Would request your suggestion.

@tavianator
Copy link
Contributor

@tavianator I'm still clueless on how should I go on to fix testcase 2 , my solution above won't work Would request your suggestion.

So looking more at how GNU find works here, I think what you should do is: when you see -files0-from, save the argument in the Config, but don't open/read it yet. Then, after parsing everything, if you saved a -files0-from argument, go read it and add all the root paths. At that point you can also check if any other paths were already given, and give an error instead.

The reason I suggest this is that only the last -files0-from argument actually takes effect:

tavianator@graphene $ printf a > a
tavianator@graphene $ printf b > b
tavianator@graphene $ find -files0-from a -files0-from b
b

This is intentional by GNU find: https://savannah.gnu.org/bugs/?66965

@Crypto-Darth
Copy link
Contributor Author

So looking more at how GNU find works here, I think what you should do is: when you see -files0-from, save the argument in the Config, but don't open/read it yet. Then, after parsing everything, if you saved a -files0-from argument, go read it and add all the root paths. At that point you can also check if any other paths were already given, and give an error instead.

@tavianator ,
I think I found a better way by writing to the new paths directly via insert() rather than get_and_insert().
Therefore if there's more than 1 usage of -files0-from the last one will be considered.

image


Regarding the original Issue (Test 2) is now also solved.

image

Please review the latest commit .
Thank you.

@Crypto-Darth Crypto-Darth requested a review from tavianator April 1, 2025 14:36
@tavianator
Copy link
Contributor

@tavianator ,
I think I found a better way by writing to the new paths directly via insert() rather than get_and_insert().
Therefore if there's more than 1 usage of -files0-from the last one will be considered.

That would almost work, but you also have to ensure that

$ find -files0-from - -files0-from file

doesn't read from stdin at all, so I think it's easiest to delay reading the file.

@Crypto-Darth
Copy link
Contributor Author

Crypto-Darth commented Apr 1, 2025

@tavianator ,
I think I found a better way by writing to the new paths directly via insert() rather than get_and_insert().
Therefore if there's more than 1 usage of -files0-from the last one will be considered.

That would almost work, but you also have to ensure that

$ find -files0-from - -files0-from file

doesn't read from stdin at all, so I think it's easiest to delay reading the file.


To be really honest that is a very edge case , I don't see any user or a script using -files0-from twice but I understand the need for maximum compatibility hence, I have fixed it as per your suggestion in latest commit.
@tavianator
image

.collect();
// empty starting point checker
if string_segments.iter().any(|s| s.is_empty()) {
println!("find: invalid zero-length file name");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be eprintln!(), and you should also set the exit code to 1

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Any clue on how can I set the exit code to 1?
I don't see how current implementation allows me to set custom exit code without exiting in between of execution.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like it would take more refactoring to make that work nicely. Feel free to defer it to a future PR

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Surely.

@tavianator
Copy link
Contributor

This LGTM now except for those few nits. Sorry for so much back and forth, it's a surprisingly finicky feature!

@Crypto-Darth
Copy link
Contributor Author

This LGTM now except for those few nits. Sorry for so much back and forth, it's a surprisingly finicky feature!

No worries :)
Really got to learn a lot.
Thanks a lot to you for your time and patience :)
Thanks to @sylvestre and @tertsdiepraam too for their time and reviews.

@Crypto-Darth
Copy link
Contributor Author

@hanbings @sylvestre ,
Can we get this merged now if all good to go ? :)

@hanbings hanbings merged commit 1a40442 into uutils:main Apr 2, 2025
18 of 20 checks passed
@hanbings
Copy link
Collaborator

hanbings commented Apr 2, 2025

This is truly amazing work! Many thanks to all the contributors!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement -files0-from
5 participants