Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow the compression level to be set in zip files #252

Merged
merged 2 commits into from
Jun 6, 2023

Conversation

jjhelmus
Copy link
Contributor

@jjhelmus jjhelmus commented Feb 10, 2023

Description

Since Python 3.7 the Python zipfile module has allowed setting the
default compression level in zipfiles using the compresslevel
argument.

Pass the compression level specified on the command line to the ZipFile class.

Checklist - did you ...

  • Add a file to the news directory (using the template) for the next release's release notes?
  • Add / update necessary tests?
  • Add / update outdated documentation?

Since Python 3.7 the Python zipfile module has allowed setting the
default compression level in zipfiles using the `compresslevel`
argument.
@conda-bot conda-bot added the cla-signed [bot] added once the contributor has signed the CLA label Feb 10, 2023
@jjhelmus jjhelmus marked this pull request as ready for review February 10, 2023 19:16
@jjhelmus jjhelmus requested a review from a team as a code owner February 10, 2023 19:16
@jjhelmus
Copy link
Contributor Author

A basic comparison of compression size vs time:

conda create -p ./to_pack python=3.11
conda-pack -p ./to_pack -o test_0.zip --compress-level=0 --force
conda-pack -p ./to_pack -o test_4.zip --compress-level=4 --force
conda-pack -p ./to_pack -o test_9.zip --compress-level=9 --force
conda-pack -p ./to_pack -o test_default.zip --force
level time (seconds) size (MB)
0 1.5 163
4 3.4 62
9 13.3 61
default (4) 3.5 62

Copy link
Member

@jezdez jezdez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little uneasy adding this without a test, do you know if Python has sensible unit tests for the compression level?

@@ -75,8 +75,7 @@ def build_parser():
default=4,
help=("The compression level to use, from 0 to 9. "
"Higher numbers decrease output file size at "
"the expense of compression time. Ignored for "
"``format='zip'``. Default is 4."))
"the expense of compression time. Default is 4."))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's mention that 9 is the highest value accepted for zip files using zlib.

@jjhelmus
Copy link
Contributor Author

I need to think a bit about a strategy for testing this. The compression level is not recorded in the resulting zip file so there is no way to validate what level was used. Comparing the sizes of different level could work as a heuristic but there is no guarantee that higher compression level will reduce the file size.

Copy link
Member

@jezdez jezdez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thinking about this more, I can't think of a great way to unit test this as well. Let's do it.

@jezdez jezdez changed the title allow the compression level to be set in zip files Allow the compression level to be set in zip files Jun 6, 2023
@jezdez jezdez merged commit 031be65 into conda:main Jun 6, 2023
@github-actions github-actions bot added the locked [bot] locked due to inactivity label Jun 6, 2024
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Jun 6, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
cla-signed [bot] added once the contributor has signed the CLA locked [bot] locked due to inactivity
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

3 participants