Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scheduled scans periodically fail because of missing Tranco list #90

Open
ghostwords opened this issue Mar 24, 2025 · 0 comments
Open
Labels
bug Something isn't working

Comments

@ghostwords
Copy link
Member

ghostwords commented Mar 24, 2025

Seems to happen fairly frequently at this point.

Running scan in Docker...
Traceback (most recent call last):
  File "/home/user/./crawler.py", line 1203, in <module>
    domains = crawler.get_sitelist()
              ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/user/./crawler.py", line 897, in get_sitelist
    domains = Tranco(cache_dir=tempfile.gettempdir()).list(TRANCO_VERSION).top()
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ^
  File "/home/user/.local/lib/python3.12/site-packages/tranco/tranco.py", line 65, in list 
    top_list_text = self._download_zip_file(list_id)
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/user/.local/lib/python3.12/site-packages/tranco/tranco.py", line 106, in _download_zip_file
    raise AttributeError("The daily list for this date is currently unavailable.")
AttributeError: The daily list for this date is currently unavailable.
Scan failed. See log.txt for details.
@ghostwords ghostwords added the bug Something isn't working label Mar 24, 2025
@ghostwords ghostwords changed the title Scheduled scans periodically failing because of missing Tranco list Scheduled scans periodically fail because of missing Tranco list Mar 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant