fix: replace curl_cffi with scrapling in codeforces metadata
Problem: `codeforces.py` used `curl_cffi` to bypass Cloudflare when fetching contest problem HTML, making it unavailable in the nix python env and requiring an extra dependency. Solution: rewrite `_fetch_problems_html` to use scrapling `StealthySession` with `solve_cloudflare=True`, matching the existing CF submit pattern. Extend `needs_browser` in `scraper.lua` to route CF `metadata` and `tests` through the FHS env on NixOS. Remove `curl-cffi` from `pyproject.toml`, `flake.nix`, and test mocks.
This commit is contained in:
parent
543480a4fe
commit
297c71e7c7
5 changed files with 30 additions and 35 deletions
|
|
@ -8,7 +8,6 @@ dependencies = [
|
|||
"backoff>=2.2.1",
|
||||
"beautifulsoup4>=4.13.5",
|
||||
"scrapling[fetchers]>=0.4",
|
||||
"curl-cffi>=0.13.0",
|
||||
"httpx>=0.28.1",
|
||||
"ndjson>=0.3.1",
|
||||
"pydantic>=2.11.10",
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue