issues Search Results · language:Dune language:Python language:JavaScript language:Java language:HTML language:JavaScript
61.8M results
Description
Static HTML fallback for Petco if Amazon is consistently blocked.
Acceptance Criteria
- [ ] PetcoScraper.get_product(url) returns RawProductData
- [ ] Uses Requests + BeautifulSoup (no ...
backend
tdd
Description
Open circuit after 3 consecutive blocks; close after 1 hour cooldown.
Acceptance Criteria
- [ ] After 3 consecutive ScraperBlockedError, circuit opens
- [ ] While circuit open, get_product() ...
backend
tdd
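A minimal sketch of the circuit described in this issue, assuming a wrapper object around the scraper call. `ScraperBlockedError` is named in the issue; `CircuitBreaker`, `CircuitOpenError`, and the injectable `clock` parameter are hypothetical names introduced only for illustration. The issue text is truncated before it says what `get_product()` does while the circuit is open, so raising an error here is an assumption.

```python
import time


class ScraperBlockedError(Exception):
    """Named in the issue; defined here only so the sketch is self-contained."""


class CircuitOpenError(Exception):
    """Hypothetical: the issue snippet is cut off before specifying open-circuit behavior."""


class CircuitBreaker:
    """Opens after `threshold` consecutive blocks, closes after `cooldown_seconds`."""

    def __init__(self, threshold=3, cooldown_seconds=3600.0, clock=time.monotonic):
        self.threshold = threshold
        self.cooldown_seconds = cooldown_seconds
        self._clock = clock  # injectable so tests need not wait an hour
        self._consecutive_blocks = 0
        self._opened_at = None

    @property
    def is_open(self):
        if self._opened_at is None:
            return False
        if self._clock() - self._opened_at >= self.cooldown_seconds:
            # Cooldown elapsed: close the circuit and reset the block counter.
            self._opened_at = None
            self._consecutive_blocks = 0
            return False
        return True

    def call(self, func, *args, **kwargs):
        if self.is_open:
            # Assumption: fail fast without calling the scraper.
            raise CircuitOpenError("circuit open; call skipped")
        try:
            result = func(*args, **kwargs)
        except ScraperBlockedError:
            self._consecutive_blocks += 1
            if self._consecutive_blocks >= self.threshold:
                self._opened_at = self._clock()
            raise
        # Any success breaks the run of consecutive blocks.
        self._consecutive_blocks = 0
        return result
```

Passing a fake `clock` lets a test advance time past the one-hour cooldown without sleeping, which suits the `tdd` label on the issue.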
FAST_REFORM_005_POST_RECLAIM_VALIDATION_AND_KPIONE2_REWORK
Parent epic: #5 Depends on: #10 Previous phase: FAST_REFORM_004_EXECUTION_OK_WITH_VIEW_DIVERGENCE
Current state after FAST_REFORM_004
Emergency ...
enhancement
Description
Detect CAPTCHA pages and anti-bot blocks from Amazon.
Acceptance Criteria
- [ ] Detects CAPTCHA page by checking for known selectors/text
- [ ] Detects block by checking for 503 or robot ...
backend
tdd
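The two checks named in this issue could be sketched as below. The marker strings are hypothetical placeholders, not known Amazon selectors or text, and the function names are illustrative. The snippet is truncated after "robot", so only the 503 check and the text-marker check are shown.

```python
# Hypothetical placeholder markers; the real "known selectors/text" are not
# given in the issue snippet.
CAPTCHA_MARKERS = ("captcha",)


def looks_like_captcha(html: str) -> bool:
    """Return True if any placeholder marker appears in the page text."""
    lowered = html.lower()
    return any(marker in lowered for marker in CAPTCHA_MARKERS)


def looks_blocked(status_code: int, html: str) -> bool:
    """Treat an HTTP 503 or a CAPTCHA-looking page as a block."""
    return status_code == 503 or looks_like_captcha(html)
```

A caller would presumably raise `ScraperBlockedError` when `looks_blocked` returns True, though the snippet does not confirm that wiring.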
Description
Full Amazon scraper: launch browser, navigate, parse, return RawProductData.
Acceptance Criteria
- [ ] AmazonScraper.get_product(url) returns RawProductData
- [ ] Rate limit: ≤ 0.5 req/sec ...
backend
tdd
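A rate limit of at most 0.5 requests per second means at least 2 seconds between requests. The sketch below shows one way to enforce that before each navigation; `RateLimiter` and its `clock` and `sleep` parameters are hypothetical names, and nothing in the snippet says how `AmazonScraper` actually implements the limit.

```python
import time


class RateLimiter:
    """Enforces a minimum interval between calls to wait()."""

    def __init__(self, max_per_second=0.5, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second  # 0.5 req/sec -> 2.0 seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None

    def wait(self):
        """Block until at least min_interval has passed since the previous call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Calling `wait()` immediately before each request keeps the spacing independent of how long parsing takes.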
Follow-up hardening for the release tooling landed in #611 (single-source plugin.json from stable tags). A late
high-effort review surfaced these — all verified against merged main. The script-injection ...
enhancement
Description
Parse Amazon product page HTML (rendered by Playwright) to extract product data.
Acceptance Criteria
- [ ] parse_product_page(html) extracts same fields as Chewy parser
- [ ] Handles ...
backend
tdd
Context
A library-wide re-cluster was launched on 2026-06-20 to apply three fixes:
- DB burst_uuid cleared on force-rescan (commit f642da4)
- In-memory stale burst_uuid cleared after clear_stacks_in_folder ...
area:python
priority:p1
type:chore
Thanks for sharing this project! Could you provide license information for Annoy-PyEdu-Rs-Raw and Annoy-PyEdu-Rs? Thanks!
Description
Set up Playwright browser launcher with anti-detection measures.
Acceptance Criteria
- [ ] app/scrapers/amazon.py launches headless Chromium
- [ ] Stealth patches: randomized viewport, ...
backend
