zov-tech/backend-py/app/parsers
wasrusgen 839e775151 yamarket: rewrite for /card/{slug}/{id} URL pattern (Я.Маркет 2026)
- Old /product--{id} URLs deprecated
- Walks up from a[href*='/card/'] to nearest article/zone-div
- Extracts title from link text or h2/h3/itemprop=name
- Price: min from card text (with sanity bounds 100..10M)
- Image filters yastatic / _next placeholders
- Rating: '4.7★' or '4.7 N оценок' pattern
- Reviews: 'N отзывов' / 'N оценок'
- Stores count: 'от N магазинов / предложений'
2026-05-11 16:26:28 +03:00
..
__init__.py backend: working parsers — OZON + Citilink (DOM via Playwright) + WB 2026-05-11 13:53:07 +03:00
citilink.py citilink: dedup by product ID + filter Next.js placeholder images 2026-05-11 13:59:07 +03:00
dns.py backend: Proxy6 pool + parsers WB / OZON / Я.Маркет / DNS 2026-05-11 12:18:04 +03:00
ozon.py ozon: fix false-positive challenge detector (was catching 'challenge' in normal JS) 2026-05-11 13:54:13 +03:00
playwright_engine.py playwright_engine: route through proxy_pool — random residential IP per request 2026-05-11 16:05:36 +03:00
wb.py ai+report: deeper analysis — required pros/cons, category insights, source visibility 2026-05-11 14:34:08 +03:00
yamarket.py yamarket: rewrite for /card/{slug}/{id} URL pattern (Я.Маркет 2026) 2026-05-11 16:26:28 +03:00