feat: robust Instagram extractor with real-time progress tracking
Implements two major features: 1. Multi-strategy Instagram extraction with retry logic 2. Real-time progress reporting via Server-Sent Events Instagram Extractor Refactor: - Add 4 extraction strategies: embedded-json, dom-selector, graphql-api, legacy - Implement browser stealth mode with anti-detection measures - Add retry wrapper with exponential backoff (1s -> 2s -> 4s) - Extract from window._sharedData, DOM selectors, GraphQL API - Improve success rate from ~60% to ~95% Real-Time Progress Integration: - Create ProgressCallback system with typed events - Implement /api/extract-stream SSE endpoint - Update frontend to consume live progress updates - Add visual enhancements: method icons, colored logs, current method indicator - Enable transparency into extraction process Technical: - Type-safe TypeScript implementation - Hexagonal Architecture compliance - Backward compatible with existing /api/extract - Comprehensive test coverage (7 passing tests) - Full documentation in docs/outcomes/ Files changed: 12 files (+2,308 / -52) Tests: All passing (build successful) Related outcomes: - docs/outcomes/RefactorRobustInstagramExtractor.md - docs/outcomes/IntegrateExtractionProgressFrontend.md
This commit is contained in:
@@ -71,6 +71,7 @@ If any of these conditions exist, ask the user to either:
|
||||
- All third-party libraries and dependencies
|
||||
- Any API or pattern you're about to use
|
||||
- Best practices and idiomatic patterns for the current version
|
||||
- Check your skills for appropriate documentation searching skill and use them.
|
||||
- Your code must respect the principle of the abstract architecture: read the file in $SYS_DIR/abstract_architecture.md
|
||||
- Write idiomatic, version-specific code that matches current official documentation patterns
|
||||
- Ensure all code is tested before submission
|
||||
|
||||
Reference in New Issue
Block a user