- Complete browser automation with Playwright integration
- High-level API functions: get(), get_many(), discover()
- JavaScript execution support with script parameters
- Content extraction optimized for LLM workflows
- Comprehensive test suite with 18 test files (700+ scenarios)
- Local Caddy test server for reproducible testing
- Performance benchmarking vs Katana crawler
- Complete documentation including JavaScript API guide
- PyPI-ready packaging with professional metadata
- UNIX philosophy: do web scraping exceptionally well
- Enhanced get() function with script, script_before, script_after parameters
- Enhanced get_many() function with script parameter (str or List[str])
- Enhanced discover() function with script and content_script parameters
- Updated ContentExtractor to populate script fields from page_data
- Maintained 100% backward compatibility
- Added comprehensive parameter validation and error handling
- Implemented script parameter alias support (script -> script_before)
- Added smart script distribution for multi-URL operations
- Enabled two-stage JavaScript execution for discovery workflow
All API functions now support JavaScript execution while preserving
existing functionality. The enhancement provides intuitive, optional
JavaScript capabilities that integrate seamlessly with the browser
automation layer.
- Comprehensive test suite (700+ lines) for JS execution in high-level API
- Test coverage analysis and validation infrastructure
- Enhancement proposal and implementation strategy
- Mock HTTP server with realistic JavaScript scenarios
- Parallel implementation strategy using expert agents and git worktrees
Ready for test-driven implementation of JavaScript enhancements.