mcwaddams

Author	SHA1	Message	Date
Ryan Malloy	2f39c4ec5b	Fix critical xpath API bug breaking chapter/heading detection python-docx elements don't support xpath() with namespaces kwarg. The calls silently failed in try/except blocks, causing chapter search and heading detection to never find matches. Fixed by replacing xpath(..., namespaces={...}) with: - findall('.//' + qn('w:t')) for text elements - find(qn('w:pPr')) + find(qn('w:pStyle')) for style detection - get(qn('w:val')) for attribute values Also fixed logic bug where elif prevented short-text fallback from running when a non-heading style existed on the paragraph.	2026-01-11 05:20:05 -07:00
Ryan Malloy	af6aadf559	Refactor: Extract processing logic into utility modules Complete architecture cleanup - eliminated duplicate server files: - Deleted server_monolithic.py (2249 lines) - Deleted server_legacy.py (2209 lines) New utility modules created: - utils/word_processing.py - Word extraction/conversion (preserves page range fixes) - utils/excel_processing.py - Excel extraction - utils/powerpoint_processing.py - PowerPoint extraction - utils/processing.py - Universal helpers (parse_page_range, health checks, etc.) Updated mixins to import from utils instead of server_monolithic. Entry point remains server.py (48 lines) using mixin architecture. All 53 tests pass. Coverage improved from 11% to 22% by removing duplicate code.	2026-01-11 05:08:18 -07:00
Ryan Malloy	76c7a0b2d0	Add decorators for field defaults and error handling, fix Excel performance - Create @resolve_field_defaults decorator to handle Pydantic FieldInfo objects when tools are called directly (outside MCP framework) - Create @handle_office_errors decorator for consistent error wrapping - Apply decorators to Excel and Word mixins, removing ~100 lines of boilerplate code - Fix Excel formula extraction performance: load workbooks once before loop instead of per-cell (100x faster with calculated values) - Update test suite to use correct mock patch paths (patch where names are looked up, not where defined) - Add torture_test.py for real document validation	2026-01-10 23:51:30 -07:00
Ryan Malloy	b681cb030b	Initial commit: MCP Office Tools v0.1.0 - Comprehensive Microsoft Office document processing server - Support for Word (.docx, .doc), Excel (.xlsx, .xls), PowerPoint (.pptx, .ppt), CSV - 6 universal tools: extract_text, extract_images, extract_metadata, detect_office_format, analyze_document_health, get_supported_formats - Multi-library fallback system for robust processing - URL support with intelligent caching - Legacy Office format support (97-2003) - FastMCP integration with async architecture - Production ready with comprehensive documentation 🤖 Generated with Claude Code (claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-18 01:01:48 -06:00

4 Commits