📝 Update README: 40 tools, v2.0.7 table features, token management
Some checks failed
Security Scan / security-scan (push) Has been cancelled
Some checks failed
Security Scan / security-scan (push) Has been cancelled
This commit is contained in:
parent
dfbf3d1870
commit
19bdeddcdf
19
README.md
19
README.md
@ -24,19 +24,19 @@
|
||||
|
||||
> 🎯 **The Problem**: PDFs contain incredible intelligence, but extracting it reliably is complex, slow, and often fails.
|
||||
>
|
||||
> ⚡ **The Solution**: MCP PDF delivers **AI-powered document intelligence** with **23 specialized tools** that understand both content and structure.
|
||||
> ⚡ **The Solution**: MCP PDF delivers **AI-powered document intelligence** with **40 specialized tools** that understand both content and structure.
|
||||
|
||||
<table>
|
||||
<tr>
|
||||
<td>
|
||||
|
||||
### 🏆 **Why MCP PDF Leads**
|
||||
- **🚀 24 Specialized Tools** for every PDF scenario
|
||||
- **🚀 40 Specialized Tools** for every PDF scenario
|
||||
- **🧠 AI-Powered Intelligence** beyond basic extraction
|
||||
- **🔄 Multi-Library Fallbacks** for 99.9% reliability
|
||||
- **⚡ 10x Faster** than traditional solutions
|
||||
- **🌐 URL Processing** with smart caching
|
||||
- **👥 User-Friendly** 1-based page numbering
|
||||
- **🎯 Smart Token Management** prevents MCP overflow errors
|
||||
|
||||
</td>
|
||||
<td>
|
||||
@ -119,7 +119,12 @@ Add to your `claude_desktop_config.json`:
|
||||
health = await analyze_pdf_health("quarterly-report.pdf")
|
||||
classification = await classify_content("quarterly-report.pdf")
|
||||
summary = await summarize_content("quarterly-report.pdf", summary_length="medium")
|
||||
tables = await extract_tables("quarterly-report.pdf", pages=[5,6,7])
|
||||
|
||||
# Smart table extraction - prevents token overflow on large tables
|
||||
tables = await extract_tables("quarterly-report.pdf", pages="5-7", max_rows_per_table=100)
|
||||
# Or get just table structure without data
|
||||
table_summary = await extract_tables("quarterly-report.pdf", pages="5-7", summary_only=True)
|
||||
|
||||
charts = await extract_charts("quarterly-report.pdf")
|
||||
|
||||
# Get instant insights
|
||||
@ -177,7 +182,7 @@ citations = await extract_text("research-paper.pdf", pages=[15,16,17])
|
||||
|
||||
---
|
||||
|
||||
## 🛠️ **Complete Arsenal: 23 Specialized Tools**
|
||||
## 🛠️ **Complete Arsenal: 40+ Specialized Tools**
|
||||
|
||||
<div align="center">
|
||||
|
||||
@ -195,8 +200,8 @@ citations = await extract_text("research-paper.pdf", pages=[15,16,17])
|
||||
|
||||
| 🔧 **Tool** | 📋 **Purpose** | ⚡ **Speed** | 🎯 **Accuracy** |
|
||||
|-------------|---------------|-------------|----------------|
|
||||
| `extract_text` | Multi-method text extraction | **Ultra Fast** | 99.9% |
|
||||
| `extract_tables` | Intelligent table processing | **Fast** | 98% |
|
||||
| `extract_text` | Multi-method text extraction with auto-chunking | **Ultra Fast** | 99.9% |
|
||||
| `extract_tables` | Smart table extraction with token overflow protection | **Fast** | 98% |
|
||||
| `ocr_pdf` | Advanced OCR for scanned docs | **Moderate** | 95% |
|
||||
| `extract_images` | Media extraction & processing | **Fast** | 99% |
|
||||
| `pdf_to_markdown` | Structure-preserving conversion | **Fast** | 97% |
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user