📝 Update README: 40 tools, v2.0.7 table features, token management
Some checks failed
Security Scan / security-scan (push) Has been cancelled
parent
dfbf3d1870
commit
19bdeddcdf
19
README.md
@@ -24,19 +24,19 @@
 
 > 🎯 **The Problem**: PDFs contain incredible intelligence, but extracting it reliably is complex, slow, and often fails.
 >
-> ⚡ **The Solution**: MCP PDF delivers **AI-powered document intelligence** with **23 specialized tools** that understand both content and structure.
+> ⚡ **The Solution**: MCP PDF delivers **AI-powered document intelligence** with **40 specialized tools** that understand both content and structure.
 
 <table>
 <tr>
 <td>
 
 ### 🏆 **Why MCP PDF Leads**
-- **🚀 24 Specialized Tools** for every PDF scenario
+- **🚀 40 Specialized Tools** for every PDF scenario
 - **🧠 AI-Powered Intelligence** beyond basic extraction
 - **🔄 Multi-Library Fallbacks** for 99.9% reliability
 - **⚡ 10x Faster** than traditional solutions
 - **🌐 URL Processing** with smart caching
-- **👥 User-Friendly** 1-based page numbering
+- **🎯 Smart Token Management** prevents MCP overflow errors
 
 </td>
 <td>
@@ -119,7 +119,12 @@ Add to your `claude_desktop_config.json`:
 health = await analyze_pdf_health("quarterly-report.pdf")
 classification = await classify_content("quarterly-report.pdf")
 summary = await summarize_content("quarterly-report.pdf", summary_length="medium")
-tables = await extract_tables("quarterly-report.pdf", pages=[5,6,7])
+
+# Smart table extraction - prevents token overflow on large tables
+tables = await extract_tables("quarterly-report.pdf", pages="5-7", max_rows_per_table=100)
+# Or get just table structure without data
+table_summary = await extract_tables("quarterly-report.pdf", pages="5-7", summary_only=True)
+
 charts = await extract_charts("quarterly-report.pdf")
 
 # Get instant insights
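The hunk above switches `pages` from a list to a range string and adds `max_rows_per_table` and `summary_only`. The diff does not show how MCP PDF implements them, so the following is only a minimal sketch of the general idea, assuming 1-based page specs and tables represented as lists of rows with a header in the first row. The helper names `parse_page_range`, `cap_table_rows`, and `summarize_table` are hypothetical and are not part of the project's API.

```python
from typing import Any


def parse_page_range(spec: str) -> list[int]:
    """Expand a 1-based page spec such as "5-7" or "1,3,5-7" into page numbers.

    Hypothetical helper: the real parsing rules of MCP PDF are not shown in the diff.
    """
    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start < 1 or end < start:
                raise ValueError(f"invalid page range: {part!r}")
            pages.extend(range(start, end + 1))
        else:
            page = int(part)
            if page < 1:
                raise ValueError(f"invalid page number: {part!r}")
            pages.append(page)
    return pages


def cap_table_rows(rows: list[list[Any]], max_rows: int) -> dict[str, Any]:
    """Keep at most max_rows rows and report whether anything was dropped."""
    if max_rows < 0:
        raise ValueError("max_rows must be non-negative")
    return {
        "rows": rows[:max_rows],
        "total_rows": len(rows),
        "truncated": len(rows) > max_rows,
    }


def summarize_table(rows: list[list[Any]]) -> dict[str, Any]:
    """Describe a table's shape without returning its data (assumes row 0 is the header)."""
    header = rows[0] if rows else []
    return {"columns": list(header), "data_rows": max(len(rows) - 1, 0)}
```

Returning `total_rows` and a `truncated` flag alongside the capped rows lets a caller notice that data was dropped and request a narrower page range, rather than silently receiving a partial table.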
@@ -177,7 +182,7 @@ citations = await extract_text("research-paper.pdf", pages=[15,16,17])
 
 ---
 
-## 🛠️ **Complete Arsenal: 23 Specialized Tools**
+## 🛠️ **Complete Arsenal: 40+ Specialized Tools**
 
 <div align="center">
 
@@ -195,8 +200,8 @@ citations = await extract_text("research-paper.pdf", pages=[15,16,17])
 
 | 🔧 **Tool** | 📋 **Purpose** | ⚡ **Speed** | 🎯 **Accuracy** |
 |-------------|---------------|-------------|----------------|
-| `extract_text` | Multi-method text extraction | **Ultra Fast** | 99.9% |
-| `extract_tables` | Intelligent table processing | **Fast** | 98% |
+| `extract_text` | Multi-method text extraction with auto-chunking | **Ultra Fast** | 99.9% |
+| `extract_tables` | Smart table extraction with token overflow protection | **Fast** | 98% |
 | `ocr_pdf` | Advanced OCR for scanned docs | **Moderate** | 95% |
 | `extract_images` | Media extraction & processing | **Fast** | 99% |
 | `pdf_to_markdown` | Structure-preserving conversion | **Fast** | 97% |
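The updated table row describes `extract_text` as using "auto-chunking", but the diff does not show how chunks are formed. One common approach is to group pages so that each group stays under a token budget. The sketch below illustrates that approach only. It assumes a rough estimate of four characters per token, and the names `estimate_tokens` and `chunk_pages` are hypothetical, not taken from MCP PDF.

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate: about four characters per token (an assumption)."""
    return (len(text) + 3) // 4


def chunk_pages(page_texts: list[str], max_tokens: int) -> list[list[int]]:
    """Group 1-based page numbers into consecutive chunks under a token budget.

    A single page that exceeds the budget on its own still gets a chunk
    to itself, so no page is dropped.
    """
    if max_tokens < 1:
        raise ValueError("max_tokens must be positive")

    chunks: list[list[int]] = []
    current: list[int] = []
    used = 0
    for number, text in enumerate(page_texts, start=1):
        cost = estimate_tokens(text)
        # Start a new chunk when adding this page would exceed the budget.
        if current and used + cost > max_tokens:
            chunks.append(current)
            current, used = [], 0
        current.append(number)
        used += cost
    if current:
        chunks.append(current)
    return chunks
```

For example, three pages of 40 characters each (about 10 estimated tokens per page) with `max_tokens=20` would be grouped as pages 1 and 2 together, then page 3 alone.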