Convert PDF to Excel — improved parser
This single-file tool extracts text from PDFs (using pdf.js) and intelligently groups tokens into columns and rows. It works best for digitally-created PDFs with selectable text. Scanned PDFs require OCR (see notes below).
About reliability
- Designed for PDFs with selectable text (digital PDFs).
- Scanned PDFs (images) won't extract text — you'll get a warning and need OCR.
- Adjust Gap sensitivity if columns merge or split incorrectly.
- If conversion still fails for a specific file, you can share the PDF and I can inspect it.
Drop PDF here or click to choose
Supports multi-page PDFs. Processing happens locally in your browser.
No file loaded.
Conversion notes
If output is a single column or text seems scrambled, try increasing the Gap sensitivity so the parser treats wider gaps as column breaks. If you need OCR for scanned PDFs, I can add server-side OCR using Tesseract or a cloud OCR API.