P→X
PDF → Excel
Visit

Convert PDF to Excel — improved parser

This single-file tool extracts text from PDFs (using pdf.js) and intelligently groups tokens into columns and rows. It works best for digitally-created PDFs with selectable text. Scanned PDFs require OCR (see notes below).

About reliability

  • Designed for PDFs with selectable text (digital PDFs).
  • Scanned PDFs (images) won't extract text — you'll get a warning and need OCR.
  • Adjust Gap sensitivity if columns merge or split incorrectly.
  • If conversion still fails for a specific file, you can share the PDF and I can inspect it.
Drop PDF here or click to choose
Supports multi-page PDFs. Processing happens locally in your browser.
No file loaded.

Conversion notes

If output is a single column or text seems scrambled, try increasing the Gap sensitivity so the parser treats wider gaps as column breaks. If you need OCR for scanned PDFs, I can add server-side OCR using Tesseract or a cloud OCR API.