PDF to Excel Converter
Turn the text in your PDF into an Excel spreadsheet you can open in Microsoft Excel, Google Sheets, or LibreOffice Calc. Each line on the page becomes a row, and columns are split where there's a clear gap between values. Scanned, image-only PDFs are recognized with built-in OCR first, so even photographed statements and reports can flow into a worksheet — all without uploading your file anywhere.
What does this tool do?
The PDF to Excel converter reads the text content of a PDF and writes a native .xlsx workbook. For PDFs that already contain text, it extracts the words and their positions directly; for image-only scans, it runs optical character recognition in your browser before building the sheet. Each recognized line is placed on its own row, and wide horizontal gaps between words are interpreted as column breaks so tabular layouts land in separate cells. The result is a standard Office Open XML spreadsheet, not a picture of a table, so the cells are real, selectable, and editable.
How it works
The tool rasterizes scanned pages and recognizes their text with a Tesseract LSTM engine, or pulls embedded text straight from text-based PDFs. The recognized words carry pixel positions, which are grouped into visual lines by vertical alignment and ordered left to right. Within each line, the spacing between words is measured: a gap noticeably wider than ordinary word spacing is treated as a column boundary, splitting the line into multiple cells. Those rows and cells are serialized into the worksheet XML and zipped into a valid .xlsx file using the Office Open XML standard, ready to open in any spreadsheet application.
Features
- Outputs a real .xlsx workbook (opens in Excel, Google Sheets, LibreOffice Calc)
- Automatic OCR for scanned, image-only PDFs
- Each page line becomes a spreadsheet row
- Best-effort column splitting on wide gaps between values
- Optional blank row between pages
- 100% in-browser — your file never leaves your device
- Standard Office Open XML format
How to use
- 1
Upload your PDF
Drag any PDF onto the drop zone. Text-based PDFs are read directly; scanned or photographed pages are detected and OCR'd automatically.
- 2
Choose column handling
Keep column splitting on to break each line into cells at wide gaps, or turn it off to put each full line in a single column. Optionally add a blank row between pages.
- 3
Convert to Excel
Click Convert to Excel. Text is extracted (with OCR when needed) and assembled into rows and cells in a native .xlsx workbook.
- 4
Open in your spreadsheet app
Download the .xlsx file and open it in Microsoft Excel, Google Sheets, or LibreOffice Calc. Adjust columns and clean up cells as needed.
Common use cases
Bank statements into a sheet
Pull transaction lines from a PDF statement into Excel rows so you can sort, total, and categorize them instead of retyping by hand.
Reports and invoices
Extract line items and figures from PDF reports or invoices into a worksheet for further calculation and analysis.
Scanned tables
Recognize text from scanned or photographed pages and drop it into a spreadsheet, with columns split where the original had clear spacing.
Data reuse
Move tabular content out of read-only PDFs and into a format you can filter, chart, and share with formulas.
Tips & best practices
- Column splitting is a best-effort heuristic based on spacing, not true table detection — review and adjust columns after opening the file
- Tightly packed numeric columns or merged cells may misalign, since clear gaps aren't always present — some manual cleanup is normal
- For scanned PDFs, higher-quality scans produce more accurate OCR and cleaner columns
- Turn off column splitting when you just want each line as a single cell for further processing