PDF Tools

PDF to Excel Converter

Turn the text in your PDF into an Excel spreadsheet you can open in Microsoft Excel, Google Sheets, or LibreOffice Calc. Each line on the page becomes a row, and columns are split where there's a clear gap between values. Scanned, image-only PDFs are recognized with built-in OCR first, so even photographed statements and reports can flow into a worksheet — all without uploading your file anywhere.

JavaScript Required

This tool requires JavaScript to run. Please enable JavaScript in your browser settings to use PDF to Excel Converter.

Why JavaScript? This tool processes your files entirely in your browser using WebAssembly — nothing is uploaded to servers. This privacy-first approach requires JavaScript to be enabled.

What does this tool do?

The PDF to Excel converter reads the text content of a PDF and writes a native .xlsx workbook. For PDFs that already contain text, it extracts the words and their positions directly; for image-only scans, it runs optical character recognition in your browser before building the sheet. Each recognized line is placed on its own row, and wide horizontal gaps between words are interpreted as column breaks so tabular layouts land in separate cells. The result is a standard Office Open XML spreadsheet, not a picture of a table, so the cells are real, selectable, and editable.

How it works

The tool rasterizes scanned pages and recognizes their text with a Tesseract LSTM engine, or pulls embedded text straight from text-based PDFs. The recognized words carry pixel positions, which are grouped into visual lines by vertical alignment and ordered left to right. Within each line, the spacing between words is measured: a gap noticeably wider than ordinary word spacing is treated as a column boundary, splitting the line into multiple cells. Those rows and cells are serialized into the worksheet XML and zipped into a valid .xlsx file using the Office Open XML standard, ready to open in any spreadsheet application.

Features

Outputs a real .xlsx workbook (opens in Excel, Google Sheets, LibreOffice Calc)
Automatic OCR for scanned, image-only PDFs
Each page line becomes a spreadsheet row
Best-effort column splitting on wide gaps between values
Optional blank row between pages
100% in-browser — your file never leaves your device
Standard Office Open XML format

How to use

1

Upload your PDF

Drag any PDF onto the drop zone. Text-based PDFs are read directly; scanned or photographed pages are detected and OCR'd automatically.
2

Choose column handling

Keep column splitting on to break each line into cells at wide gaps, or turn it off to put each full line in a single column. Optionally add a blank row between pages.
3

Convert to Excel

Click Convert to Excel. Text is extracted (with OCR when needed) and assembled into rows and cells in a native .xlsx workbook.
4

Open in your spreadsheet app

Download the .xlsx file and open it in Microsoft Excel, Google Sheets, or LibreOffice Calc. Adjust columns and clean up cells as needed.

Common use cases

Bank statements into a sheet

Pull transaction lines from a PDF statement into Excel rows so you can sort, total, and categorize them instead of retyping by hand.

Reports and invoices

Extract line items and figures from PDF reports or invoices into a worksheet for further calculation and analysis.

Scanned tables

Recognize text from scanned or photographed pages and drop it into a spreadsheet, with columns split where the original had clear spacing.

Data reuse

Move tabular content out of read-only PDFs and into a format you can filter, chart, and share with formulas.

Tips & best practices

Column splitting is a best-effort heuristic based on spacing, not true table detection — review and adjust columns after opening the file
Tightly packed numeric columns or merged cells may misalign, since clear gaps aren't always present — some manual cleanup is normal
For scanned PDFs, higher-quality scans produce more accurate OCR and cleaner columns
Turn off column splitting when you just want each line as a single cell for further processing

Frequently asked questions

Does it work on scanned PDFs?

Yes. Image-only pages are detected and recognized with built-in OCR before the spreadsheet is built, so scanned statements and reports can be converted too. Recognition quality depends on the clarity of the scan.

Will it perfectly reconstruct my tables?

No. This tool dumps recognized lines into rows and splits columns on wide spacing — it does not do true table-structure detection. Bordered or tightly packed tables may need column cleanup after opening in your spreadsheet app.

Are the cells real or just an image?

Real. The output is a standard .xlsx workbook with selectable, editable cells — not a picture of a table embedded in a sheet.

Is this really free and private?

Yes. Everything runs in your browser using client-side processing and OCR. There are no uploads, no subscriptions, and no usage limits.