A web application to extract tables from PDF files, with support for password-protected PDFs and multi-page documents.
- Upload PDF files
- Extract tables from bank statements
- Support for encrypted PDFs
- Download the extracted table as Excel
- Python 3.8+
- Java Runtime Environment (for tabula-py)
- Clone the repository
- Create a virtual environment
- Install dependencies:
pip install -r requirements.txt
python app.py
- Navigate to
http://localhost:5000
- Upload a PDF
- Enter password if PDF is encrypted
- View and download the extracted table