chrisfs/File-Scraping-for-Publisher

File-Scraping-for-Publisher

Code written to scrape 2000-page documents and create more workable documents

Pdf2split.py : This is a script I wrote for a publishing company. It looks through a 2000-page report on author royalties and separates it into many different documents, one for each author paid during that month. The former process was to copy and paste by hand, so this script sped up the process immensely.
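
The grouping step behind a split like this can be sketched as follows. This is a minimal illustration, not the actual logic in Pdf2split.py: it assumes the text of each page has already been extracted, and that an author's section starts with a line of the form `Author: NAME`. That header format, the `AUTHOR_RE` pattern, and the `group_pages_by_author` helper are all hypothetical, since the real report layout is not described here.

```python
import re
from collections import OrderedDict

# Hypothetical header format; the real report layout is an assumption.
AUTHOR_RE = re.compile(r"^Author:\s*(.+?)\s*$", re.MULTILINE)


def group_pages_by_author(page_texts):
    """Map each author name to the 0-based indexes of the pages in their section."""
    groups = OrderedDict()
    current = None
    for index, text in enumerate(page_texts):
        match = AUTHOR_RE.search(text)
        if match:
            # A new header starts a new author's section.
            current = match.group(1)
        if current is None:
            # Front matter before the first author header is skipped.
            continue
        # Pages without a header are treated as continuing the previous author.
        groups.setdefault(current, []).append(index)
    return groups


# Made-up sample pages, for illustration only.
pages = [
    "Royalty Report\nSummary",
    "Author: Jane Doe\nTitle A  12.50",
    "Title B  3.00",
    "Author: John Roe\nTitle C  7.25",
]
page_groups = group_pages_by_author(pages)
```

Each list of page indexes could then be handed to a PDF library to write one output document per author.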

Makecsv111NEW.py : This script takes a huge text file generated from the same PDF file, finds a few fields, and produces a CSV file for import into Excel. I put comments throughout in response to the client's concern about a need for future changes.
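
A text-to-CSV pass of this kind might look roughly like the sketch below. It is not the code in Makecsv111NEW.py: the three fields (`Author`, `Title`, `Royalty`), the `RECORD_RE` pattern, and the `text_to_csv` helper are hypothetical stand-ins, because the actual fields pulled from the report are not listed here.

```python
import csv
import io
import re

# Hypothetical record layout; the real fields in the report are an assumption.
RECORD_RE = re.compile(
    r"Author:\s*(?P<author>.+?)\s*\n"
    r"Title:\s*(?P<title>.+?)\s*\n"
    r"Royalty:\s*(?P<royalty>[\d.]+)"
)


def text_to_csv(report_text):
    """Extract (author, title, royalty) records and return them as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["author", "title", "royalty"])
    for match in RECORD_RE.finditer(report_text):
        # csv.writer quotes fields containing commas, so Excel keeps them intact.
        writer.writerow([match["author"], match["title"], match["royalty"]])
    return buffer.getvalue()


# Made-up sample text, for illustration only.
sample = (
    "Author: Jane Doe\nTitle: Book, Vol. 1\nRoyalty: 12.50\n"
    "--- page break ---\n"
    "Author: John Roe\nTitle: Another Book\nRoyalty: 7.25\n"
)
csv_text = text_to_csv(sample)
```

Keeping the field pattern in one named constant is one way to make later layout changes easy to apply, in the same spirit as the comments added for the client.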
