File-Scraping-for-Publisher

Code written to scrape 2000 page documents and create more workable documents

Pdf2split.py : This is a script I did for a publshing company. It looks through a 2000 page report on author royalties and separates it into many different documents, one for each author paid during that month. The former process was to copy and paste by hand so this script sped up the process immensely.

Makecsv111NEW.py :This script takes a huge text file generated from the same PDF file, finds a few fields and produces a csv file for import into Excel. I put comments throughout in response to client concern about a need for future changes.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Makecsv111NEW.py		Makecsv111NEW.py
README.md		README.md
pdfsplit2.py		pdfsplit2.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

File-Scraping-for-Publisher

About

Uh oh!

Releases

Packages

Languages

chrisfs/File-Scraping-for-Publisher

Folders and files

Latest commit

History

Repository files navigation

File-Scraping-for-Publisher

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages