Skip to content

Latest commit

 

History

History
25 lines (18 loc) · 724 Bytes

File metadata and controls

25 lines (18 loc) · 724 Bytes

pdftables

A library for extracting tables from PDF documents. pdftables is a fork from pdftables (0.0.4) which was developed by ScraperWiki.

Features

  • TODO (for now)
    • make it work with the latest version of pdfminer (20140328)
    • tidy up code base including PEP 8 compliance
    • review test cases and identify a set of pdf files to use for testing
    • Add some documentation
    • Confirm that Scraper Wiki is no longer interested in this and if this is the case change name of the package for release on PyPI