A PDF file scanner for scanning PDFs in bulk, built for personal automation with the PyPDF2, textract, and nltk libraries in Python. The scanner can read both text and images and convert them into readable text. The project is still ongoing: we plan to turn it into a web application and to add a whitelist filter for counting the frequencies of selected words in a PDF. Further development will follow when we have time, and there are many more features we plan to add.
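
The planned whitelist filter could work roughly as sketched below. This is a minimal illustration, not the project's actual code: the function names `extract_pdf_text`, `count_whitelisted_words`, and `scan_folder` are hypothetical, a plain regular expression stands in for nltk tokenization, and only embedded text is read through PyPDF2's `PdfReader`, so scanned images would still need an OCR step such as textract.

```python
import re
from collections import Counter
from pathlib import Path


def extract_pdf_text(pdf_path):
    """Return the embedded text of every page in one PDF (no OCR)."""
    # Imported lazily so the counting helper below works without PyPDF2.
    from PyPDF2 import PdfReader

    reader = PdfReader(str(pdf_path))
    # extract_text() can yield nothing for image-only pages, hence the fallback.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def count_whitelisted_words(text, whitelist):
    """Count how often each whitelisted word occurs in text, ignoring case."""
    allowed = {word.lower() for word in whitelist}
    # Assumption: a simple regex tokenizer instead of nltk.word_tokenize.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(token for token in tokens if token in allowed)


def scan_folder(folder, whitelist):
    """Map each PDF file name in a folder to its whitelist word counts."""
    return {
        pdf.name: count_whitelisted_words(extract_pdf_text(pdf), whitelist)
        for pdf in sorted(Path(folder).glob("*.pdf"))
    }
```

Calling `scan_folder("invoices", ["invoice", "total"])` on a hypothetical `invoices` directory would return one `Counter` per PDF. Keeping extraction separate from counting means the counting step can be reused unchanged if the text later comes from OCR instead.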
Gv3N/PDF_File_Scanner