Skip to content

DataSciencePros/DocumentClassificationNLP

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Document Classification With Natural Language Processing


About

In this blog post we covered document classification using Scikit-learn and the 20 News Groups dataset. We went over the basics of term frequency-inverse document frequency, pipelines and the Naive Bayes classifier

Using Notebook


You can install the dependencies and access the notebook using Docker by building the Docker image with the following:

docker build -t doc_class .

Followed by running the command container:

docker run -p 8888:8888 -t doc_class

See here for more info. Otherwise in addition to Python 3.5 and Jupyter notebooks, install the additional libraries listed in requirements.txt which can be installed with the command,

pip install -r requirements.txt

About

Document Classification With Natural Language Processing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 97.7%
  • Dockerfile 2.3%