
This application cleans text by removing stopwords and blacklisted words, allowing these texts to be used in artificial intelligence applications and natural language processing.


wissam-al-kahwaji/CleanData


About

This application cleans text by removing stopwords and blacklisted words, allowing these texts to be used in artificial intelligence applications and natural language processing.

Install Requirements

pip install -r requirements.txt

Requirements

  • Python 3.x
  • nltk library
  • pandas library
  • tqdm library
  • enchant library

How To Run

python main.py

Settings

# main.py

class CleanData(App):
    index = 'comment'  # Name of the column to clean
    input_csv_file_path = 'data.csv'  # Path of the input CSV file to read
    output_csv_file_path = 'cleaned_data.csv'  # Path of the output CSV file where the cleaned data is saved
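
As a rough illustration of how these three settings might fit together, the sketch below reads a CSV file, applies a cleaning function to one column, and writes the result to a new file. It is not taken from main.py: the function name clean_csv and its parameters are hypothetical, and it uses the standard-library csv module as a stand-in for pandas so that it runs without extra dependencies.

```python
import csv


def clean_csv(input_path, output_path, column, clean):
    """Apply `clean` to one column of a CSV file and write the result.

    Hypothetical helper for illustration; returns the number of rows written.
    """
    with open(input_path, newline="", encoding="utf-8") as src:
        reader = csv.DictReader(src)
        fieldnames = reader.fieldnames or []
        if column not in fieldnames:
            raise KeyError(f"column {column!r} not found in {input_path}")
        rows = list(reader)

    # Rewrite only the chosen column; every other column is left untouched.
    for row in rows:
        row[column] = clean(row[column] or "")

    with open(output_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

With the default settings, a call would look like clean_csv('data.csv', 'cleaned_data.csv', 'comment', some_cleaning_function), where some_cleaning_function takes a string and returns the cleaned string.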

To remove additional words from the text, add them to blacklist.txt, separated by spaces.

# blacklist.txt
game commit hello
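
The sketch below shows one way a space-separated blacklist like the one above could be combined with a stopword list to filter text. It is an assumption about the approach, not the project's actual code: parse_blacklist, clean_text, and the tiny STOPWORDS set are hypothetical. Since nltk is listed as a requirement, the real stopword list presumably comes from there.

```python
# Hypothetical helpers for illustration; the real main.py may work differently.

# Tiny stand-in stopword list (assumption, not the list the project uses).
STOPWORDS = {"a", "an", "the", "is", "to", "and", "of"}


def parse_blacklist(contents):
    """Split space-separated blacklist text into a set of lowercase words."""
    return {word.lower() for word in contents.split()}


def clean_text(text, blacklist):
    """Drop stopwords and blacklisted words, comparing case-insensitively."""
    blocked = STOPWORDS | blacklist
    kept = [word for word in text.split() if word.lower() not in blocked]
    return " ".join(kept)


blacklist = parse_blacklist("game commit hello")
print(clean_text("Hello this is a great game to play", blacklist))
# prints: this great play
```

Splitting on whitespace keeps the example short, but punctuation attached to a word (for example "game,") would not match the blacklist; a real tokenizer would handle that case.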

Note

Use this application responsibly and in accordance with applicable local and international data protection laws and policies.
