Skip to content

Latest commit

 

History

History
4 lines (3 loc) · 295 Bytes

File metadata and controls

4 lines (3 loc) · 295 Bytes

duplicate_cleansing

Takes an input CSV and produces a CSV of duplicate records. Then the input CSV is cleansed to remove duplicates.

First run find_duplicates.py, then, if there is more than the header line in the dupes file, remove_duplicates.py will run and the clean file will be created.