This directory contains data from the Kaggle dataset Annotated Corpus for Named Entity Recognition.
The dataset is provided in three formats:
- IOB scheme: not used in the notebooks, but it can be used in place of the BILUO dataset in the bilstm ner notebook
- BILUO scheme: used in the bilstm ner notebook
- JSONL format: used in the spacy ner notebook
This directory also contains a notebook with code to convert the dataset from one format to another. In particular, you can convert the dataset from the IOB scheme to the BILUO scheme, and from the BILUO scheme to the JSONL format and vice versa.
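As a rough illustration of what the IOB to BILUO conversion involves, here is a minimal sketch. It is not the code from the notebook: the function names `iob_to_biluo` and `biluo_to_iob` are hypothetical, the input is assumed to be a well-formed list of IOB2 tags for one sentence (every entity starts with `B-`), and the `geo` label is only an example.

```python
def iob_to_biluo(tags):
    """Convert one sentence of IOB2 tags to BILUO tags.

    Assumption: every entity starts with a B- tag (well-formed IOB2).
    """
    biluo = []
    for i, tag in enumerate(tags):
        if tag == "O":
            biluo.append(tag)
            continue
        prefix, label = tag.split("-", 1)
        next_tag = tags[i + 1] if i + 1 < len(tags) else "O"
        # The entity continues only if the next tag is I- with the same label.
        continues = next_tag == f"I-{label}"
        if prefix == "B":
            # Single-token entities become U-, multi-token ones keep B-.
            biluo.append(f"B-{label}" if continues else f"U-{label}")
        else:
            # An I- tag that ends the entity becomes L-.
            biluo.append(f"I-{label}" if continues else f"L-{label}")
    return biluo


def biluo_to_iob(tags):
    """Convert one sentence of BILUO tags back to IOB2 tags."""
    mapping = {"U": "B", "L": "I"}
    iob = []
    for tag in tags:
        if tag == "O":
            iob.append(tag)
            continue
        prefix, label = tag.split("-", 1)
        iob.append(f"{mapping.get(prefix, prefix)}-{label}")
    return iob


if __name__ == "__main__":
    tokens = ["London", "is", "in", "the", "United", "Kingdom"]
    iob_tags = ["B-geo", "O", "O", "O", "B-geo", "I-geo"]
    for token, old, new in zip(tokens, iob_tags, iob_to_biluo(iob_tags)):
        print(f"{token}\t{old}\t{new}")
```

The only difference between the two schemes is that BILUO marks the last token of a multi-token entity with `L-` and a single-token entity with `U-`, so the conversion only needs to look one tag ahead.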