- project for paper "Harvesting Events from Multiple Sources: Towards a Cross-Document Extraction Paradigm"
- Firstly, we gather document-level event data from Wikipedia. The original dataset is located in "dataset/document_level_dataset".
- Secondly, we extract events from the documents using OmniEvent as our tool to obtain the raw dataset.
- We utilize human validation to obtain the final dataset, as described in the paper.