Evaluation¶
- Authors
Tong Zhu
- Last Update
Jan. 4th, 2021
You may wondering what are those terms in
Exps/<task_name>/Output/dee_eval.(dev|test).(pred|gold)_span.<model_name>.<epoch>.json.
Here are the explanation.
Doc Type¶
Document types are combined with the number of event types and the number of event instances per type.
- o2o
There is only one event type with one instance.
- o2m
There are only one event type with multiple instances.
- m2m
There are multiple event types.
Metrics¶
- classification
The event type classification measurements.
- entity
The Named Entity Recognition (NER) part of measurements.
- overall
The final metric with role-level evaluation as introduced in Doc2EDAG 1.
- instance
The instance-level measurements. One instance is recognised as True Positive (TP) iff all the argument roles have filled with correct arguments.
- trigger
For PTPCG,
triggermeans the evaluation of pseudo triggers.- adj_mat
For PTPCG,
adj_matmeans the evaluation of adjacent matrix for each document.- connection
For PTPCG,
connectionmeans the evaluation of connections between pseudo triggers and ordinary arguments.- rawCombination
In PTPCG,
rawCombinationis the combination evaluation results after the BK extraction without further instance generation and argument filtering.- combination
combinationis the combination evaluation results after the final instance generation process. Some arguments inrawCombinationmay be filtered out.
References¶
- 1
Shun Zheng, Wei Cao, Wei Xu, and Jiang Bian. 2020. Doc2EDAG: An end-to-end document-level framework for Chinese financial event extraction. EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference:337–346.