TemProb-NAACL18

View the Project on GitHub CogComp/TemProb-NAACL18

Prerequisites

Reproduce NAACL’18 Results

All the following commands should be run from the root dir of the project, i.e., TemProb-NAACL18/.

git clone git@github.com:qiangning/TemProb-NAACL18.git
cd TemProb-NAACL18
tar xf data/TemProb.txt.tar.gz -C data
sh scripts/mvn_install.sh
mvn compile

If no error messages pop up, you’re can move forward by

mkdir logs
mkdir logs/Awareness
mkdir logs/Awareness/CompareStateoftheArt_AllEventTimex
mkdir logs/Awareness/CompareStateoftheArt_PartialTBDense
mkdir logs/eval_corpus_prior
sh scripts/RunThis_All.sh > RunThis_All_log.txt

Note: Since Github is limiting the bandwidth for large files (our data/TemProb.txt is a large file), it’s very likely that you will see an error saying that TemProb.txt fails to be downloaded. In that case, please go to here and find the backup link to download it.

Again, if no errors are encountered, you should now have all the tables reported in the paper. Take a look at scripts/RunThis_All.sh and it should be rather easy to understand. For example,

One exception is Table 7, which was not generated automatically, but we have included the numbers in logs_refs/DONOTDELETE_Table7.txt. Another exception is the description right above Table 5 (these numbers couldn’t fit into Table 5, so we had to put them in the text). These numbers can be found in logs_refs/DONOTDELETE_Table5_improvement.

If you met with errors while evaluating the temporal awareness scores of each system, probably it’s due to python 2 vs 3 issues. Please change corresponding python commands to be python2.

Where do we find the logs?

Standard metrics (prec, rec, and F1):

Temporal awareness scores:

Note that sometimes the awareness evaluations are not finished by itself (due to an unknown instability in the awareness evaluation tools provided by TempEval3). You can go to the log of the standard metrics and locate a line starting with sh scripts/evaluate_general_dir.sh (usually at the bottom of each file). For example, if you see logs/CompareStateoftheArt_PartialTBDense/naacl.txt is incomplete. Since that corresponds to the 3rd line of Table 8, go to logs/Table8_line3_proposed_on_partialTBDense.txt and you will see a line I intentionally created for this situation:

sh scripts/evaluate_general_dir.sh output/Awareness/gold output/Awareness/CompareStateoftheArt_PartialTBDense/naacl naacl Awareness/CompareStateoftheArt_PartialTBDense

Run this from TemProb-NAACL18/ and you will see that logs/CompareStateoftheArt_PartialTBDense/naacl.txt is updated and complete now (may take a few seconds to complete).

I have also put the original logs I generated into logs_refs/ for your reference.

Where do we find the system outputs?

Citation

Please kindly cite the following paper: Qiang Ning, Hao Wu, Haoruo Peng, Dan Roth, “Improving Temporal Relation Extraction with a Globally Acquired Statistical Resource”, NAACL 2018 (pdf)

@inproceedings{NingWuPeRo18,
    author = {Qiang Ning and Hao Wu and Haoruo Peng and Dan Roth},
    title = {Improving Temporal Relation Extraction with a Globally Acquired Statistical Resource},
    booktitle = {NAACL},
    month = {6},
    year = {2018},
    publisher = {Association for Computational Linguistics},
    url = "http://cogcomp.org/papers/NingWuPeRo18.pdf",
}