Three Wikipedia Datasets for Joint Information Extraction and Reasoning

=================================================================================================
train: the training context data.
test: the test context data.

Each file is represented as a TAB-separated triple of head entity, tail entity, and the context. 

relation.graph: the TAB-separated relation triple that includes the head and tail entities, as well as the relation.
=================================================================================================

The detailed descriptions of the three datasets can be found in the following paper:

William Yang Wang and William W. Cohen, "Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach", to appear in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and The 7th International Joint Conference of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2015), long paper for oral presentation, Beijing, China, July 26-31, ACL. 
=================================================================================================

Note that there are absolutely no guarantees with the dataset,
and you are welcome to report the errors and bugs of the preliminary version
of this dataset.

You are allowed to use this dataset for research purposes only.
You may re-distribute the dataset, but you must retain this readme file in the re-distribution.

For more question about the dataset, please contact:
William Wang, yww@cs.cmu.edu

v1.0, 05/27/2015

