These are the datasets used in the prediction and generation experiments in:

William Yang Wang and Miaomiao Wen, "I Can Has Cheezburger? A Nonparanormal Approach to Combining Textual and Visual Information for Predicting and Generating Popular Meme Descriptions", to appear in the 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015), long paper, Denver, CO., USA, May 31-June 5, ACL.

-------------------------------------------------------
generation: the related documents from the generation experiment.  

images: the original raw meme images.  

prediction: the related documents from the prediction experiment.

See the detailed README.txt within each directory for the descriptions of the documents.

If you are just looking for resources for meme-related research,
the images, and the reference.csv in the generation directory might be helpful to you. 

-------------------------------------------------------
The original sources retain the copyright of the data.

Note that there are absolutely no guarantees with this data,
and we provide this dataset "as is",
but you are welcome to report the issues of the preliminary version
of this data.

You are allowed to use this dataset for research purposes only.
You may re-distribute the dataset, but you must retain this readme file in the re-distribution.

For more question about the dataset, please contact:
William Wang, yww@cs.cmu.edu

v1.0 03/20/2015
