Semantic Image-Text-Classes

This dataset was introduced in the paper "Understanding, Categorizing and Predicting Semantic Image-Text Relations". If you use it, please cite:

@inproceedings{otto2019understanding,
  title={Understanding, Categorizing and Predicting Semantic Image-Text Relations},
  author={Otto, Christian and Springstein, Matthias and Anand, Avishek and Ewerth, Ralph},
  booktitle={Proceedings of ACM International Conference on Multimedia Retrieval (ICMR 2019)},
  year={2019}
}

To reassemble the full tar archive from its parts, run the following command:

cat train.tar.part* > train_concat.tar

Then extract it with:

tar -xf train_concat.tar
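
On systems without `cat` and `tar`, the two shell commands above can be approximated in Python. This is a minimal sketch, not part of the original dataset tooling: the helper names `concat_parts` and `extract` are hypothetical, and it assumes the part suffixes sort correctly in plain lexicographic order, as they must for the shell glob as well.

```python
import glob
import shutil
import tarfile


def concat_parts(pattern, out_path):
    """Join all files matching `pattern` (in sorted order) into `out_path`."""
    parts = sorted(glob.glob(pattern))
    if not parts:
        raise FileNotFoundError(f"no files match {pattern!r}")
    with open(out_path, "wb") as out:
        for part in parts:
            # Stream each part so large archives never sit fully in memory.
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return parts


def extract(tar_path, dest="."):
    """Unpack the archive into `dest`, similar to `tar -xf`."""
    with tarfile.open(tar_path) as tar:
        tar.extractall(dest)


# Usage (file names taken from the shell commands above):
# concat_parts("train.tar.part*", "train_concat.tar")
# extract("train_concat.tar")
```

The output name `train_concat.tar` does not match the `train.tar.part*` pattern, so rerunning the sketch does not append the archive to itself.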

The jsonl files contain one metadata record per line, with the following fields:

id, origin, ITClass, CMI, SC, STAT, text, tagged text, image_path
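
A metadata file in this format could be read roughly as follows. This is a sketch under assumptions: it presumes each line is a JSON object keyed by the field names listed above, the file name `train.jsonl` is a placeholder, and the helpers `read_jsonl` and `count_classes` are hypothetical names introduced only for illustration.

```python
import json
from collections import Counter


def read_jsonl(path):
    """Yield one dict per non-empty line of a JSON Lines file."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                yield json.loads(line)


def count_classes(records, key="ITClass"):
    """Count how often each value of `key` occurs; missing keys count as None."""
    return Counter(rec.get(key) for rec in records)


# Usage ("train.jsonl" is a placeholder file name):
# print(count_classes(read_jsonl("train.jsonl")))
```

Counting the `ITClass` values first is a quick way to check that the file parsed as expected before loading any images via `image_path`.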

License Information:

This dataset is composed of various open access sources as described in the paper. We thank all the original authors for their work.

Pitt Image Ads Dataset: http://people.cs.pitt.edu/~kovashka/ads/
Image-Net challenge: http://image-net.org/
Visual Storytelling Dataset (VIST): http://visionandlanguage.net/VIST/
Wikipedia: https://www.wikipedia.org/
Microsoft COCO: http://cocodataset.org/#home

Cite this as

Christian Otto (2019). Dataset: Semantic Image-Text-Classes. https://doi.org/10.25835/0010577

Retrieved: 14:20 12 Nov 2019 (GMT)

Additional Info

Field         Value
Author        Christian Otto
Maintainer    Christian Otto
Version       1.0
Last Updated  May 21, 2019, 15:44 (CEST)
Created       April 23, 2019, 12:18 (CEST)
License       Creative Commons Attribution-NonCommercial 3.0