Semantic Image-Text-Classes

doi:10.25835/0010577

This dataset is introduced by the paper "Understanding, Categorizing and Predicting Semantic Image-Text Relations".

If you use this dataset in your work, please cite:

@inproceedings{otto2019understanding,
  title={Understanding, Categorizing and Predicting Semantic Image-Text Relations},
  author={Otto, Christian and Springstein, Matthias and Anand, Avishek and Ewerth, Ralph},
  booktitle={In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR 2019)},
  year={2019}
}

To reassemble the full tar archive from the downloaded parts, run the following command:

cat train.tar.part* > train_concat.tar

Then extract it with:

tar -xf train_concat.tar
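
On systems without cat and tar, the same two steps can be sketched in Python with the standard library only. This is a minimal sketch, not part of the original dataset tooling: the function names concat_parts and extract_tar are hypothetical, and it assumes all train.tar.part* files sit in one directory and that their lexicographic order (partaa, partab, ...) is the correct order, as the shell glob above implies.

```python
import glob
import os
import shutil
import tarfile


def concat_parts(part_dir, out_path, pattern="train.tar.part*"):
    """Join split archive parts into one file, in lexicographic order.

    Hypothetical helper: mirrors `cat train.tar.part* > train_concat.tar`.
    """
    parts = sorted(glob.glob(os.path.join(part_dir, pattern)))
    if not parts:
        raise FileNotFoundError(f"no files matching {pattern!r} in {part_dir!r}")
    with open(out_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                # Stream each part so the full archive never has to fit in memory.
                shutil.copyfileobj(src, out)
    return parts


def extract_tar(tar_path, dest_dir):
    """Extract the archive; mirrors `tar -xf train_concat.tar`."""
    os.makedirs(dest_dir, exist_ok=True)
    with tarfile.open(tar_path) as tar:
        tar.extractall(dest_dir)
```

A call such as concat_parts(".", "train_concat.tar") followed by extract_tar("train_concat.tar", ".") would correspond to the two commands above. Note that the concatenated archive needs roughly as much free disk space as all parts combined.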

The jsonl files contain one metadata record per line, with the following fields:

id, origin, CMI, SC, STAT, ITClass, text, tagged text, image_path

License Information:

This dataset is composed of various open access sources as described in the paper. We thank all the original authors for their work.

Pitt Image Ads Dataset: http://people.cs.pitt.edu/~kovashka/ads/
Image-Net challenge: http://image-net.org/
Visual Storytelling Dataset (VIST): http://visionandlanguage.net/VIST/
Wikipedia: https://www.wikipedia.org/
Microsoft COCO: http://cocodataset.org/#home

Data and Resources

train.tar.partaa: 953.7 MByte
train.tar.partab: 953.7 MByte
train.tar.partac: 953.7 MByte
train.tar.partad: 953.7 MByte
train.tar.partae: 953.7 MByte
train.tar.partaf: 953.7 MByte
train.tar.partag: 953.7 MByte
train.tar.partah: 953.7 MByte
train.tar.partai: 953.7 MByte
train.tar.partaj: 953.7 MByte
train.tar.partak: 953.7 MByte
train.tar.partal: 953.7 MByte
train.tar.partam: 953.7 MByte
train.tar.partan: 953.7 MByte
train.tar.partao: 953.7 MByte
train.tar.partap: 953.7 MByte
train.tar.partaq: 953.7 MByte
train.tar.partar: 953.7 MByte
train.tar.partas: 953.7 MByte
train.tar.partat: 953.7 MByte
train.tar.partau: 953.7 MByte
train.tar.partav: 953.7 MByte
train.tar.partaw: 953.7 MByte
train.tar.partax: 953.7 MByte
train.tar.partay: 953.7 MByte
train.tar.partaz: 953.7 MByte
train.tar.partba: 953.7 MByte
train.tar.partbb: 953.7 MByte
train.tar.partbc: 953.7 MByte
train.tar.partbd: 953.7 MByte
train.tar.partbe: 953.7 MByte
train.tar.partbf: 953.7 MByte
train.tar.partbg: 953.7 MByte
train.tar.partbh: 953.7 MByte
train.tar.partbi: 953.7 MByte
train.tar.partbj: 953.7 MByte
train.tar.partbk: 953.7 MByte
train.tar.partbl: 953.7 MByte
train.tar.partbm: 953.7 MByte
train.tar.partbn: 953.7 MByte
train.tar.partbo: 953.7 MByte
train.tar.partbp: 953.7 MByte
train.tar.partbq: 953.7 MByte
train.tar.partbr: 953.7 MByte
train.tar.partbs: 953.7 MByte
train.tar.partbt: 953.7 MByte
train.tar.partbu: 953.7 MByte
train.tar.partbv: 953.7 MByte
train.tar.partbw: 507.6 MByte
training_metadata.jsonl: 138.9 MByte
test.tar: 155.6 MByte
test_metadata.jsonl: 1.1 MByte

Cite this as

Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth (2019). Semantic Image-Text-Classes [Data set]. LUIS. https://doi.org/10.25835/0010577

Retrieved: 03:55 16 Jul 2026 (UTC)

Additional Info

Field	Value
Author	Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth
Maintainer	Christian Otto
Version	1.0
Last Updated	January 20, 2022, 14:14 (UTC)
Created	April 23, 2019, 10:18 (UTC)
License	Creative Commons Attribution-NonCommercial 3.0
Dataset Size	45.5 GByte