-
TIB-SID: TIB Subject Indexing Dataset
The TIB Subject Indexing Dataset (TIB-SID) is a bilingual benchmark for extreme multi-label text classification (XMTC) over real library... -
Second NFDI4Chem User Survey
NFDI4Chem Online Survey 2023 Dataset The NFDI4Chem consortium regulary conducts user survey to monitor the state of research data management in Chemistry. The dataset contains... -
YESciEval Corpus
YESciEval is a benchmark dataset for evaluating the robustness of Large Language Models (LLMs) as evaluators in scientific question answering (scienceQ&A).... -
ORKG Properties and LLM-Generated Research Dimensions Evaluation Dataset
This dataset contains a collection of 103 research comparisons from the Open Research Knowledge Graph (ORKG) with annotated properties and corresponding research dimensions... -
Integration of Systematic Review Services with a Scholarly Knowledge Graph (Data)
Supplementary data for the MSc thesis Schiepanski, T. (2023). Integration of Systematic Review Services with a Scholarly Knowledge Graph. Leibniz University Hannover. -
MM_Claims Dataset
This dataset is introduced by the paper "MM-Claims: A Dataset for Multimodal Claim Detection in Social Media" If you use this dataset in your work, please cite:... -
ORKG Similar Papers Recommendation Service Evaluation Dataset
This dataset was created to compare and evaluate the Semantic Scholar recommendation service and Open Research Knowledge Graph (ORKG) similar papers recommendation service based... -
Evaluating SQuAD-based Question Answering for the Open Research Knowledge...
This dataset is part of the bachelor thesis "Evaluating SQuAD-based Question Answering for the Open Research Knowledge Graph Completion". It was created for the finetuning of... -
CS-NER
Computer Science Named Entity Recognition in the Open Research Knowledge Graph 1) About This work proposes a standardized CS-NER task by defining a set of seven... -
SaL - Dataset
If you use our data please cite this submission: @inproceedings{DBLP:conf/chiir/OttoRPGH0HHDHKE22, author = {Christian Otto and Markus Rokicki and Georg Pardi and Wolfgang Gritz... -
Contributions Similarity in the Open Research Knowledge Graph
This evaluation set has been created for evaluating a content-based recommender system in the context of the Open Research Knowledge Graph (ORKG). The recommender system accepts... -
Templates Recommendation in the Open Research Knowledge Graph
This dataset has been created for implementing a content-based recommender system in the context of the Open Research Knowledge Graph (ORKG). The recommender system accepts... -
STEM-NER-60k
A Large-scale Dataset of STEM Science as PROCESS, METHOD, MATERIAL, and DATA Named Entities This repository hosts data as a follow-up study to the following publications... -
TamperedNews & News400 (IJMIR'21 Update)
Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency This repository contains the TamperedNews and News400 datasets... -
ORKG DILS2018 use case dataset
DILS 2019 use-case dataset, collected via subject matter experts to represent DILS 2019 papers as a machine readable graph model -
NLPContributionGraph Trial Dataset
An Annotation Scheme for Machine Reading of Scholarly Contributions in Natural Language Processing Literature This dataset is the result of a pilot annotation exercise to... -
TamperedNews Dataset
Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency This repository contains the TamperedNews dataset introduced in the paper:... -
Semantic Image-Text-Classes
This dataset is introduced by the paper "Understanding, Categorizing and Predicting Semantic Image-Text Relations". If you are using this dataset it in your work, please cite:... -
SlideImages
Please note: this archive requires support for dangling symlinks, which excludes the Windows operating system. To use this dataset, you will need to download the MS COCO 2017... -
A Neural Approach for Text Extraction from Scholarly Figures
A Neural Approach for Text Extraction from Scholarly Figures This is the readme for the supplemental data for our ICDAR 2019 paper. You can read our paper via IEEE here:...