Find our datasets, including data for our shared tasks, in our repositories.
2024
Type | Title | Year | URL / DOI |
---|---|---|---|
dataset | FAIR4ML metadata schema | 2024 | https://w3id.org/fair4ml |
dataset | FoRC Shared Task Subtask I | 2024 | https://zenodo.org/records/10777735 |
dataset | FoRC4CL | 2024 | https://zenodo.org/records/10777674 |
dataset | GESIS KG | 2024 | (to come) |
dataset | GESIS MethodsHub KG | 2024 | (to come) |
dataset | GSAP KG | 2024 | (to come) |
dataset | GSAP-NER | 2024 | https://github.com/ottowg/gsap-ner/tree/emnlp_submission/data |
dataset | Hybrid Scholarly Question Answering (QA) dataset | 2024 | https://codalab.lisn.upsaclay.fr/competitions/19747 |
dataset | KG on AI & DS Methods | 2024 | (to come) |
dataset | LLMs4OL 2024 @ ISWC Challenge dataset | 2024 | https://sites.google.com/view/llms4ol |
dataset | machine-actionable Software Management Plan Ontology (maSMP Ontology) | 2024 | https://doi.org/10.5281/zenodo.7806638 |
dataset | Metadata Extraction | 2024 | (to come) |
dataset | NFDI4DS KG | 2024 | (to come) |
dataset | SOMD - SOftware Mention Detection | 2024 | https://zenodo.org/records/10974890 |
dataset | Usage guidance (aka profiles) for the machine-actionable Software Management Plan Ontology | 2024 | https://doi.org/10.5281/zenodo.10582121 |
2023
Type | Title | Year | URL / DOI |
---|---|---|---|
dataset | DBLP-QuAD: A Question Answering Dataset over the DBLP Scholarly Knowledge Graph | 2023 | https://zenodo.org/records/7643971 |
dataset | GESIS Datasearch KG | 2023 | https://data.gesis.org/gesisdatasearchkg |
dataset | Metadata crosswalks for software management plans at NFDI4DS hackathon maSMP 2023 | 2023 | https://doi.org/10.5281/zenodo.10275895 |
dataset | Open Source Large Language Models | 2023 | https://github.com/Jamarpaul/OSLLMs/tree/main/Datasets |
dataset | SciQA benchmark: Dataset and RDF dump | 2023 | https://doi.org/10.5281/zenodo.7707888 |
dataset | SOTA? Tracking the State-of-the-Art in Scholarly Publications | 2023 | https://github.com/jd-coderepos/sota/ |
dataset | TD4CLTabs Corpus | 2023 | https://zenodo.org/records/10972922 |
dataset | Towards metadata for machine learning - Crosswalk tables | 2023 | https://doi.org/10.5281/zenodo.10407320 |
2022
Type | Title | Year | URL / DOI |
---|---|---|---|
dataset | ClaimsKG | 2022 | https://data.gesis.org/claimskg/ |
dataset | dblp KG RDF (dump download) | 2022 | https://dblp.org/rdf/release/ |
dataset | dblp XML (dump download) | 2022 | https://dblp.org/xml/release/ |
dataset | SoftwareKG | 2022 | https://data.gesis.org/softwarekg/ |
dataset | TweetsCOV19KG | 2022 | https://data.gesis.org/tweetscov19/ |
dataset | TweetsKB | 2022 | https://data.gesis.org/tweetskb/ |