Vocabulary description
Title
Vocabulary of natural language processing (POC)
Description
The vocabulary of natural language processing (NLP) is a bilingual (French-English) terminological resource. It is the result of transforming a hierarchical list of terms into SKOS. It includes more than 1,600 concepts, some of which have one or more definitions.
This vocabulary is based on:
- reusing, merging, unifying and extending classes and properties from existing resources, namely the Vocabulary of Linguistics, the Thesaurus of Text Mining, the Vocabulary of Signal Theory and Processing, and the Artes (Aide à la rédaction de textes scientifiques) dictionary, which was created by the joint research team of the UFR EILA and CLILLAC-ARP departments of Université Paris Cité and is based on Bénard (2019);
- extracting terms from subject-based corpora (Istex, ACL Anthology Reference Corpus);
- manually identifying problematic terms during an experimental post-editing session (Bawden et al., 2024).
This vocabulary can be downloaded in the following formats: CSV, SKOS-XML and JSON-LD.
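To make the idea of "transforming a hierarchical list of terms into SKOS" more concrete, here is a minimal sketch in Python. It is not the conversion pipeline actually used for this vocabulary, which the description does not detail. The input layout (two spaces of indentation per level, `English | French` on each line), the `http://example.org/nlp/` base URI, the `c1`, `c2`, ... identifiers and the sample terms are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Concept:
    uri: str
    labels: Dict[str, str]            # language tag -> preferred label
    broader: Optional[str] = None     # URI of the parent concept, if any
    narrower: List[str] = field(default_factory=list)


def hierarchy_to_skos(lines, base_uri):
    """Turn an indented 'english | french' term list into SKOS-like concepts.

    Assumption of this sketch: two spaces of indentation = one level.
    """
    concepts = {}
    stack = []  # stack[d] holds the URI of the latest concept seen at depth d
    for n, raw in enumerate(lines, start=1):
        if not raw.strip():
            continue
        indent = len(raw) - len(raw.lstrip(" "))
        # Clamp the depth so a line can be at most one level below its predecessor.
        depth = min(indent // 2, len(stack))
        en, fr = (part.strip() for part in raw.split("|"))
        uri = f"{base_uri}c{n}"  # hypothetical identifier scheme
        parent = stack[depth - 1] if depth > 0 else None
        concepts[uri] = Concept(uri, {"en": en, "fr": fr}, broader=parent)
        if parent is not None:
            concepts[parent].narrower.append(uri)
        del stack[depth:]
        stack.append(uri)
    return concepts


def to_triples(concepts):
    """Yield (subject, predicate, object) tuples using SKOS property names."""
    for c in concepts.values():
        yield (c.uri, "rdf:type", "skos:Concept")
        for lang, label in c.labels.items():
            yield (c.uri, "skos:prefLabel", f'"{label}"@{lang}')
        if c.broader is not None:
            yield (c.uri, "skos:broader", c.broader)
        for child in c.narrower:
            yield (c.uri, "skos:narrower", child)


# Illustrative input; these lines are not taken from the actual source list.
sample = [
    "machine translation | traduction automatique",
    "  neural machine translation | traduction automatique neuronale",
    "parsing | analyse syntaxique",
]

for triple in to_triples(hierarchy_to_skos(sample, "http://example.org/nlp/")):
    print(triple)
```

A real conversion would more likely rely on an RDF library to serialise the result as SKOS-XML or JSON-LD; plain tuples are used here only to keep the sketch free of dependencies.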
Creator
Institute for scientific and technical information (Inist) - CNRS/UAR76
ANR-22-CE23-0033 project MaTOS Machine Translation for Open Science - F. Yvon (dir.)
Version
1.0
Creation date
Friday, April 26, 2024 00:00:00
Last modification date
Wednesday, July 3, 2024 00:00:00
Attribution name
Institute for scientific and technical information (Inist) - CNRS/UAR76
cc:attributionURL
dc:alternative
NLP vocabulary
Identifier
Description
This resource contains 1620 terminological entries.
skosmos:shortName
NLP Vocabulary
URI
http://data.loterre.fr/ark:/67375/8LP