Información del vocabulario
Título
Vocabulary of natural language processing (POC)
Descripción
The vocabulary of natural language processing (NLP) is a bilingual (French-English) terminological resource. It is the result of transforming a hierarchical list of terms into SKOS. It includes more than 1,600 concepts, some of which have one or more definitions.
This vocabulary is based on:
- reusing, merging, unifying and appending classes and properties from existing ontologies, i.e. the Vocabulary of Linguistics, the Thesaurus of Text Mining, the Vocabulary of Signal Theory and Processing, the Artes (Aide à la rédaction de textes scientifiques) dictionary created by the joined research team of UFR EILA and CLILLAC-ARP departments of Université Paris Cité and based on Bénard (2019);
- extracting terms from subject-based corpora (Istex, ACL Anthology Reference Corpus);
- manual identification of problematic terms during an experimental post-editing session (Bawden et al., 2024).
This vocabulary can be downloaded in the following formats: CSV, SKOS-XML and JSON-LD.
This vocabulary is based on:
- reusing, merging, unifying and appending classes and properties from existing ontologies, i.e. the Vocabulary of Linguistics, the Thesaurus of Text Mining, the Vocabulary of Signal Theory and Processing, the Artes (Aide à la rédaction de textes scientifiques) dictionary created by the joined research team of UFR EILA and CLILLAC-ARP departments of Université Paris Cité and based on Bénard (2019);
- extracting terms from subject-based corpora (Istex, ACL Anthology Reference Corpus);
- manual identification of problematic terms during an experimental post-editing session (Bawden et al., 2024).
This vocabulary can be downloaded in the following formats: CSV, SKOS-XML and JSON-LD.
Creador
Institute for scientific and technical information (Inist) - CNRS/UAR76
ANR-22-CE23-0033 project MaTOS Machine Translation for Open Science - F. Yvon (dir.)
Versión
1.0
Creado
Friday, April 26, 2024 00:00:00
Última modificación
Wednesday, July 3, 2024 00:00:00
cc:attributionName
Institute for scientific and technical information (Inist) - CNRS/UAR76
cc:attributionURL
cc:license
dc:alternative
NLP vocabulary
dc:identifier
Descripción
This resource contains 1620 terminological entries.
skosmos:shortName
NLP Vocabulary
URI
http://data.loterre.fr/ark:/67375/8LP
Listado de recursos por tipo
Tipo | Recuento |
---|
Recuento de términos por lengua
Lengua | Términos preferidos | Términos alternativos | Términos ocultos |
---|