
Vocabulary of natural language processing (POC)


Vocabulary information


Title

Vocabulary of natural language processing (POC)

Description

The vocabulary of natural language processing (NLP) is a bilingual (French-English) terminological resource. It is the result of transforming a hierarchical list of terms into SKOS. It includes more than 1,600 concepts, some of which have one or more definitions.

This vocabulary is based on:
- reusing, merging, unifying and appending classes and properties from existing ontologies, namely the Vocabulary of Linguistics, the Thesaurus of Text Mining, the Vocabulary of Signal Theory and Processing, and the Artes (Aide à la rédaction de textes scientifiques) dictionary, created by the joint research team of the UFR EILA and CLILLAC-ARP departments of Université Paris Cité and based on Bénard (2019);
- extracting terms from subject-based corpora (Istex, ACL Anthology Reference Corpus);
- manual identification of problematic terms during an experimental post-editing session (Bawden et al., 2024).

This vocabulary can be downloaded in the following formats: CSV, SKOS-XML and JSON-LD.
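
As a rough illustration of working with the SKOS-XML download, the sketch below counts preferred, alternate and hidden labels per language using only the Python standard library. The embedded sample is hypothetical (its URIs and labels are invented for the example and are not taken from the vocabulary), and it assumes the export is RDF/XML in which labels appear as `skos:prefLabel`, `skos:altLabel` and `skos:hiddenLabel` elements carrying an `xml:lang` attribute; the actual file may be serialized differently.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical two-concept sample in SKOS RDF/XML.
# The URIs and labels are illustrative only, not copied from the real export.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://example.org/concept/1">
    <skos:prefLabel xml:lang="en">machine translation</skos:prefLabel>
    <skos:prefLabel xml:lang="fr">traduction automatique</skos:prefLabel>
    <skos:altLabel xml:lang="en">MT</skos:altLabel>
  </skos:Concept>
  <skos:Concept rdf:about="http://example.org/concept/2">
    <skos:prefLabel xml:lang="en">tokenization</skos:prefLabel>
    <skos:prefLabel xml:lang="fr">tokenisation</skos:prefLabel>
  </skos:Concept>
</rdf:RDF>
"""

SKOS_NS = "http://www.w3.org/2004/02/skos/core#"
# ElementTree exposes xml:lang under the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"
LABEL_KINDS = ("prefLabel", "altLabel", "hiddenLabel")


def count_labels(xml_text):
    """Return a Counter keyed by (language, label kind)."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for kind in LABEL_KINDS:
        # iter() walks the whole tree, so this works whether labels sit
        # under skos:Concept or under a generic rdf:Description element.
        for element in root.iter(f"{{{SKOS_NS}}}{kind}"):
            language = element.get(XML_LANG, "und")  # "und" = no language tag
            counts[(language, kind)] += 1
    return counts


if __name__ == "__main__":
    for (language, kind), n in sorted(count_labels(SAMPLE).items()):
        print(language, kind, n)
```

To try this on the downloaded file, read its contents into a string and pass it to `count_labels` in place of `SAMPLE`. For anything beyond simple counting, a dedicated RDF library would be a more robust choice than element-level XML parsing.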

Creator

Institute for scientific and technical information (Inist) - CNRS/UAR76
ANR-22-CE23-0033 project MaTOS Machine Translation for Open Science - F. Yvon (dir.)

Version

1.0

Created

Friday, April 26, 2024 00:00:00

Last modified

Wednesday, July 3, 2024 00:00:00

Attribution Name

Institute for scientific and technical information (Inist) - CNRS/UAR76

cc:attributionURL

dc:alternative

NLP vocabulary

Description

This resource contains 1620 terminological entries.

skosmos:shortName

NLP Vocabulary

URI

http://data.loterre.fr/ark:/67375/8LP
