Skip to main content

Vocabulary of natural language processing (POC)

Search from vocabulary

Concept information

Preferred term

tokenization  

Definition

  • The task/process of recognizing and tagging tokens (words, punctuation marks, digits etc.) in a text. (Loterre)

Broader concept

Entry terms

  • text segmentation
  • tokenisation

In other languages

  • French

  • découpage de texte
  • segmentation de texte

URI

http://data.loterre.fr/ark:/67375/8LP-T7Q0JFBM-5

Download this concept:

RDF/XML TURTLE JSON-LD Last modified 5/27/24