Embedding Based Multilingual Atlas of Semantic Fields

Research output: Contribution to journalConference articlepeer-review

Abstract

Semantic fields(domains) are an important construct in neuroscience, linguistics, psychology, and natural language processing. However, semantic field resources typically lack scalability and are not based on language usage, but on scientific and commercial taxonomies. The present project aims to create maps of semantic fields for multiple languages constructed from Word Embeddings. The clustering process is systematically described, and preliminary results for the Spanish language are presented, showing similarities and differences compared to current classifications. The present work opens up possibilities for a usage-based word classification of semantic fields and for the generation of language atlases that allow for multilingual comparison and improve the development of the aforementioned disciplines.

Original languageEnglish
Pages (from-to)36-40
Number of pages5
JournalCEUR Workshop Proceedings
Volume3516
StatePublished - 2023
Event2023 Annual Conference of the Spanish Association for Natural Language Processing: Projects and System Demonstrations, SEPLN-PD 2023 - Jaen, Spain
Duration: 27 Sep 202329 Sep 2023

Keywords

  • Distributional Semantic Models
  • Semantic domains
  • Word Embeddings

Fingerprint

Dive into the research topics of 'Embedding Based Multilingual Atlas of Semantic Fields'. Together they form a unique fingerprint.

Cite this