Abstract
Semantic fields(domains) are an important construct in neuroscience, linguistics, psychology, and natural language processing. However, semantic field resources typically lack scalability and are not based on language usage, but on scientific and commercial taxonomies. The present project aims to create maps of semantic fields for multiple languages constructed from Word Embeddings. The clustering process is systematically described, and preliminary results for the Spanish language are presented, showing similarities and differences compared to current classifications. The present work opens up possibilities for a usage-based word classification of semantic fields and for the generation of language atlases that allow for multilingual comparison and improve the development of the aforementioned disciplines.
Original language | English |
---|---|
Pages (from-to) | 36-40 |
Number of pages | 5 |
Journal | CEUR Workshop Proceedings |
Volume | 3516 |
State | Published - 2023 |
Event | 2023 Annual Conference of the Spanish Association for Natural Language Processing: Projects and System Demonstrations, SEPLN-PD 2023 - Jaen, Spain Duration: 27 Sep 2023 → 29 Sep 2023 |
Keywords
- Distributional Semantic Models
- Semantic domains
- Word Embeddings