Startseite Chapter 5. Evaluating a bracketing protocol for multiword terms
Kapitel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Chapter 5. Evaluating a bracketing protocol for multiword terms

  • Pilar León-Araúz und Melania Cabezas-García
Weitere Titel anzeigen von John Benjamins Publishing Company

Abstract

Multiword terms (MWTs) are frequently used to encapsulate and convey meaning in scientific and technical texts. However, they can also make these texts difficult to understand because the relations between constituents are not transparent. When MWTs have more than two constituents, a dependency analysis (bracketing) is often necessary to facilitate their interpretation. NLP has proposed various models to automatize bracketing operations, but none has been entirely satisfactory. This paper presents a protocol that combines various models and applies it to a set of three-constituent MWTs in order to: (i) sort rules by their disambiguation potential, based on their likelihood of retrieving results from any corpus and their ability to solve bracketing; and (ii) ascertain the influence of corpus size and type in the results obtained.

Abstract

Multiword terms (MWTs) are frequently used to encapsulate and convey meaning in scientific and technical texts. However, they can also make these texts difficult to understand because the relations between constituents are not transparent. When MWTs have more than two constituents, a dependency analysis (bracketing) is often necessary to facilitate their interpretation. NLP has proposed various models to automatize bracketing operations, but none has been entirely satisfactory. This paper presents a protocol that combines various models and applies it to a set of three-constituent MWTs in order to: (i) sort rules by their disambiguation potential, based on their likelihood of retrieving results from any corpus and their ability to solve bracketing; and (ii) ascertain the influence of corpus size and type in the results obtained.

Heruntergeladen am 20.9.2025 von https://www.degruyterbrill.com/document/doi/10.1075/cilt.366.05leo/html?lang=de
Button zum nach oben scrollen