Startseite Linguistik & Semiotik Lexical syntax for Arabic SMT
Kapitel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

Lexical syntax for Arabic SMT

  • Hany Hassan
Weitere Titel anzeigen von John Benjamins Publishing Company

Abstract

The current approaches of Phrase-based Statistical Machine Translation lacks the capabilities of producing grammatical translations and handling long-range reordering. In this chapter, we presnet our work for extending Phrase-based SMT with lexical syntactic descriptions that localize global syntactic information on the word without introducing syntactic redundant ambiguity. We presente a novel model of Phrase-based SMT which integrates linguistic lexical descriptions supertags into the target language model and the target side of the translation model. Moreover, we introduce a novel Incremental Dependency-based Syntactic Language Model (IDLM) based on wide-coverage CCG incremental parsing which we integrate into a direct translation SMT system. Our proposed approach is the first to integrate full dependency parsing in SMT systems with a very attractive computational cost since it deploys the linear decoders widely used in Phrase-based SMT systems. The experimental results. show a good improvement over top-ranked state-of-the-art systems.

Abstract

The current approaches of Phrase-based Statistical Machine Translation lacks the capabilities of producing grammatical translations and handling long-range reordering. In this chapter, we presnet our work for extending Phrase-based SMT with lexical syntactic descriptions that localize global syntactic information on the word without introducing syntactic redundant ambiguity. We presente a novel model of Phrase-based SMT which integrates linguistic lexical descriptions supertags into the target language model and the target side of the translation model. Moreover, we introduce a novel Incremental Dependency-based Syntactic Language Model (IDLM) based on wide-coverage CCG incremental parsing which we integrate into a direct translation SMT system. Our proposed approach is the first to integrate full dependency parsing in SMT systems with a very attractive computational cost since it deploys the linear decoders widely used in Phrase-based SMT systems. The experimental results. show a good improvement over top-ranked state-of-the-art systems.

Heruntergeladen am 16.2.2026 von https://www.degruyterbrill.com/document/doi/10.1075/nlp.9.07has/html
Button zum nach oben scrollen