Avalingua
-
Pablo Gamallo Otero
Abstract
The objective of this article is to present an automatic tool for detecting and classifying grammatical errors in written language as well as to describe the evaluation protocol we have carried out to measure its performance on learner corpora. The tool was designed to detect and analyse the linguistic errors found in text essays, assess the writing proficiency, and propose solutions with the aim of improving the linguistic skills of students. It makes use of natural language processing and knowledge-rich linguistic resources. So far, the tool has been implemented for the Galician language. The system has been evaluated on two learner corpora reaching 91% precision and 65% recall (76% F-score) for the task of detecting different types of grammatical errors, including spelling, lexical and syntactic ones.
Abstract
The objective of this article is to present an automatic tool for detecting and classifying grammatical errors in written language as well as to describe the evaluation protocol we have carried out to measure its performance on learner corpora. The tool was designed to detect and analyse the linguistic errors found in text essays, assess the writing proficiency, and propose solutions with the aim of improving the linguistic skills of students. It makes use of natural language processing and knowledge-rich linguistic resources. So far, the tool has been implemented for the Galician language. The system has been evaluated on two learner corpora reaching 91% precision and 65% recall (76% F-score) for the task of detecting different types of grammatical errors, including spelling, lexical and syntactic ones.
Chapters in this book
- Prelim pages i
- Table of contents v
- Learner corpora in language testing and assessment 1
-
New corpus resources, tools and methods
- The Marburg Corpus of Intermediate Learner English (MILE) 13
- Avalingua 35
- Data commentary in science writing 59
- First steps in assigning proficiency to texts in a learner corpus of computer-mediated communication 85
-
Data-driven approaches to the assessment of proficiency
- The English Vocabulary Profile as a benchmark for assigning levels to learner corpus data 115
- A multidimensional analysis of learner language during story reconstruction in interviews 141
- Article use and criterial features in Spanish EFL writing 163
- Tense and aspect errors in spoken learner English 191
- Authors 217
- Index 219
Chapters in this book
- Prelim pages i
- Table of contents v
- Learner corpora in language testing and assessment 1
-
New corpus resources, tools and methods
- The Marburg Corpus of Intermediate Learner English (MILE) 13
- Avalingua 35
- Data commentary in science writing 59
- First steps in assigning proficiency to texts in a learner corpus of computer-mediated communication 85
-
Data-driven approaches to the assessment of proficiency
- The English Vocabulary Profile as a benchmark for assigning levels to learner corpus data 115
- A multidimensional analysis of learner language during story reconstruction in interviews 141
- Article use and criterial features in Spanish EFL writing 163
- Tense and aspect errors in spoken learner English 191
- Authors 217
- Index 219