Investigating the scopes of textual metrics for learner level discrimination and learner analytics
-
Thomas Gaillat
Abstract
This chapter investigates the linguistic interpretation of complexity metrics in L2 proficiency assessment. By analysing 84 formulas of metrics linked to lexical diversity, readability and syntactic complexity, we identify a taxonomy of their underlying linguistic scopes. These metrics are classified according to text, sentence, clause, phrase and word scopes with attributes and methods. Homogeneity of scopes was evaluated by applying a mixed clustering PCA approach to metrics computed for 328 L2 texts. Discriminative power was evaluated with a random forest approach on the same dataset including the CEFR levels. Results show that metrics are diversely clustered but they also suggest in-cluster homogeneity. The CEFR classification shows mixed results suggesting that diversity, repetition and size in word and text scopes are significant.
Abstract
This chapter investigates the linguistic interpretation of complexity metrics in L2 proficiency assessment. By analysing 84 formulas of metrics linked to lexical diversity, readability and syntactic complexity, we identify a taxonomy of their underlying linguistic scopes. These metrics are classified according to text, sentence, clause, phrase and word scopes with attributes and methods. Homogeneity of scopes was evaluated by applying a mixed clustering PCA approach to metrics computed for 328 L2 texts. Discriminative power was evaluated with a random forest approach on the same dataset including the CEFR levels. Results show that metrics are diversely clustered but they also suggest in-cluster homogeneity. The CEFR classification shows mixed results suggesting that diversity, repetition and size in word and text scopes are significant.
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- Complexity, accuracy and fluency in learner corpus research 1
- Investigating the scopes of textual metrics for learner level discrimination and learner analytics 21
- Syntactic complexity measures as linguistic correlates of proficiency level in learner Russian 51
- Development of L2 writing complexity 81
- Phraseological complexity in EFL learners’ spoken production across proficiency levels 115
- Persistent errors in spoken English among Taiwanese and Czech learners at CEFR B2 and C1 137
- Measuring lexical accuracy 159
- The effect of time and dimensions of collocational relationship on phraseological accuracy 181
- Interaction between grammatical accuracy and syntactic complexity at different proficiency levels 209
- Accuracy, syntactic complexity and task type at play in examination writing 241
- Contextualizing fluency in advanced spoken learner language 273
- Exploring the use of repeats in learners’ native and interlanguage production 299
- Index 325
Kapitel in diesem Buch
- Prelim pages i
- Table of contents v
- Complexity, accuracy and fluency in learner corpus research 1
- Investigating the scopes of textual metrics for learner level discrimination and learner analytics 21
- Syntactic complexity measures as linguistic correlates of proficiency level in learner Russian 51
- Development of L2 writing complexity 81
- Phraseological complexity in EFL learners’ spoken production across proficiency levels 115
- Persistent errors in spoken English among Taiwanese and Czech learners at CEFR B2 and C1 137
- Measuring lexical accuracy 159
- The effect of time and dimensions of collocational relationship on phraseological accuracy 181
- Interaction between grammatical accuracy and syntactic complexity at different proficiency levels 209
- Accuracy, syntactic complexity and task type at play in examination writing 241
- Contextualizing fluency in advanced spoken learner language 273
- Exploring the use of repeats in learners’ native and interlanguage production 299
- Index 325