Home Linguistics & Semiotics Mobile-Assisted Language Learning in a Preparatory Year English Program: Enhancing Motivation and Receptive Skills
Article Open Access

Mobile-Assisted Language Learning in a Preparatory Year English Program: Enhancing Motivation and Receptive Skills

  • ORCID logo EMAIL logo , ORCID logo , ORCID logo and ORCID logo
Published/Copyright: March 10, 2026
Become an author with De Gruyter Brill

Abstract

The Education and Training Evaluation Commission in Saudi Arabia reports that many students entering university preparatory year programs face difficulties with basic English skills, particularly receptive skills in reading and listening, which are key to academic study. These difficulties are commonly associated with low learner motivation and limited exposure to relevant English practice. Although Mobile-Assisted Language Learning has gained international attention for its gamified and adaptive features, empirical evidence from Saudi preparatory year contexts remains limited. This mixed-methods pilot study considers the use of Duolingo and ReadTheory in relation to learner motivation and receptive skill development. Sixty male students at A1–A2 levels of the Common European Framework of Reference for Languages were purposively selected from a Saudi public university. The intervention was structured using the PF-4M framework, with a focus on authenticity, personalization, autonomy, and contextualization. Quantitative data were collected using a CEFR-aligned receptive skills test administered before and after the intervention and a post-intervention adapted Language Learning Motivation Scale. Qualitative data were drawn from reflective journals and weekly classroom observations documenting learner engagement. Results showed increases in receptive test scores, with mean scores rising from 8.12 to 15.17, alongside higher reported motivation levels. While both applications were well received, some learners reported challenges with comprehension and independent learning. The findings show that Mobile-Assisted Language Learning is associated with receptive skill development among beginner learners.

1 Introduction

Although English functions as a global lingua franca and as the medium of instruction for many international conferences, research activities, and publications, English as a foreign language continues to offer challenges for many Saudi nationals, according to the 2024 Pearson Report, 46 % of Saudi professionals report using English daily, yet only 23 % demonstrate confidence in their English proficiency (Pearson 2024). In the Kingdom of Saudi Arabia, the Education and Training Evaluation Commission (2023) has noted that students entering university preparatory year programs often demonstrate limited foundational English skills, particularly in receptive areas such as reading and listening, which are widely regarded as important for academic study. This gap between English exposure and academic demands becomes more apparent during the transition to higher education.

Within Saudi PYP programs, English is used as the medium of instruction; however, many students at CEFR A1–A2 levels report difficulty comprehending academic texts and lectures, a concern frequently discussed alongside lower academic performance (Al Zumor and Abdesslem 2022). A recent needs analysis conducted among Saudi PYP students classified at A1–A2 levels similarly identified limited receptive academic skills, with students reporting low confidence in reading and listening and describing their English proficiency as a constraint on academic participation. These problems were commonly attributed to limited opportunities for meaningful language engagement beyond textbook-based instruction (Alshehri 2023). Related evaluations have characterized teaching practices in Saudi preparatory programs as predominantly textbook- and lecture-oriented, with limited exposure to authentic language use and reduced learner engagement, particularly in lower-level courses (Alfehaid et al. 2024).

Across these studies, motivation is frequently discussed as a psychological dimension associated with the development of receptive skills among A1–A2 learners (Alshammari 2022). In response, this study is situated within the Pedagogical Framework for Mobile-Assisted Language Learning (PF-4M), which emphasizes authenticity, personalization, autonomy, and contextualization. The framework conceptualizes intrinsic motivation, such as enjoyment and self-determination, alongside extrinsic features, including immediate feedback, as elements commonly linked to sustained engagement in reading and listening practice. This perspective is particularly relevant in the Saudi PYP context, where disengagement from noncommunicative instructional practices has been repeatedly reported alongside lower motivation among A1–A2 learners (Alshammari 2022).

Mobile-Assisted Language Learning (MALL) refers to the use of mobile devices and applications to support language learning inside and outside the classroom. MALL environments are commonly characterized by portability, immediacy, and interactivity, features widely associated with learner engagement. Platforms such as Duolingo and ReadTheory provide structured practice, adaptive feedback, and scaffolded exposure to language input. For example, 91.8 % of Saudi EFL students rated Duolingo as engaging and enjoyable, citing its feedback systems and motivational structure (Althiyabi 2025). Experimental research has also reported higher reading and listening outcomes among A2–B1 learners using Duolingo than in control groups (Jiang et al. 2022).

Despite growing international interest in MALL, research examining its curriculum-integrated use in Saudi EFL contexts, particularly among CEFR A1–A2 learners, remains limited (Keezhatta and Omar 2019). Existing studies have largely focused on learner perceptions or the optional use of MALL tools rather than on required inclusion into institutional instruction. To address this gap, the present pilot feasibility study examines how curriculum-embedded MALL tools are associated with learner motivation, receptive skills, and perceptions among CEFR A1–A2 Saudi PYP students. By examining required rather than optional use, the study aims to generate findings closely aligned with institutional EFL practice at the beginner level.

1.1 The Present Study

This pilot study investigates the potential of using Duolingo and ReadTheory within the EPP in Saudi Arabia to support the learning of English among preparatory-year students at the CEFR A1–A2 levels under the PF-4M framework. Specifically, it aims to find out whether there is a relationship between using MALL applications and the improvement in motivational level and in reading and listening skills of students, and to find out how students perceive the learning with Duolingo and ReadTheory in the classroom environment. The following research questions shape the investigation:

  1. To what extent does MALL integration relate to increased intrinsic and extrinsic motivation among CEFR A1–A2 Saudi preparatory-year learners?

  2. To what extent do CEFR A1–A2 learners enhance their receptive language skills (reading and listening) after incorporating MALL tools?

  3. How do CEFR A1–A2 learners view the usability, engagement, and perceived effectiveness of MALL tools in the formal classroom setting?

2 Literature Review

The adoption of new technologies has transformed approaches to knowledge acquisition, progressing from distance learning to e-learning and, more recently, to mobile learning (Traxler 2018). Mobile learning (m-learning) refers to learning activities conducted on portable devices, such as smartphones and tablets, enabling multi-contextual education that is accessible anytime and anywhere, regardless of learners’ age, background, or location.

2.1 MALL Overview

MALL is a subcategory of m-learning, which refers to the use of language-learning tools enabled by the mobility of handheld devices. MALL is Computer-Assisted and Web-Based Language Learning in practice, using mobile phone technology to teach languages. This means that applications and multimedia resources for language learning stored on and accessed through handheld devices provide opportunities for language learners to practice and learn a second language in a learner-centered, autonomous, flexible, and self-paced manner. What also differentiates MALL from other learning scenarios is that learners can practice and learn anywhere, at any time, and can progress more naturally and gradually through common themes across demarcated or inter-zones that extend beyond the predefined limits of a classroom, into both formal and informal learning contexts. Students in MALL typically access learning materials outside the classroom via mobile devices, enabling instant feedback and interaction through game theory and content integration, learning management system tools, and social networking tools. All of these tools help build students’ enthusiasm, support students’ autonomy in their paths, and enhance students’ participation. These systems are especially beneficial in disadvantaged environments, as mobile devices have the highest potential and the easiest scalability of reserve resources.

2.2 Adaptive Learning and Receptive Skill Development

Systems such as ReadTheory, which adjust content difficulty in real time based on students’ performance, exemplify the latest trend in instructional practices in Saudi Arabia. Alhumsi et al. (2021) reported that, despite limited adaptability, e-learning apps increased the motivation of Saudi EFL learners and facilitated the use of reading comprehension strategies. Mohammed et al. (2023) concluded that an adaptive stretch-text system significantly outperformed the conventional method in improving reading skills, including vocabulary, cohesion, decoding, and fluency, among university students in southern Saudi Arabia. Okasha (2020) found that EFL students in Saudi Arabia were more responsive to adaptive, strategy-based reading instruction than to traditional teacher-centered reading instruction, underscoring the need for tailored instruction to foster receptive skills. While technology is increasingly used, longitudinal assessments in the Saudi Arabian context remain scarce (Al-Seghayer 2022).

2.3 Motivation in MALL Contexts

In the Saudi context, which has a sociocultural preference for teacher-centered learning (with 90 % of students in the preparatory year preferring this style and 50 % interested in tests) (Alrabai 2017), it can be difficult to implement gamification, which is inherently student-centered. Nonetheless, when linked to cultural scaffolding, it can avoid demotivation, which occurs with traditional drills, and motivate toward a test-oriented system (Alshammari 2022). One study published in 2025 tested the impact of MALL apps on Saudi EFL university students. It concluded that these apps motivated students to improve their language skills, both receptive and productive (Mansor 2025). Engagement and motivation exhibit a bidirectional relationship: sustained engagement bolsters motivation by reinforcing success experiences, while heightened motivation drives deeper engagement through increased effort and persistence. MALL nurtures this interplay via gamification (e.g., rewards for progress) and personalization (e.g., adaptive paths), as outlined in the PF-4M framework’s principles of authenticity and autonomy, ultimately supporting prolonged receptive skill practice (Wang et al. 2024; Althiyabi 2025).

2.4 Relationships and Gaps Leading to Research Questions

The relationship among MALL, motivation, and receptive skills is clarified by frameworks such as PF-4M, which leverage authenticity, personalization, autonomy, and contextualization to direct motivation toward effective learning. By boosting intrinsic motivation through engaging, self-directed activities and extrinsic motivation through feedback, MALL supports consistent practice that leads to receptive improvements – evidenced by Duolingo users at A2/B1 levels achieving reading and listening gains comparable to those achieved over several university semesters (Jiang et al. 2022). Barriers, such as limited tolerance for autonomy, exacerbate these gaps, with few studies examining how MALL-motivation links lead to structured receptive outcomes amid a cultural emphasis on exams (Alshammari 2022).

2.5 Implementation Barriers in the Saudi Educational Context

Saudi Arabia’s Vision 2030 supports the use of technology in education; however, MALL continues to face numerous obstacles. Many English teachers primarily use lectures and view mobile devices as distractions or as unsuitable for their curricula. Al-Alami and Alhamami (2024) found that many Saudi English teachers do not participate in professional development, which reduces their confidence in using technology for MALL. In government preparatory-year programs, strict curricular rules, high-stakes exams, and limited teaching autonomy make it difficult to implement learner-centered technology. Administrators often focus on memorization and test scores, which discourages the adoption of new teaching methods (Alshammari 2024). There are also practical problems. Rural and remote areas often lack reliable mobile internet or IT support, and in cities, not all students have their own devices, thereby widening the digital divide and limiting access (Al-Seghayer 2022). Many MALL apps are not designed for Saudi learners, as they lack Arabic support or do not align with Islamic values and the national English curriculum, making them less attractive and less likely to be used (Alnujaidi 2021; Alaghbary 2021).

2.6 Theoretical Consideration: The PF-4M Framework

The current study grounded its theoretical underpinnings in the PF-4M developed by Wang et al. (2024). PF-4M revolved around four major themes that underlie the integration of MALL: authenticity, personalization, autonomy, and contextualization. These themes should underlie MALL practices to ensure the quality of teaching. The PF-4M is a novel framework, but its roots extend to prior literature on MALL, which underscores the importance of scaffolding, authentic input, learner independence, and personalized learning in enhancing motivation and engagement. Personalization is evident in apps such as ReadTheory, where the difficulty level adjusts based on students’ performance. Previous MALL frameworks have been criticized for emphasizing technological aspects over pedagogical ones, which may render them unworkable in formal classrooms, particularly in the Saudi EFL context, where instruction is heavily teacher-centered and exam-focused. This is where the PF-4M seems to fill this gap by integrating mobile learning with pedagogical practice. Therefore, it is a fitting framework for investigating the impact of Duolingo and ReadTheory on the motivation and receptive language skills of CEFR A1–A2 level students. In addition, the PF-4M framework mentions the role of content for language learning, mirroring the Technological Pedagogical Content Knowledge (TPACK) model. The framework includes items such as scaffolding, interaction, authenticity, and motivation that are present both in the classroom and the mobile app (see Figure 1). Therefore, the framework was adopted for this study because it provides guidance and integrates relevant content for the study population: low-proficiency Saudi preparatory-year students studying in a strict, exam-oriented environment. This would enhance the quality of instruction and the effectiveness of MALL on the students’ motivation and receptive skills.

Figure 1: 
Instructional features supporting the PF-4M framework.
Figure 1:

Instructional features supporting the PF-4M framework.

3 Methodology

The current study employed a concurrent mixed-methods research design to investigate the effects of Mobile-Assisted Language Learning (MALL) tools, namely Duolingo and ReadTheory, on the Motivation and receptive skills (reading and listening) of Saudi CEFR A1–A2 EFL learners in a preparatory-year program. The concurrent mixed-methods design enabled the simultaneous collection of quantitative and qualitative data. Both quantitative and qualitative data were given equal priority in addressing the research questions. This permitted the triangulation of various variables, increased the reliability of the results, and deepened the understanding of students’ experiences.

The study was designed to examine the impact of two MALL tools, ReadTheory and Duolingo, on students’ motivation and receptive reading and listening skills when used as part of classroom instruction. Specifically, ReadTheory provided individualized reading-comprehension practice, whereas Duolingo offered self-paced, gamified vocabulary and listening-comprehension practice. The reasons for selecting these two tools were their popularity and ease of access. Another reason for selecting these tools was that they incorporated three characteristics of successful MALL tools: interactivity, adaptability, and design for learner engagement. To design and analyze the intervention, the study employed the PF-4M, which was proposed by Wang et al. (2024). It was necessary to employ PF-4M to ensure that digital tool integration was grounded in a sound pedagogical rationale that accounted for learners’ pedagogical profiles and needs, defined as CEFR A1–A2. The four elements of the framework, namely, authenticity, personalization, autonomy, and contextualization, were employed in the design and analysis of the intervention. The personalization framework was implemented through the real-time adaptive text responsiveness provided to students by ReadTheory. The autonomy framework was realized through self-paced practice tasks that students could select in the Duolingo app.

Authenticity was ensured through tasks that involved language use and contextualization, informed by cultural factors through prompts and reflective journals. The use of the PF-4M framework ensured that the intervention did not remain at the level of tool use; rather, it provided a pedagogically sound and replicable framework for integrating MALL into EFL foundational-level instruction as a teaching practice.

3.1 Participants and Sampling

Sixty Saudi male EFL students taking the foundation level preparatory year English course at an all-male public university in Saudi Arabia participated in the study. The participants’ English proficiency levels ranged from A1 to A2 on the CEFR as measured by the university placement test. A purposive sampling technique was employed to ensure the sample was as homogeneous as possible and confined to a single proficiency band. The inclusion criteria for the sample were that participants be registered in the course for the current semester, have English proficiency at A1–A2, and have access to a mobile device outside the classroom. Although the participants’ teachers were approached to facilitate data collection, they were not considered participants in the study. The sampling technique helped control for individual differences and enhance the study’s internal validity.

3.2 Research Tools and Measures

Four main instruments were used in this study to ensure its originality and reproducibility. A 20-item Language Learning Motivation Scale (LLMS) was developed for this study to assess students’ motivation to learn English. Although the scale items were informed by the principles of the PF-4M model, and more specifically by autonomy and personalization, they were phrased in general terms and referenced mobile learning to elicit more objective responses from students. The LLMS was informed by motivation theory and MALL practice. Students responded to the LLMS items on a 5-point Likert scale. A pilot test of the LLMS was conducted with a convenience sample of 10 EFL students whose demographic characteristics were similar to those of the study participants. The results of the pilot test were used to refine the wording of some items on the LLMS.

A Receptive Skills Test (RST) was purposefully developed for this study to measure the students’ reading and listening comprehension skills at the CEFR A1 and A2 levels. The test consisted of multiple-choice, cloze, and short-answer items. The content and face validity of the RST were established by asking two experienced senior EFL instructors to evaluate it. The RST was administered to the participants at the beginning and end of the intervention program. Participants wrote weekly reflective journals throughout the six-week intervention period to document their reflections on and perceptions of using Duolingo and ReadTheory.

Guiding prompts were provided to participants to reflect on their experiences with the two apps, perceived usefulness, difficulties associated with integrating them into the language class, and potential improvements in their reading and listening skills and motivation. The weekly reflective journals helped to capture the students’ attitudinal, emotional, and experiential changes that the RST might not detect.

A classroom observation checklist was designed for this study to assess participants’ engagement with and interaction with their mobile devices, as well as the extent of integration of the two apps in the EFL classroom. Two trained observers, the researchers, used the checklist to observe one session per week throughout the intervention period. To establish inter-rater reliability, one observer attended each session. Rather than having both observers in the same session, the observers compared their ratings across sessions.

3.3 Intervention and Timeline

The duration of the intervention was six weeks. The participants were introduced to and trained to use Duolingo and ReadTheory. Duolingo was used in class and at home for about 15 min per day to practice vocabulary and sentence structure. ReadTheory was used at home as homework assignments to practice reading comprehension.

3.4 Data Collection

Data were collected over six weeks through a series of simple yet clear steps. During the first week, students were given the RST to assess their reading and listening comprehension skills. Between weeks 2 and 5, students used Duolingo and ReadTheory in the classroom and at home as part of their regular learning tasks. During this period, students’ reflective journals were collected weekly, and researchers observed participants’ engagement with and interactions with their mobile devices. In the final week of the study, the RST was administered again, the LLMS was administered to determine the students’ motivation to learn English, and their reflective journals were collected. Throughout the data collection process, participants were assured that their anonymity and confidentiality would be protected. Informed consent for voluntary participation was obtained from all the participants. They were also informed that they could withdraw from the study at any time. All the data collection procedures complied with the ethical and protocol requirements stipulated by the IRB at the host institution.

3.5 Data Analysis

Quantitative and qualitative data were analyzed separately and integrated during interpretation. Descriptive statistics were used to describe the quantitative data collected from the RST and the LLMS. Paired-samples t-tests were conducted to compare the means of participants’ receptive skills and motivation before and after the intervention. Cohen’s d was calculated to determine the effect size of the intervention on the participants’ receptive skills and motivation. Pearson product-moment correlation was used to examine the strength and direction of association between participants’ post-intervention motivation and improvement in receptive skills, controlling for participants’ initial receptive skills proficiency. The quantitative data were analyzed using SPSS (version 27) with an alpha level of 0.05. The quantitative data analysis was complemented by a post hoc power analysis using G*Power (version 3.1) to determine whether the sample size was adequate to detect a medium effect size. A reflexive thematic analysis was used to analyze qualitative data from reflective journals and classroom observation protocols. Initial codes were generated inductively from the data and subsequently refined through several rounds of data review and revision. Inter-coder agreement was achieved on the final set of codes used for data analysis. The data collected from the classroom observation protocols were triangulated with the data collected from the reflective journals to verify and contextualize patterns and themes emerging from the data. Finally, the results of the quantitative and qualitative analyses were integrated through a joint interpretation process that aimed to interpret quantitative findings in terms of students’ perceptions and experiences and observed behaviors.

3.6 Justification and Replicability

The mixed-methods design was adopted in the current study because it enables the collection of novel, nuanced data from students’ perspectives. Combining respondent (LLMS data and diaries) and non-respondent (observational) data offers a more nuanced and authentic picture of the outcomes. Based on a theoretical framework (PF-4M), the study aims to achieve more replicability. Compared with other technocentric models, the PF-4M framework offered a pedagogically oriented framework to support the implementation of educationally sound, contextually relevant m-learning in a Saudi PYP. Each data collection method and tool was aligned to one or more of the PF-4M framework components. This provides an opportunity for other researchers to replicate the current model in other EFL low-proficiency, examination-based contexts. Moreover, the tools used, the time, and the procedures for sampling students were described in detail to enable other researchers to replicate the study in similar EFL contexts with comparable students and contexts. This is evident in the research matrix in Table 1.

Table 1:

Research matrix.

Research question Approach Sample Tool(s) Data type
To what extent does MALL integration improve intrinsic and extrinsic motivation among CEFR A1–A2 Saudi preparatory-year learners? Quantitative + Qualitative 60 Saudi EFL learners LLMS, Journals, Observations Numeric + Textual
To what extent do CEFR A1–A2 learners improve their receptive language skills (reading and listening) after integrating MALL tools? Quantitative Same Receptive Skills Test (pre-post) Numeric
How do CEFR A1–A2 learners perceive the Usability, engagement level, and effectiveness of MALL tools within the formal classroom environment? Qualitative Subset (n = 60, journaling) Reflective Journals, Observations Thematic
  1. MALL, Mobile-Assisted Language Learning; LLMS, Language Learning Motivation Scale; EFL, English as a Foreign Language.

3.7 Ethical Considerations

To ensure the rights, dignity, and welfare of the participants, this research was conducted in accordance with ethical research standards. The Institutional Academic and Research Ethics Committee reviewed the research. The Institutional Review Board (IRB) Approval Number is IRB/24 08/051 dated 24/08/2024. The IRB Approval document adheres to the Declaration of Helsinki (2013) and the Institutional policy on the responsible conduct of research. The participants were briefed about the purpose of the study, and written consent was obtained. Respondents’ confidentiality and anonymity were assured, and they retained the right to withdraw from the study at any time. The research adopted non-invasive techniques, including a voluntary anonymous questionnaire and semi-structured interviews. The collected data was stored in a locked cabinet, and the key was held solely by the researcher. No personally identifiable information was recorded or reported.

4 Results

This section reports the quantitative and qualitative findings obtained from the MALL implementation. Quantitative findings are drawn from the Language Learning Motivation Scale (LLMS), the Receptive Skills Test (RST), and structured classroom observation checklists. Qualitative findings are based on reflective learner journals and systematic classroom observations. The section presents results only; interpretation and implications are reserved for the Discussion section.

4.1 Motivation Assessment (LLMS)

Learner motivation was assessed using a post-intervention administration of the Language Learning Motivation Scale (LLMS), a 20-item instrument adapted to measure intrinsic and extrinsic motivational orientations among EFL learners. The instrument demonstrated acceptable internal consistency, with a reliability coefficient of 0.87. All 60 participants completed the LLMS, yielding a 100 % response rate. Responses were recorded on a five-point Likert scale ranging from 1 (strongly disagree) to 5 (strongly agree). Mean values greater than 3.0 were treated as indicative of a positive motivational orientation for reporting purposes.

Total LLMS scores ranged from 49 to 80, with a mean score of 64.80 (SD = 3.40). The overall mean motivation score across participants was 3.24 (SD = 0.17). The mean score for intrinsic motivation items was 3.45 (SD = 0.19), while the mean score for extrinsic motivation items was 3.02 (SD = 0.15). No floor or ceiling effects were observed in the distribution of LLMS scores. Mean scores for all individual items exceeded the neutral midpoint of the scale. A correlation analysis indicated a positive association between LLMS scores and RST scores (r = 0.42, p = 0.01). Descriptive statistics for all LLMS items are presented in Figure 2.

Figure 2: 
Mean Likert scores across the 20 LLMS items. Each item was rated from 1 (strongly disagree) to 5 (strongly agree).
Figure 2:

Mean Likert scores across the 20 LLMS items. Each item was rated from 1 (strongly disagree) to 5 (strongly agree).

4.2 Receptive Skills Assessment

Receptive language ability was measured using the Receptive Skills Test (RST), a 20-item assessment aligned with CEFR A1–A2 proficiency levels, comprising 10 reading comprehension and 10 listening comprehension items. Prior to the intervention, participants completed the RST pre-test. Pre-test scores ranged from 4 to 13, with a mean score of 8.12 (SD = 2.16) out of a possible 20 points.

Following the intervention period, the same participants completed the RST post-test. Post-test scores ranged from 11 to 19, with a mean score of 15.17 (SD = 1.79). Figure 3 presents the percentage of correct responses for each RST item at pre-test and post-test. Across all 20 items, higher percentages of correct responses were observed at the post-test. Differences in magnitude were noted across items, with some items (e.g., Items 2, 4, 7, 12, and 18) showing larger percentage differences and others (e.g., Items 10, 13, and 14) showing smaller differences. Items associated with lower pre-test accuracy, including Item 1, displayed higher post-test accuracy.

Figure 3: 
The proportion of correct responses per item on the Receptive Skills Test (RST) for pre-test and post-test. Each item is scored as correct (1) or incorrect (0).
Figure 3:

The proportion of correct responses per item on the Receptive Skills Test (RST) for pre-test and post-test. Each item is scored as correct (1) or incorrect (0).

4.3 Reflective Learner Journals

Over six weeks, a total of 480 reflective journal entries were collected from 60 participants. Thematic coding of the journal data yielded the following frequency counts: Engagement (n = 144), Improvement (n = 138), Usability (n = 108), Motivation (n = 54), Challenges (n = 54), and Recommendations (n = 36). An additional 114 entries were coded as other due to vague or nonspecific content.

Journal entries included descriptive references to learning tasks, application features, and user experiences. Frequently occurring terms related to engagement included references to gamified elements, visual design, feedback mechanisms, and task completion. Some entries described task pacing, repetition of vocabulary items, and audio speed. Other entries recorded difficulties related to understanding instructions, completing tasks independently, or navigating application interfaces. Several entries specifically referenced ReadTheory narrative passages, with learners noting familiarity with or interest in the story content. Responses to the final prompt regarding recommendations of the applications to peers were predominantly affirmative. Table 2 summarizes the frequency distribution of thematic categories identified in the journal dataset.

Table 2:

Frequency of thematic categories in reflective journals.

Thematic category Number of responses
Engagement 144
Improvement 138
Usability 108
Motivation 54
Challenge 54
Recommendation 36
Other 114

4.4 Classroom Observations

Classroom observations were conducted weekly between weeks 2 and 5 by two trained observers using a shared 10-item observation protocol (see Appendix D). Each item was rated on a three-point scale (0 = not observed, 1 = partially observed, 2 = fully observed). Mean ratings were calculated separately for Observer A and Observer B for each observation item.

Differences in mean ratings were observed across several items. Observer A reported higher mean ratings for autonomous task use, peer discussion related to application tasks, and careful completion of ReadTheory items. Observer B reported higher mean ratings for teacher-dependent behaviors, including requests for assistance and monitoring of task completion. Both observers reported identical mean ratings for learner selection of ReadTheory passages (M = 1.00). Across observation weeks, both observers reported low mean ratings of self-correction behaviors during initial task attempts. Mean ratings for all observation items are presented in Table 3.

Table 3:

Mean ratings from classroom observations (Weeks 2–5).

Observation item Observer A Observer B Difference
Students use Duolingo without the teacher’s help 1.25 0.75 0.50
Students try the ReadTheory passages again after mistakes 1.50 1.00 0.50
Students look happy or focused when using apps 1.50 1.00 0.50
Students talk to peers about Duolingo tasks 1.75 1.00 0.75
Students answer ReadTheory questions carefully 1.50 0.25 1.25
Students use app feedback to improve 1.50 1.25 0.25
Students ask the teacher about app problems 1.00 1.25 −0.25
Students finish Duolingo exercises on time 1.25 1.50 −0.25
Students choose their own ReadTheory stories 1.00 1.00 0.00
Students share app scores with friends 1.50 1.25 0.25

4.5 Mixed-Methods Integration

A convergent mixed methods design was employed, with quantitative and qualitative datasets analyzed separately and subsequently examined together using joint displays and narrative comparison procedures. Quantitative datasets included LLMS scores, pre- and post-test RST results, and correlation coefficients, whereas qualitative datasets consisted of coded journal entries and classroom observation records.

Across datasets, alignment was observed between LLMS item means, RST score distributions, journal themes, and observation frequencies. Statistical analysis of RST scores indicated a significant pre–post difference (t(59) = −12.45, p < 0.001, d = 1.85). Journal themes related to engagement, usability, and improvement co-occurred with higher LLMS item means, whereas observation records documented task interaction patterns across the intervention period. This section reports convergence patterns only; interpretive synthesis is presented in the Section 5.

5 Discussion

This section interprets the findings derived from the triangulation of quantitative and qualitative data, including LLMS scores, RST pre- and post-test results, reflective learner journals, and classroom observations. The discussion is structured around the constructs of the PF-4M. It focuses on explaining observed patterns, variations, and constraints associated with the curriculum-integrated use of MALL tools in a Saudi PYP context. Interpretations are presented cautiously, given the pilot nature of the study and the absence of a control group.

5.1 ReadTheory and Motivation: Autonomy Within the Constraints of Self-Efficacy

The LLMS findings indicated an overall positive motivational orientation, with higher mean scores for intrinsic motivation than extrinsic motivation; however, item-level variation revealed that motivational experiences were not uniform across learners. Lower mean scores on items related to persistence and confidence in independent learning suggest that autonomy was experienced unevenly, particularly among learners encountering linguistically demanding texts. From a PF-4M perspective, this pattern may be interpreted as partial alignment with the framework’s principles of personalization and autonomy. This interpretation is supported by reflective journal entries in which learners frequently described confusion, fatigue, or difficulty when engaging with challenging reading passages, as well as by classroom observation records indicating inconsistent task re-attempts on ReadTheory. Rather than indicating a limitation of the platform itself, these findings are more appropriately understood as reflecting the challenges faced by CEFR A1–A2 learners when engaging in adaptive, self-directed digital reading tasks, a pattern also reported in previous Saudi EFL research emphasizing the need for structured instructional support at lower proficiency levels (Alshammari 2022; Keezhatta and Omar 2019; Al Zumor and Abdesslem 2022).

5.2 Duolingo and Receptive Skills: Performance Patterns and Engagement Variability

The RST results showed a clear difference between pre- and post-test scores, accompanied by variability across learners and test items, which, when interpreted alongside journal reflections and observation data, suggests diverse engagement trajectories with Duolingo tasks. Learners frequently described Duolingo as accessible and enjoyable, particularly in relation to its gamified structure, immediate feedback, and multimodal presentation, perceptions that align with earlier findings among Saudi EFL learners (Althiyabi 2025). At the same time, qualitative data indicated that engagement levels fluctuated over the intervention period, with some learners reporting boredom or anxiety as tasks became repetitive or assessment-oriented. From a PF-4M standpoint, these patterns may be interpreted as highlighting the importance of instructional mediation and task variation in sustaining learner engagement, rather than attributing performance trends solely to app-based features. Similar variability in learner response to Duolingo across proficiency levels has been documented in prior research, which emphasizes that mobile applications interact with learner characteristics and classroom context rather than producing uniform outcomes (Jiang et al. 2022).

5.3 Dual-Tool Use and Task Rotation: Interpreting Engagement Patterns

The combined use of Duolingo and ReadTheory introduced variation in task modality, cognitive demand, and interactional patterns, which learners explicitly noted in their reflective journals. Comments comparing the two tools suggest that task rotation produced contrastive learning experiences perceived as less monotonous than sustained use of a single application. Observation data further documented shifts in peer interaction and task focus across sessions, which may be interpreted as contextual responses to novelty and task diversity rather than direct outcomes of tool alternation. This interpretation is consistent with prior studies emphasizing the role of task variation in maintaining engagement in mobile-assisted learning environments (Indrawan et al. 2023; Pearlin and Gandhi 2024). Within the PF-4M framework, the two tools appeared to foreground different dimensions, with ReadTheory emphasizing personalization and adaptive feedback, whereas Duolingo emphasized multimodality and gamified interaction. The alternation between tools may therefore be understood as exposing learners to complementary engagement pathways rather than producing additive or synergistic effects in a causal sense (Stockwell and Hubbard 2013).

5.4 Theoretical Implications: Adapting PF-4M for Foundational Learners

The application of the PF-4M framework in a guided Saudi PYP classroom provides insight into how its principles operate when learner autonomy is constrained by proficiency level and digital confidence. Observation data indicating frequent instructor consultation, limited independent task selection, and partial autonomy suggest that foundational EFL learners relied heavily on teacher mediation to navigate digital tasks. These patterns may be interpreted as indicating that PF-4M, which was initially conceptualized in contexts emphasizing learner self-determination, requires contextual adaptation for low-proficiency institutional settings. In particular, the findings point to the importance of distinguishing between inbuilt app-based supports and external instructional scaffolding provided by teachers and peers. Similar concerns regarding assumed levels of autonomy and digital competence have been raised in Saudi EFL research, where mobile learning frameworks have been found to require pedagogical mediation to function effectively with beginner learners (Keezhatta and Omar 2019; Alshammari 2022).

5.5 Practical and Policy Implications

From a practical perspective, the findings suggest that curriculum-integrated use of MALL may have pedagogical value when accompanied by structured instructional support and clear alignment with learning objectives. Previous research has emphasized that teacher mediation plays a central role in helping learners interpret digital feedback, remain engaged, and connect mobile tasks to curricular goals (Phetsut and Waemusa 2022; Alnujaidi 2021). In the Saudi context, where teachers have reported limited training in mobile pedagogy, professional development focused on MALL integration, data interpretation, and learner scaffolding appears particularly relevant (Alshammari and Oudah 2018). At an institutional level, considerations related to app selection, accessibility, and equitable access to devices and connectivity remain salient. Policy-level guidance on the purposeful integration of MALL within EFL curricula may therefore be necessary to ensure that mobile tools are used in ways that align with institutional objectives and learner needs (Alnujaidi 2021).

5.6 Limitations

This pilot study employed a single-group, within-subject design without a control group and therefore does not permit causal inference. The sample comprised 60 male CEFR A1–A2 learners from a single Saudi public university, which limits generalizability across gender, institutions, and regions. The intervention duration was relatively short, capturing only short-term patterns, and motivation was measured post-intervention without a baseline comparison. In addition, qualitative findings relied on self-reported data, which may be influenced by social desirability bias despite triangulation. Finally, the findings are specific to the two tools examined, Duolingo and ReadTheory, and future research should explore other platforms and employ experimental or longitudinal designs to extend the present findings.

5.7 Recommendations

Implications based on this pilot study, several initial implications for the future implementation of MALL in Saudi preparatory-year programs are proposed. However, these suggestions need to be evaluated in small-scale, randomized controlled trials to assess their effectiveness before they can be widely implemented. For example, the suggested pair of tools (ReadTheory and Duolingo) can be implemented in the classroom by rotating between them weekly to maintain students’ interest and manage their workload. Furthermore, these tools are more effective when used as in-class activities rather than as homework, thereby enhancing student engagement and aligning classroom activities with intended learning outcomes. Moreover, teachers need to be trained to monitor students’ performance using tool analytics, provide virtual support, and support students who lack motivation. Additionally, the content of these tools can be customized to accommodate Saudi students’ cultural context and reflect their daily experiences by using bilingual glossaries and topics relevant to their lives. A national assessment framework can be developed to evaluate the extent to which tools align with the curriculum, are usable, and yield educational benefits. Such a framework can help establish partnerships between the governmental and private sectors. It is also important for institutions to continually assess both in-class learning outcomes and app-based data to improve instruction continually. By undergoing a controlled pilot process, MALL can eventually be sustainably integrated into the Saudi educational system, thereby enhancing equity and evidence-based practice.

6 Conclusions

This pilot feasibility study explored the impact of using Duolingo and ReadTheory within the PF-4M framework on the motivation and reading and listening skills of CEFR A1–A2 Saudi EFL preparatory-year students learning English in a test-driven, teacher-centered learning environment. Although several initial issues with sustained engagement and adaptation to adaptive tasks arose, the combined use of gamified and individualized apps increased students’ engagement. It showed overall improvements in their reading and listening skills over six weeks. The primary implication of this study is that MALL integration can promote motivation and reading and listening skills among low-proficiency EFL learners in a resource-constrained, test-driven setting when such tools are structured and embedded in the formal curriculum. The integration of Duolingo and ReadTheory within the PF-4M framework indicated that the framework applies to the design and implementation of MALL activities in basic EFL classes. Moreover, the results suggest that combining a feedback-based tool (ReadTheory) with a gamified tool (Duolingo) can balance challenge and enjoyment. Overall, the results of this study suggest that MALL integration can promote equity and access to English education in Saudi Arabia, particularly for low-proficiency learners, who may not be targeted by MALL integration in the Saudi educational system.

6.1 Future Research Directions

There are two main directions for future research. First, the use of culturally adapted content in MALL tools (e.g., incorporating topics relevant to the Saudi culture, bilingual glossaries, and Islamic values) should be explored to promote students’ engagement and reduce resistance to using new learning tools in teacher-centered preparatory-year classes. Second, research should investigate how different teacher scaffolding strategies (directive vs. facilitative) and peer-to-peer interaction tools within MALL can promote autonomy and long-term motivation among CEFR A1–A2 EFL students. In sum, MALL tools, when purposefully integrated into the formal classroom environment with teacher mediation, can contribute to the development of autonomous and motivated language learners at the early stages of language learning.


Corresponding author: Shafiq ur Rehman, Senior Lecturer at the Foundation Program Unit, University of Doha for Science & Technology, Doha, Qatar, E-mail:

  1. Research ethics: This study strictly adhered to internationally recognized ethical research standards to safeguard participants’ privacy, dignity, and rights. Ethical approval was granted by the institutional academic and research oversight committee under IRB Approval Number IRB/24-08/051 (dated August 24, 2024). The research design complies with the principles outlined in the Declaration of Helsinki (2013) and institutional guidelines for responsible conduct of research. Participants were fully informed about the nature and purpose of the study and provided informed consent prior to participation. They were assured of the confidentiality and anonymity of their responses and were informed of their right to withdraw at any stage without any consequences. The study involved non-invasive, voluntary participation through anonymous surveys and semi-structured interviews. All collected data were securely stored and accessible only to the research team, with no personally identifiable information recorded or reported.

  2. Informed consent: Informed consent was obtained from all individuals included in this study, or their legal guardians or wards.

  3. Author contributions: S.u.R. supervised the research process and provided critical revisions. B.H. put her efforts into the instrumentation of the study and holding classroom observations. N.u.R. contributed to the study design and supported B.H. in the classroom observation. S.N.A. collected the data, led the manuscript drafting, and proofreading. All authors reviewed and approved the final manuscript.

  4. Use of Large Language Models, AI and Machine Learning Tools: Some sections of this manuscript required language enhancement and were refined using Grammarly. No generative AI tools or large language models (LLMs), including ChatGPT, were used for content creation, analysis, or interpretation. Although some figures may appear stylistically similar to those generated by ChatGPT or AI tools, however all visualizations were created using Python’s Matplotlib library.

  5. Conflict of interest: The authors state no conflict of interest.

  6. Research funding: None declared.

  7. Data availability: The datasets generated and analyzed during the current study are not publicly available due to institutional confidentiality protocols. However, they are available from the corresponding author upon reasonable request. Please note that access to the data may require formal approval from the affiliated institution before release.

References

Al Zumor, A., and H. Abdesslem. 2022. “English Medium Instruction (EMI) in Saudi Higher Education: Challenges and Prospects.” In English-Medium Instruction in Higher Education in the Middle East and North Africa, edited by S. Curle, H. Ali, and S. Scatolini, 87–108. London: Bloomsbury Academic.10.5040/9781350238572.0012Search in Google Scholar

Alaghbary, G. S. 2021. “Mobile-assisted language learning in Saudi college education: Exploring learners and teachers’ perceptions and practices.” Asian ESP Journal 17 (3.1): 8–31.Search in Google Scholar

Al-Alami, A. M., and M. Alhamami. 2024. “Saudi K-12 Teachers’ Views on the Use of Technology in English Instruction.” English Learning Innovation 5 (1): 38–52. https://doi.org/10.22219/englie.v5i1.31613.Search in Google Scholar

Al-Seghayer, K. 2022. “Saudi EFL teachers’ self-efficacy in technology-assisted language learning (TALL).” International Journal of Applied Linguistics & English Literature 11 (2): 53, http://dx.doi.org/10.7575/aiac.ijalel.v.11n.2p.53.10.7575/aiac.ijalel.v.11n.2p.53Search in Google Scholar

Alfehaid, A., N. Alkhatib, and J. Mukundan. 2024. “The Retrospective Evaluation of English Language Teaching Materials in a Preparatory Year English Language Program in Saudi Arabia.” PASAA: Journal of Language Teaching and Learning in Thailand 69: 335–69. https://doi.org/10.58776/pasaa.v69i0.956.Search in Google Scholar

Alhumsi, M. H., R. A. Alshaye, and K. K. Sendi. 2021. “The Effect of E-Learning Sessions on the Development of Reading Comprehension: A Case of EFL Students’ Perceptions at Saudi Electronic University.” Journal of Education and E-Learning Research 8 (4): 431–9. https://doi.org/10.20448/journal.509.2021.84.431.439.Search in Google Scholar

Alnujaidi, S. 2021. “Adoption of Mobile-Assisted Language Learning (MALL) in Saudi Arabian EFL Classrooms.” Journal of Language Teaching and Research 12 (2): 312–22. https://doi.org/10.17507/jltr.12.2.13.Search in Google Scholar

Alrabai, F. 2017. “Exploring the Unknown: The Autonomy of Saudi EFL Learners.” English Language Teaching 10 (5): 222–33. https://doi.org/10.5539/elt.v10n5p222.Search in Google Scholar

Alshammari, R. M. 2022. “Investigating the Low English Proficiency of Saudi EFL Learners.” Arab World English Journal 13 (1): 129–44. https://doi.org/10.24093/awej/vol13no1.9.Search in Google Scholar

Alshammari, R. M. 2024. Saudi K-12 Teachers’ Views on the Use of Technology in English Instruction. ResearchGate.Search in Google Scholar

Alshammari, S. R., and F. Oudah. 2018. “Saudi EFL Teacher Training Programs: Teachers’ Perceptions and Needs.” Theory and Practice in Language Studies 8 (11): 1411–21. https://doi.org/10.17507/tpls.8.11.04.Search in Google Scholar

Alshehri, A. 2023. “The Impact of Vocabulary Size on the Receptive Skills of Saudi EFL Learners.” World Journal of English Language 13 (7): 108–20. https://doi.org/10.5430/wjel.v13n7p108.Search in Google Scholar

Althiyabi, R. F. A. 2025. “Saudi EFL Students’ Perceptions of Using the Duolingo Application in Learning English.” Journal of English for Academic and Healthcare Studies 42. Advance online publication. https://doi.org/10.33193/JEAHS.42.2025.592.Search in Google Scholar

Education and Training Evaluation Commission. 2023. Overview of Qiyas English Language Tests. https://www.etec.gov.sa/en/qiyas.Search in Google Scholar

Indrawan, F., P. E. Daristin, and E. N. Laili. 2023. “The Influence of the Duolingo Application on EFL Classroom Students’ Motivation in Learning English.” Journal of English Language and Pedagogy 6 (3): 626–33. https://doi.org/10.24093/awej/call.7.1.Search in Google Scholar

Jiang, X., L. Portnoff, L. Plonsky, and B. Pajak. 2022. Developing Reading and Listening Proficiency with Duolingo: A Cross-Sectional Analysis (Duolingo Research Report DRR-22-11). Duolingo. https://duolingo-papers.s3.amazonaws.com/reports/Duolingo_whitepaper_language_read_listen_A2_to_B1_2022.pdf.Search in Google Scholar

Keezhatta, S., and A. A. M. Omar. 2019. “Enhancing Reading Skills for Saudi Secondary School Students Through Mobile-Assisted Language Learning (MALL): An Experimental Study.” International Journal of English Linguistics 9 (1): 437–47. https://doi.org/10.5539/ijel.v9n1p437.Search in Google Scholar

Mansor, A. H. 2025. “Teachers’ perceptions of using MALL in teaching vocabulary to young EFL learners in Saudi Arabia.” Theory & Practice in Language Studies (TPLS) 15 (7): 2299–307, https://doi.org/10.17507/tpls.1507.22.Search in Google Scholar

Mohammed, E. A. H. A., M. E. El Nagar, and G. Sidky. 2023. “The Effectiveness of Designing an Adaptive Learning System Based on Stretch Text Content in Developing English Reading Skills of EFL University Students in K.S.A. Dirasat.” Journal of Educational and Social Studies 26 (12): 1–33. https://doi.org/10.21608/jsu.2020.169916.Search in Google Scholar

Okasha, M. A. 2020. “Using Strategic Reading Techniques for Improving EFL Reading Skills.” Arab World English Journal 11 (2): 311–22. https://doi.org/10.24093/awej/vol11no2.22.Search in Google Scholar

Pearlin, E., and S. M. G. Gandhi. 2024. “Impact of Read Theory in Mobile-Assisted Language Learning on Engineering First-Year Students’ Reading Comprehension Using Bi-LSTM.” International Journal of Advanced Computer Science and Applications 15 (9): 688–97. https://doi.org/10.14569/IJACSA.2024.015.09.069.Search in Google Scholar

Pearson. 2024. Mastering English in Saudi Arabia: How English Empowers Your Tomorrow. Pearson. https://www.pearson.com/content/dam/one-dot-com/one-dot-com/pearson-languages/en-gb/pdfs/gse/gse-resources/gse-research-reports/saudi-arabia/gse-research-saudi-arabia-report-en.pdf.Search in Google Scholar

Phetsut, P., and Z. Waemusa. 2022. “Effectiveness of Mobile-Assisted Language Learning (MALL)-Based Intervention on Developing Thai EFL Learners’ Oral Accuracy.” International Journal of Technology in Education 5 (4): 571–85. https://doi.org/10.46328/ijte.271.Search in Google Scholar

Stockwell, G., and P. Hubbard. 2013. Some Emerging Principles for Mobile-Assisted Language Learning (TIRF Report). Teachers of English to Speakers of Other Languages.Search in Google Scholar

Traxler, J. 2018. “Learning with Mobiles in Developing Countries: Technology, Language, and Literacy.” International Journal of Mobile and Blended Learning 10 (1): 1–12. https://doi.org/10.4018/IJMBL.2018010101.Search in Google Scholar

Wang, X., Y. Ye, and D. Bozdoğan. 2024. Pedagogical Framework for MALL (PF4M). https://doi.org/10.13140/RG.2.2.23456.78901.Search in Google Scholar

Received: 2025-06-26
Accepted: 2026-02-07
Published Online: 2026-03-10

© 2026 the author(s), published by De Gruyter on behalf of Chongqing University, China

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 20.3.2026 from https://www.degruyterbrill.com/document/doi/10.1515/dsll-2025-0013/html
Scroll to top button