Abstract
This study investigates the interplay between temporal fluency measures, self-assessment, and language proficiency scores in novice- to intermediate- level language learners of Spanish and French. Analyzing data from 163 participants, the research employs both traditional linear regression and advanced XGBoost machine learning models. Findings demonstrate a moderate positive correlation between self-assessment and Oral Proficiency Interview by Computer (OPIc) scores, underscoring the dependable self-awareness of learners. Notably, XGBoost performs as well as linear regression in predicting OPIc scores and has more potential, underlining the efficacy of advanced methodologies. The study identifies Mean Length of Utterance (MLU) as a crucial predictor, highlighting specific temporal fluency measures' significance in determining proficiency. These findings contribute to language assessment practices, advocating for the integration of machine learning for enhanced precision in predicting language proficiency and informing tailored instructional approaches.
Degree
MA
College and Department
Humanities; Center for Language Studies
Rights
https://lib.byu.edu/about/copyright/
BYU ScholarsArchive Citation
Erickson, Ethan D., "Predicting Speaking Proficiency with Fluency Features Using Machine Learning" (2023). Theses and Dissertations. 10239.
https://scholarsarchive.byu.edu/etd/10239
Date Submitted
2023-12-18
Document Type
Thesis
Handle
http://hdl.lib.byu.edu/1877/etd13077
Keywords
machine learning, oral proficiency, OPIc, self-assessment, temporal fluency
Language
english