Classification of Text Writing Proficiency of L2 Learners

Santucci, Valentino
2023-01-01

Abstract

In this study, we present a novel system for the automatic classification of text complexity in the Italian language, focusing on the phraseological dimension. This quantitative assessment of text complexity is crucial for various applications, including text readability measurement, text simplification, and support for educators during evaluation processes. We use a dataset comprising texts written by Italian L2 learners and classified according to the levels of the Common European Framework of Reference for Languages. The dataset texts serve as a basis for calculating phraseological features, which are then used as input for multiple machine-learning classifiers to compare their performance in predicting proficiency levels. Our experimental results demonstrate that the proposed framework effectively harnesses phraseological complexity features to achieve high classification accuracy in determining proficiency levels.
Year: 2023
ISBN: 978-3-031-37104-2
ISBN: 978-3-031-37105-9
Keywords: Text Complexity, Natural Language Processing, Text Classification
Files in this item:
File Size Format
proceedings_p73-p86.pdf

not available

Description: Publisher's Version
Type: Publisher's Version (PDF)
License: Publisher's copyright
Size 514.86 kB
Format Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.12071/36748