This paper presents RITA (Resource for Italian Tests Assessment), a new dataset of academic exam texts written in Italian by second-language learners for obtaining the CEFR certification of proficiency level. In addition to the tests, RITA provides a variety of speech elements, annotations, and statistics, including phraseological units and their syntactic dependencies. The dataset consists of two corpora: one containing the task assignment and the other containing the texts elaborated by the learners in response to the assignment. This work describes the data collection and annotation process, structure, and statistics computed to facilitate the analysis of the phraseological text. RITA is a valuable resource for researchers and educators interested in Italian phraseology, language assessment, and natural language processing.

RITA: A Phraseological Dataset of CEFR Assignments and Exams for Italian as a Second Language

Milani, Alfredo;Santucci, Valentino
2023-01-01

Abstract

This paper presents RITA (Resource for Italian Tests Assessment), a new dataset of academic exam texts written in Italian by second-language learners for obtaining the CEFR certification of proficiency level. In addition to the tests, RITA provides a variety of speech elements, annotations, and statistics, including phraseological units and their syntactic dependencies. The dataset consists of two corpora: one containing the task assignment and the other containing the texts elaborated by the learners in response to the assignment. This work describes the data collection and annotation process, structure, and statistics computed to facilitate the analysis of the phraseological text. RITA is a valuable resource for researchers and educators interested in Italian phraseology, language assessment, and natural language processing.
2023
979-8-3503-0918-8
Natural Language Processing , NLP , Italian L2 , L2 , text complexity
File in questo prodotto:
File Dimensione Formato  
rita_editorial.pdf

non disponibili

Descrizione: Versione editoriale
Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 489.2 kB
Formato Adobe PDF
489.2 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12071/40208
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact