Uncovering the spread of lexical innovation in Italian tweets
Spina S
2024-01-01
Abstract
This paper outlines ongoing research into lexical innovation in contemporary Italian in the context of social media. To date, the study has used a dataset of 5.32M timestamped and geotagged tweets extracted from the 2022 Italian timeline, yielding 720 emerging word forms. Here, we describe the reproducible pipeline developed to extract candidate neologisms from our dataset and introduce a custom tool to visualise the emergence and spread of candidate neologisms in the time period under investigation.
Files in this item:
File | Size | Format |
---|---|---|
AIUCD2024-proceedings.pdf (open access; type: publisher's version, PDF; licence: Creative Commons) | 1.14 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.