Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus

September 23, 2025
Authors: Chiara Alzetta, Serena Auriemma, Alessandro Bondielli, Luca Dini, Chiara Fazzone, Alessio Miaschi, Martina Miliani, Marta Sartor
cs.AI

Abstract

Over the past decade, Computational Linguistics (CL) and Natural Language Processing (NLP) have evolved rapidly, especially with the advent of Transformer-based Large Language Models (LLMs). This shift has transformed research goals and priorities, from Lexical and Semantic Resources to Language Modelling and Multimodality. In this study, we track the research trends of the Italian CL and NLP community through an analysis of the contributions to CLiC-it, arguably the leading Italian conference in the field. We compile the proceedings from the first 10 editions of the CLiC-it conference (from 2014 to 2024) into the CLiC-it Corpus and provide a comprehensive analysis of both its metadata (author provenance, gender, affiliations, and more) and the content of the papers themselves, which address various topics. Our goal is to provide the Italian and international research communities with valuable insights into emerging trends and key developments over time, supporting informed decisions and future directions in the field.