Linguistic-cultural Heritage & Digital Humanities

The project focuses on the following research areas:

  • The study of linguistic-cultural contact phenomena between the East and the West. Through the analysis of different types of texts, both from the past and from the contemporary age, we will set up digital corpora in order to allow consultation and further study.

  • The study of language for specific purposes – e.g. trade, law, tourism, politics, journalism and international relations – thanks to the creation of textual corpora for the study of terminology and for speech analysis, also with the purpose of preparing glossaries, terminology guides and other texts necessary for teaching and for experts of the specific field, as well as for the common user.

  • The study and conservation of minority languages, with reference to the German-speaking areas of Northern Italy (especially in the Veneto area), through the design of a software tool and a platform that makes data elicitation possible through innovative crowdsourcing techniques.

Team Leader


This project aims at creating a corpus of texts belonging to various genres (e.g. legal texts), coming from different Chinese-speaking territorial entities (e.g. Mainland China, Taiwan, Hong Kong, Macao, Singapore) with a focus on their foreign relations.
Qualitative and quantitative analyses will be carried out with the assistance of digital tools.
We also intend to start collaborations with scholars affiliated to other institutions.

Expected results
Creating a corpus, analysing it and carrying out dissemination activities of our research findings.


Fixed-term Researchers

External collaborators

Tanina Zappone

(University of Turin)

This project aims at developing tools for the dissemination of specialized Chinese culture as well as for the training and promotion of research and knowledge transmission. This is done with an eye to making such tools accessible and potentially usable by an increasingly wide audience, including the general public.

Expected results
Scientific and specialised materials, e.g. articles, essays and glossaries for various purposes, including scientific research, divulgation and teaching activities.


Fixed-term Researchers

TaLitE (Traveling and Living in the East) studies not only tourist texts/traveling literature including diaries and guides, published in English but focusing on China. The study is based on printed materials published between the end of the 19th century and the beginning of the 20th century, as well as digital materials pertaining to the contemporary era.

The study is aimed at unveiling the evolution of China perception by western travellers, its stereotypes and the characteristics of the text types examined even from a diachronic perspective.

The methodology draws on corpus linguistic tools for both quantitative and qualitative analyses, and includes keywords, collocations as well as discourse analysis.

Expected Results
The results of the analysis will enable us to identify how the perception of “the other” varied in the last century. A corpus will be created and the research results will be published in scientific journals.


Fixed-term collaborators

contract, 01.2021-04.2021

Jessica Jane Nocella

intern, 11/2020-12/2020

Sofia Giardini

intern, 03/2021-04/2021

Marisa Lettiero

intern, 10/2021-02/2022

Veronica Barbiroli

intern, 02/2022-05/2022

Amina Aarab

This line of research within the Excellence Initiative aims at studying the cultural and linguistic contacts between the East and the West, that is, between France and Spanish-speaking countries on the one hand, and the East, particularly China, on the other.
The research is based on the construction of a corpus of texts written between the second half of the 19th and the beginning of the 20th century. This time frame coincides with a historical period of special significance for the cultural, commercial and political relationships between those two macro areas.
The corpus dataset includes different text types, such as press articles and international politics essays. The construction of the corpus has undergone various phases, which include the selection and classification of data, their processing and storing, as well as the subsequent tagging and analysis of the corpus.

The main objective of the research is to build the corpus and to analyse some linguistic aspects that emerge from the contacts examined along with the study of the perception of China in France and Spanish-speaking countries in a contrastive perspective.

Expected results
The online platform will allow to query the corpus and will offer a representation of the acquired data.


External collaborators

09/2021 - 12/2021

María de los Ángeles Sidrach de Cardona López

(Universidad de Murcia)

Silvia Domenica Zollo

(Università di Napoli Parthenope)

Fixed-term collaborators


Ángela Luque Giráldez


Barbara Tognella


Lorenza Pavarotti

intern, 03/2021 - 09/2021

Julia Poyato Montes

intern, 06/2021 - 08/2021

Elena Cabrero

intern, 07/2021 - 09/2021

Valeria Grancini

intern, 05/2022 - 10/2022

José Santos Hernández Justo

intern, 06/2022 - 09/2022

Emilio Blanco González

The VinKo project consists of an online platform with questionnaires on morphology, syntax and phonology and is aimed at collecting linguistic data from Romance and Germanic non-standard speech varieties in Trentino/South Tyrol and Veneto (and some language islands in Friuli). VinKo makes use of online crowd-sourcing in order to reach as many participants as possible and to collect data from a wide range of localities across the regions. Participants can contribute to the project by registering on the VinKo website and recording their own responses using the built-in microphone of the platform. There is a variety of tasks on the page - e.g., translations from Italian/German into their local speech variety, the description of pictures and conversations, the pronunciation of single-word dialect items - which the participants are asked to complete. The recordings are collected and stored in the online database which serves as a corpus for both the researchers and the speech community members of the involved linguistic varieties. Furthermore, a selection of the recordings is visualized on the open-access section of the webpage through an interactive map (GIS), which can be used by participants and anyone interested in exploring the collected data and learning more about the multilingual environment of the Trentino/South Tyrol and Veneto regions. The full corpus can be downloaded from the repository of the Eurac Research Clarin Centre (ERCC):

VinKo (Varieties in Contact) Corpus:

The project has two main objectives. The first is to provide a detailed description of linguistic microvariation in the area and the linguistic influences existing in phonology, morphology and syntax between the Germanic and Romance varieties, and to formulate generalizations which are valid for the general theory of language contact. The second objective is to increase the knowledge of widespread multilingualism in the area and the awareness of local languages which are a crucial part of the cultural heritage as it merits safeguarding and refinement. In this respect, an important integration is constituted by the VinKiamo subproject, a PCTO activity developed in collaboration with the Regional School Office for Veneto, in which the students from upper secondary schools are trained and act as research assistants within their local communities, also promoting the intergenerational dialogue with elderly speakers of the local speech varieties.


The research is realized in collaboration with researchers at the Universities of Trento and Bolzano.

Project page:


Expected result
The analysis of the data will serve to confirm what is already known about language contact and its effects on language change and to extend the existing generalizations to other varieties which up until now have been less-studied from a contact point of view. The promotion of the open-access archive of the speech recordings will contribute to more widespread knowledge of the linguistic and cultural wealth of the regions. Previous studies have demonstrated that digital resources have substantially contributed to preserving and revitalizing endangered languages.


Fixed-term Researchers

External collaborators

Fixed-term collaborators

01/2021 – 12/2021

Marta Tagliani

09/2020 – 02/2021

Francesca Bussola

PolDisc (Political Discourse) focuses on political discourse and is articulated on the contrastive study of the language used in the USA, the UK and in Italy from a text linguistic and semantico-pragmatic perspective, including the lexicon employed in public speeches and in social media (Facebook, Twitter, Instagram) posts, with special reference to populist discourse at an international level.

A corpus-based methodology is used, while from a theoretical point of view the project is based on the principles of text linguistics (Beaugrande and Dressler 1981), Critical Discourse Analysis (Van Dijk 1993) and the discourse-historical approach (Wodak 2015).

The goal of the research is to shed light on the discursive strategies of both right-wing and left-wing populist discourse from a contrastive perspective and through multiple semiotic systems.

Expected Results
The expected results include the creation of a corpus and the study of the phenomena analysed in scientific publications.


Fixed-term collaborators

contract, 10.2020-01.2021

Virginia Zorzi

research grant holder, 2.9.2019 – 2.3.2020

Sara Corrizzato

(University of Verona)

The InterDiplo project focuses on the language used in spoken interactions by diplomats, ambassadors, and international relations, a topic which has hitherto attracted little attention.

A key aim of the project is to compile a corpus (the InterDiplo Corpus), which comprises broadcast interviews and debates where diplomats and international operators are interviewed in English by journalists of differing cultural backgrounds. This will ensure the suitability of the corpus for both sociolinguistic and intercultural studies.

The topics of the interviews and debates cover a variety of subject fields – including health, politics, society, and the environment.

The corpus is being tagged for metadata, parts of speech, pragmatic and discursive elements, in order to analyse lexico-grammatical, syntactic, semantic and pragmatic characteristics as well as the discursive strategies of the interlocutors.

As part of the project a pilot corpus (InterDiploCovid-19 corpus) is also currently under development, the main aim of which is to pilot both the tagging system and our research hypotheses.

Another key aspect of the project is that a joint research project has also been set up between the Verona team and the University of Konstanz (Dr Annette Hautli-Janisz).

Expected results
The data yielded by the corpus will lead to scientific publications to share with the academic community, as well as teaching materials for prospective/in-service journalists, diplomats and international relations officers.


External collaborators

Annette Hautli-Janisz

(University of Konstanz)

Fixed-term collaborators

intern (11.2020 - 01.2021), contract (01.06.2021 - 01.08.2021)

Sergio Eugenio Zanotto

intern, 02.2021 - 03.2021

Veronica Zin

intern, 15/05/2021 - 15/07/2021

Sara Maini

intern, 20/06/2021 - 20/08/2021

Stefano Scullino

intern, 07/07/2021 - 27/08/2021

Gabriel Sodini

This project aims at creating a corpus of Chinese legal texts coming from different Chinese-speaking territorial entities (e.g. Mainland China, Taiwan, Hong Kong, Macao, Singapore) and analysing them qualitatively and quantitatively by means of digital tools. The research focuses on Chinese legal language and its features, both as a language for special purposes (LSP) and as discourse.
We also intend to start collaborations with scholars affiliated to other institutions.

Expected results
Creating a corpus, analysing it and carrying out dissemination activities of our research findings.


Fixed-term Researchers

DIACOM belongs to the linguistic area of our Excellence Project, i.e. languages for specific purposes. This sub-project deals with the diachronic study of trade terminology in French and Spanish.

This study is based on the creation of a specialized corpus according to three main criteria. A chronological criterion, according to which the corpus subdivision follows three key steps for the evolution of trade on an international level: the periods 1850-1914 and 1945-1970 for both languages; as regards the recent history of trade, the period 1990-2018 for Spanish and 1985-2020 for French. We have also adopted an extra-linguistic criterion, which allows the trade field to be classified into sub-fields, and a textual criterion, on the basis of which texts of different types are collected and filed. The implementation of the corpus is followed by the phases of filing, pre-editing, archiving, tagging and corpus analysis, with the subsequent creation of accessible resources for our online platform.

Taking the corpus as a reference point, the main objective of DIACOM is the analysis of trading terminology in the two languages, both in their evolution across time and in a specific historical moment. For example, the phenomena of phraseology, definitions and the diachronic evolution of key terms in the sector are examined and results are published in scientific publications. The corpus will also represent a teaching resource available to both UNIVR students and any external users.

Expected results
Software will be used for the automatic extraction of terminological data and for the processing and display of data on the online platform. This platform will allow access to concordances and contexts extracted from the corpus and will host a database of terms in the form of a network.


Fixed-term Researchers

External collaborators

Patrick Drouin

(Université de Montréal)

Silvia Domenica Zollo

(Università di Napoli Parthenope)

Fixed-term collaborators


Klara Dankova

(University of Verona)

intern, 12/2020 – 05/2021

Martina Bellomo

intern, 09-11/2020

Patricia Díaz Muñoz

intern, 08-11/2020

Andrea Calpe Álvarez

intern, 02-03/2020

Cristina Manzano Carlos

intern, 09/2019 – 01/2020

Laura Falcón Gil

intern, 09-11/2019

Josh Alushani

intern, 10/2018 – 03/2019

Pablo D. Montes González

intern, 03/2021 - 09/2021

Julia Poyato Montes

intern, 06/2021 - 08/2021

Elena Cabrero

intern, 07/2021 - 08/2021

Sergio Luque Ortiz

intern, 09/2021 - 04/2022

Míriam Pérez Carrasco

intern, 08/2021 - 10/2021

Lorena Martín Rodríguez

intern, 10/2021 - 11/2021

Alessia Manna

intern, 02/2022 – 05/2022

Daniel Hontanilla Díaz

intern, 04/2022 – 07/2022

Rafael Estrada Morillo

intern, 05/2022 - 08/2022

Victoria Muñoz García

contract, 09/2022 - 10/2022

Matteo Pessot

intern, 08/2022 - 11/2022

Estefanía Redi Ortega

intern, 09/2022 - 01/2023

Cristina García Martínez

intern, 09/2022 - 03/2023

Alonso López Alonso

NEOTUR is part of the Linguistic Area of our Project of Excellence. More specifically, our project intends to study neology and terminology in contemporary French for tourism.
With this purpose, we aim to build a French language corpus relating to tourism. The corpus texts cover the period from 2000 to 2020 and belong to different text types, including specialized press, academic papers and institutional texts. The implementation of the corpus is followed by the phases of filing, pre-editing, archiving, tagging and corpus analysis, with the subsequent creation of accessible resources for our online platform.

The main objective of this research is to record and examine the frequency and evolution of neologisms and terms in the field of tourism through the use of IT tools. In particular, we intend to focus on word-formation, phraseology and the definition of terms.

Expected results
Through an online platform it will be possible to carry out linguistic research and extract concordances, contexts and other linguistic data from the corpus. Furthermore, a database of terms will be created in the form of a network.


Fixed-term collaborators

contract, 04/2022 - 07/2022

Barbara Bellinaso