Corpora
LABLITA Corpus of Spontaneous Spoken Italian (Adult and Child)
C-ORAL-ROM Integrated Reference Corpora for Spoken Romance Languages (IST-2000-26-228)
Stammerjohann's corpus: the first Spoken Italian Corpus available on the net
Metadata of our corpora are freely available on this web site.
Access conditions
The Metadata collection of all corpora is freely available on this web site.
The Lablita Corpus is available in the following ways:
- within the C-ORAL-ROM corpus
- A significant portion of the Lablita Corpus of Adult Spoken Italian (roughly 36 hours, for 300,000 transcribed words) is distributed by ELDA (European Language Resource Distribution Agency, Paris) within the C-ORAL-ROM collection of Romance corpora, under a licence agreement: http://www.elda.org/catalogue/en/speech/S0172.html
- The C-ORAL-ROM corpus is also distributed in encrypted form, for personal use, by J. Benjamins Publishing Company on one DVD, together with E. Cresti & M. Moneglia (eds.), C-ORAL-ROM. Integrated Reference Corpora for Spoken Romance Languages: http://www.benjamins.com/cgi-bin/t_bookview.cgi?bookid=SCL%2015
- A collection of short samples of the C-ORAL-ROM corpus is freely available as a DEMO version on this web site
- Longitudinal samples of the LABLITA Collections of Longitudinal Corpora of Early Acquisition of Italian are freely available on this web site
- A sampling of the Stammerjohann Corpus will be published
- Larger selections of the LABLITA corpus can be accessed by private or public bodies for research and development within the framework of specific projects.
The terms of access are established through a licence agreement with the Italian Department of the University of Florence, which will limit the use of the corpora to the purposes of the project itself.
Users can select their preferred sampling using the metadata on this web site and then contact E. Cresti. Costs and conditions vary according to the sampling and the purpose of the project.