C-ORAL-ROM. Integrated Reference Corpora for Spoken Romance Languages
Edited by Emanuela Cresti and Massimo Moneglia
University of Florence
The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages, French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each corpus is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc. and the transcriptions show changes of speaker. Speech acts are tagged according to the evidence of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structure and text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.
[Studies in Corpus Linguistics, 15] 2005. xviii, 304 pp. (incl. DVD)
Hb 90 272 2286 X EUR 120.00 / 158811 548 8 USD 144.00
2005
2005
Preface
Claire Blanche-Benveniste
- The C-ORAL-ROM resource
Massimo Moneglia and Philippe Martin - The Italian corpus
Emanuela Cresti, Alessandro Panunzi and Antonietta Scarano - The French corpus
Estelle Campione, Jean Vèronis and Josè Deulofeu - The Spanish corpus
Antonio Moreno Sandoval, Gillermo de la Madrid, Manuel Alcántara Ana Gonzalez, José M. Guirao and Raúl De la Torre - The Portuguese corpus
Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Rita Veloso, Sandra Antunes, Florbela Barreto and Raquel Amaro - Notes on lexical strategy, structural
strategies and surface clause indexes in
the C-ORAL-ROM spoken corpora
Emanuela Cresti - Appendix
Massimo Moneglia, Marco Fabbri, Silvia Quazza, Andrea Panizza, Morena Danieli, Juan MarĂia Garrido and Marc G.J. Swerts - Bibliography