Standard textual and acoustic entries
C-ORAL-ROM aim to ensure full exploitation of the all linguistic information and maximum possibility to reuse the resource in both academy and industry.
C-ORAL-ROM texts are delivered in standard textual format (txt CHAT format) with a additional xml entry.
The DTD for the C-ORAL-ROM textual format and a PERL script for xml conversion has been realized
Alignment of txt files generate an alg file linking the acoustic signal and the text. An additional xml entry of the alg file is provided for the reuse of the utterance database in different frames.
Processing of C-ORAL-ROM speech resources