Description of corpora
Romance Language Survey
Description
- The word list for this corpus was originally designed by Laura Colantoni and Jeffrey Steele for the Romance Phonetics Database (containing audio recordings).
- The current data are from 11 speakers representing two varieties of French and three varieties of Spanish.
- The simultaneous EPG and audio recordings were collected in the Linguistics Phonetics Lab in 2008-09 (Spanish) and 2014-15 (French).
Speaker codes
- French: FRCf01, FRCf02, FRQf01, FRQf02 (where C = Continental/France, Q = Quebec; f = female)
- Spanish: SPAf01, SPAf02, SPAf03, SPAf04, SPAm05, SPCf01, SPPf01 (where A = Argentine, C = Cuban, P = Peninsular/Spain, f = female, m = male)
Materials
- French
- Words (alphabetical and phonetic), produced in the carrier phrase Je dis __ une fois 'I say ___ again' and in isolation. On average 6 repetitions of words in carrier sentences and 2 repetitions of single words were collected.
- Text: La Bise et le Soleil (The North Wind and the Sun)
- Spanish
- Words (alphabetical and phonetic), produced in the carrier phrase Digo __ otra vez 'I say ___ again' and in isolation. On average 6 repetitions of words in carrier sentences and 2 repetitions of single words were collected.
- Text: La Bise et le Soleil (The North Wind and the Sun)
- Text: El Viento Norte y el Sol (The North Wind and the Sun)
- For both languages
- Multiple repetitions were elicited of all the materials (on average 6 for words in carrier sentences, at least 2 for single words, and 4 for the text).
- In file names, 'c' refers to words in carrier sentences, 's' to words in isolation,and 'p' to the text (e.g. abaisse_c1_epFRCf01: the first repetition of the word 'abaisse' in the carrier sentence by FRCf01).
- All files are coded for segments (consonants and vowels) and graphemes.
The Cross-Language Articulatory Database (CLAD) @ CHASS / University of Toronto Copyright © 2026 University of Toronto