Dataset

Adults' Production of Speech (Audio-Visual Recordings)

Western Sydney University
Professor Denis Burnham (Aggregated by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=http://handle.westernsydney.edu.au:8081/1959.7/514883&rft.title=Adults' Production of Speech (Audio-Visual Recordings)&rft.identifier=http://handle.westernsydney.edu.au:8081/1959.7/514883&rft.publisher=Western Sydney University&rft.description=This data set contains audio-visual recording of adult native speakers of Thai, Cantonese, Mandarin and Malaysian-Mandarin producing syllables with all their tones and native Lancaster English speakers producing syllables with different intonations. There are 9,500 tokens from 10 Thai speaker; 7,440 tokens from 8 Cantonese speakers; 6,144 tokens from 8 Mandarin speakers; 9,360 tokens from 10 Malaysian-Mandarin speakers; 6,500 tokens from additional 5 Thai speakers and 4,752 tokens from 8 Lancaster English speakers. Audio files were extracted from every video file using Adobe Premier Pro 2.0 or VirtualDub software then they were segmented and labelled for each individual syllable within each file using the PRAAT program. These segmentation files were then used as segmentation cues to cut video files into individual syllables to be used as stimuli in speech perception experiments. The videos are in .avi format and all the audio files extracted from them are in .wav format which can be viewed via most media players. Labelled segmentation files are in .TextGrid format and can be viewed with the PRAAT program. The size of each video file is approximately 2 to 4 GB depending on the language with 266 files all together. Size of each extracted audio file is approximately 100 to 250 MB depending on the language. Size of each TextGrid file is approximately 150 to 250 KB depending on the language.&rft.creator=Professor Denis Burnham&rft.date=2012&rft_rights=&rft_subject=Speech Perception&rft_subject=Speech&rft_subject=Tone (Phonetics)&rft_subject=Language Acquisition&rft_subject=Cognitive Science&rft_subject=Psychology and Cognitive Sciences&rft_subject=Linguistics&rft_subject=Language, Communication and Culture&rft_subject=Psychology&rft_subject=Expanding Knowledge in Psychology and Cognitive Sciences&rft_subject=Expanding Knowledge&rft_subject=Expanding Knowledge&rft_subject=Applied Research&rft.type=dataset&rft.language=English Go to Data Provider

Please use the contact information below to request access to this data.

Licence & Rights:

No Licence view details
No licence

Access:

Restrictions apply view details

To discuss data access conditions please contact Professor Denis Burnham at the The MARCS Institute for Brain, Behaviour and Development
http://www.westernsydney.edu.au/marcs/contact_us

Full description

This data set contains audio-visual recording of adult native speakers of Thai, Cantonese, Mandarin and Malaysian-Mandarin producing syllables with all their tones and native Lancaster English speakers producing syllables with different intonations. There are 9,500 tokens from 10 Thai speaker; 7,440 tokens from 8 Cantonese speakers; 6,144 tokens from 8 Mandarin speakers; 9,360 tokens from 10 Malaysian-Mandarin speakers; 6,500 tokens from additional 5 Thai speakers and 4,752 tokens from 8 Lancaster English speakers. Audio files were extracted from every video file using Adobe Premier Pro 2.0 or VirtualDub software then they were segmented and labelled for each individual syllable within each file using the PRAAT program. These segmentation files were then used as segmentation cues to cut video files into individual syllables to be used as stimuli in speech perception experiments. The videos are in .avi format and all the audio files extracted from them are in .wav format which can be viewed via most media players. Labelled segmentation files are in .TextGrid format and can be viewed with the PRAAT program. The size of each video file is approximately 2 to 4 GB depending on the language with 266 files all together. Size of each extracted audio file is approximately 100 to 250 MB depending on the language. Size of each TextGrid file is approximately 150 to 250 KB depending on the language.

Data time period: 2009 to 12 2012