Corpus de la SNCF

Full Name
Corpus de la SNCF
Composer
project GRECO (Grenoble Campus Ouvert)
Language
French
French French
Register
Spoken
Genre
Telephone
Style
Formal
Period
1900-2000 AD
Number of words
< 500.000
Number of words (details)
84.494 words
Annotation
Tokenization
Format
CD/DVD
Format remarks
CD-ROM available in the UGent French section.
Data collection
Elicited
Multimedia
Transcription only
Availability
Not available
Remarks
Transcribed dialogues of telephone conversation from the information centre of the St-Lazare railway station in Paris.