TY - GEN
T1 - Survey of conversational behavior
T2 - 10th International Conference on Language Resources and Evaluation, LREC 2016
AU - Koiso, Hanae
AU - Tsuchiya, Tomoyuki
AU - Watanabe, Ryoko
AU - Yokomori, Daisuke
AU - Aizawa, Masao
AU - Den, Yasuharu
N1 - Funding Information:
This work is supported by Grant-in-Aid for Collaborative Research Project of NINJAL “A multifaceted study of spoken language using a large-scale corpus of everyday Japanese conversation” led by Hanae Koiso.
PY - 2016
Y1 - 2016
N2 - In 2016, we set about building a large-scale corpus of everyday Japanese conversation, a collection of conversations embedded in naturally occurring activities in daily life. We will collect more than 200 hours of recordings over six years, publishing the corpus in 2022. To construct such a huge corpus, we have conducted a pilot project, one of whose purposes is to establish a corpus design for collecting various kinds of everyday conversations in a balanced manner. For this purpose, we conducted a survey of everyday conversational behavior, with about 250 adults, in order to reveal how diverse our everyday conversational behavior is and to build an empirical foundation for corpus design. The questionnaire asked when, where, for how long, with whom, and in what kind of activity informants were engaged in conversations. We found that ordinary conversations show the following tendencies: i) they mainly consist of chats, business talks, and consultations; ii) in general, the number of participants is small and the duration of the conversation is short; iii) many conversations are conducted in private places such as homes, as well as in public places such as offices and schools; and iv) some questionnaire items are related to each other. This paper describes an overview of this survey study, and then discusses how to design a large-scale corpus of everyday Japanese conversation on this basis.
UR - http://www.scopus.com/inward/record.url?scp=85037061332&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85037061332&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85037061332
T3 - Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
SP - 4434
EP - 4439
BT - Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
A2 - Calzolari, Nicoletta
A2 - Choukri, Khalid
A2 - Mazo, Helene
A2 - Moreno, Asuncion
A2 - Declerck, Thierry
A2 - Goggi, Sara
A2 - Grobelnik, Marko
A2 - Odijk, Jan
A2 - Piperidis, Stelios
A2 - Maegaard, Bente
A2 - Mariani, Joseph
PB - European Language Resources Association (ELRA)
Y2 - 23 May 2016 through 28 May 2016
ER -