International Centre for Language and Communicative Development: Corpus and Experimental Study: Children's Acquisition of Wh-questions, 2019

Corpus data analysis (Python scripts) audio recordings (audio files), stimuli (a complete OpenSesame Experiment file), transcribed and coded responses (.csv file) data analysis scripts (R files) from a training study described by the following abstract: Subject-auxiliary inversion in interrogatives has been a topic of great interest in language acquisition research, and has often been held up as evidence for the structure-dependence of grammar. This applies especially to wh- questions, which are argued to be structurally more complex than polar interrogatives. Non-inversion errors are of particular interest as they represent a rare case in which children reliably make errors involving word order. Usage-based and nativist approaches posit different representations and processes underlying children’s question formation and therefore predict different causes for these errors. Here, we explore the question of whether input statistics predict children’s spontaneous non-inversion errors with wh- questions. While previous work has focused primarily on the distributional properties of wh- words and auxiliaries themselves, we consider the statistical properties of additional subsequences. In particular, we look at properties of the non-inverted, errorful forms of questions. In keeping with recent evidence for multiword units in children’s comprehension and production, we explore the question of whether the frequency of uninverted subsequences (e.g., “she is going” in “what she is going to do?*”) is a good predictor of children’s errors. First, through a series of corpus analyses, we show that this is indeed the case. Second, we conduct an experiment in which children are asked to repeat wh-questions which, although matched for the n-grams in the well-formed questions, differ as to the n-gram frequencies of the crucial bigrams.The International Centre for Language and Communicative Development (LuCiD) will bring about a transformation in our understanding of how children learn to communicate, and deliver the crucial information needed to design effective interventions in child healthcare, communicative development and early years education. Learning to use language to communicate is hugely important for society. Failure to develop language and communication skills at the right age is a major predictor of educational and social inequality in later life. To tackle this problem, we need to know the answers to a number of questions: How do children learn language from what they see and hear? What do measures of children's brain activity tell us about what they know? and How do differences between children and differences in their environments affect how children learn to talk? Answering these questions is a major challenge for researchers. LuCiD will bring together researchers from a wide range of different backgrounds to address this challenge. The LuCiD Centre will be based in the North West of England and will coordinate five streams of research in the UK and abroad. It will use multiple methods to address central issues, create new technology products, and communicate evidence-based information directly to other researchers and to parents, practitioners and policy-makers. LuCiD's RESEARCH AGENDA will address four key questions in language and communicative development: 1) ENVIRONMENT: How do children combine the different kinds of information that they see and hear to learn language? 2) KNOWLEDGE: How do children learn the word meanings and grammatical categories of their language? 3) COMMUNICATION: How do children learn to use their language to communicate effectively? 4) VARIATION: How do children learn languages with different structures and in different cultural environments? The fifth stream, the LANGUAGE 0-5 PROJECT, will connect the other four streams. It will follow 80 English learning children from 6 months to 5 years, studying how and why some children's language development is different from others. A key feature of this project is that the children will take part in studies within the other four streams. This will enable us to build a complete picture of language development from the very beginning through to school readiness. Applying different methods to study children's language development will constrain the types of explanations that can be proposed, helping us create much more accurate theories of language development. We will observe and record children in natural interaction as well as studying their language in more controlled experiments, using behavioural measures and correlations with brain activity (EEG). Transcripts of children's language and interaction will be analysed and used to model how these two are related using powerful computer algorithms.

Show More

Geographic Coverage:

GB

Temporal Coverage:

2019-01-25/2019-06-27

Resource Type:

dataset

Available in Data Catalogs:

UK Data Service

Topics: