Learning to Read Connections Sensitivity to Collocation Frequency Links vocabulary size and reading comprehension in Middle Childhood
Learning to Read Connections Sensitivity to Collocation Frequency Links vocabulary size and reading comprehension in Middle Childhood
R
eading comprehension plays an important role in everyday life.
The ability to read and comprehend text fluently involves many
complex cognitive processes that develop in early and middle
childhood. Longitudinal studies in multiple alphabetic languages broadly
distinguish two underlying cognitive components: decoding and lan-
guage comprehension (Caravolas et al., 2019; Florit & Cain, 2011; Hjet-
land et al., 2020; Hoover & Tunmer, 2018; Nation, 2019). Together, they
have been shown to explain more than 85% of the variability in reading
comprehension abilities in middle childhood (Chiu, 2018; Lervåg
Reading Research Quarterly, 59(3) et al., 2018; Lonigan et al., 2018).
pp. 448–467 | doi:10.1002/rrq.548
© 2024 The Author(s). Reading Research Quarterly
Decoding skills (i.e., word recognition) are explained by underlying
published by Wiley Periodicals LLC on behalf of phonological and phoneme-grapheme convergence processes (e.g., pho-
International Literacy Association. This is an open access
article under the terms of the Creative Commons
nological awareness, letter-sound knowledge, rapid naming; Caravolas
Attribution-NonCommercial License, which permits use, et al., 2019; Moll et al., 2014; Schmitterer & Schroeder, 2019a; Zugarra-
distribution and reproduction in any medium, provided
the original work is properly cited and is not used for
murdi et al., 2022). However, disentangling the cognitive processes
commercial purposes. underlying the relation between early language comprehension
448
19362722, 2024, 3, Downloaded from https://ila.onlinelibrary.wiley.com/doi/10.1002/rrq.548 by Pontificia Universidad Catolica De Chile, Wiley Online Library on [20/08/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
development and reading comprehension skills in middle Only few computational studies have studied early
childhood is still considered a complex problem (Castles vocabulary development in other languages than English.
et al., 2018; Duke & Cartwright, 2021). Furthermore, while Here, we introduce studies using network growth model
decoding skills explain more variability in reading com- approaches that have been replicated across multiple lan-
prehension in early reading development, the contribution guages. These studies have investigated learning princi-
of language comprehension increases throughout reading ples that aim to explain early vocabulary development
development (Hjetland et al., 2019, 2020; Hulme (Fourtassi et al., 2020; Hills et al., 2009). In these studies,
et al., 2015). Thus, understanding how early language semantic networks were fitted for 130 high-frequency
comprehension development relates to reading compre- nouns typically acquired before 30 months of age, and
hension is relevant for reading acquisition in the long vocabulary growth was modeled based on Age- of-
term. In this study, we aim to investigate the role that the Acquisition norms (AoA; MCDI; Dale & Fenson, 1996).
early acquired cognitive architecture of semantic knowl- AoA norms provide information from parent question-
edge based on co-occurrences might play in the relation naires that qualify the age at which each noun was
between vocabulary size and reading comprehension abil- acquired for at least 50% of children (Braginsky et al.,
ities in middle childhood. 2019). The structure of semantic networks was fitted
using feature-based semantic relations (McRae et al.,
2005) and association norms (Nelson et al., 1998). In
Lexical Structure, Vocabulary addition, it was investigated whether the strength of
Development and Collocation relatedness between nodes and the number of nodes con-
Frequency nected to each other (i.e., connectivity) were consistent
Computational models of semantic representations with co- occurrence statistics from corpora of child-
assume that lexical knowledge (i.e., vocabulary) is stored directed speech (CHILDES; MacWhinney, 2014; Word-
in networks or vector spaces (Collins & Loftus, 1975; Firth, bank; Frank et al., 2017). The networks followed these
1957; Harris, 1954; Landauer & Dumais, 1997; Mikolov principles: A new word is more likely to be learned if it is
et al., 2013; Pennington et al., 2014; McClelland & Rogers, semantically related to other words. For example, the
2003; Stella et al., 2017; Steyvers & Tenenbaum, 2005; Tur- word “rain” is learned faster if the word “cloud” is already
ney & Pantel, 2010). In such structures, words are repre- known or appears together with “rain” in the child’s envi-
sented by nodes, which are connected by links (or vectors) ronment. Furthermore, a new word is more likely to be
that represent specific relations between the words. An learned if it is semantically related to words that are con-
underlying assumption is that the processing and meaning nected to many other words (i.e., high connectivity and
representation of any given word is specified by the words similar contexts). For example, the word “rain” will be
it relates to. learned more quickly if it is semantically related to a hub
Links between words in these models can be semantic including “cloud,” “thunder,” “mountain,” “lightning,” and
relations based on features (e.g., cucumber—is edible; “water”.
cucumber—is a vegetable, McRae et al., 2005), free asso- Seminal network growth studies by Hills et al. (2009)
ciation norms (e.g., dog—bone; Nelson et al., 1998), or and Fourtassi et al. (2020) found that young children’s lexi-
co-occurrence statistics (e.g., cloud—rain; Fourtassi cal structure in 10 languages represented associative rather
et al., 2020; Fourtassi, 2020). Because words often co- than feature-based relations. Since association norms have
occur within specific thematic contexts (e.g., clouds and been explained by co- occurrence statistics (Four-
rain during a walk), semantic relations based on co- tassi, 2020; Lund et al., 1996; Spence & Owens, 1990;
occurrence are also referred to as thematic relations (Mir- Unger et al., 2023), this points to the importance of collo-
man et al., 2017). Particularly in distributional semantic cation frequency in early vocabulary development. In
models (mainly vector-based), co-occurrence measures addition, the growth of the networks has been explained
are used as the basic statistical information to specify the by the number of co-occurring words with the target
models (Flores et al., 2020; Fourtassi, 2020; Pennington nouns (i.e., connectivity) in the language environment
et al., 2014; Unger et al., 2023). The underlying assump- (i.e., preferential acquisition). Thus, in these models words
tion is that words with similar meanings or roles tend to that were linked to many co-occurring words in child-
appear in similar linguistic contexts. In this study, we use directed speech were acquired earlier than words that were
the term collocation frequency to highlight that sensitiv- linked to few co-occurring words. Studies with adults
ity to co-occurrence measures is connected to the fre- show similar growth effects that highlight the importance
quency with which two words co-occur (e.g., within 5 of semantic relatedness and connectivity for lexical growth
words, within a sentence; Evert, 2008; Unger et al., 2023). (Steyvers & Tenenbaum, 2005). Based on this evidence for
In this study, we hypothesize that sensitivity to colloca- lexical growth in early childhood and adulthood, we
tion frequency in the input language links vocabulary size assume that for middle childhood, computational models
and reading comprehension. would show that collocation frequency in the language
FIGURE 1
Schematic Description of Stimuli Selection in Thematic Judgment Task. Note. Co-occurrence measure = t-score
(Evert, 2008); The higher the t-score, the shorter the path length.
controlled for by parent AoA ratings in the original design. Stimuli presentation within blocks was random.
study.The experiment followed a mixed stimulus-within- Each block was followed by a short break. Prior to analy-
condition design. Each child responded to all stimuli in ses, we conducted power analyses (Westfall et al., 2014) on
each condition (45 stimuli; fully crossed). Each target sen- stimuli-within-condition and fully crossed designs with an
tence was presented three times (once for each condition) estimated 100 participants and 45 stimuli (d = .05) and
to each child (see Figure S1). Items were nested within tar- found power estimates greater than 0.8.
get sentences and conditions (stimuli-within-condition).
Participants were instructed as follows: “Soon you will Standardized Assessments
hear a sentence followed by two words. Please judge which
of the two words fits the sentence better. If you think the Vocabulary Size
first word fits the sentence better, press 1. If you think the Participants were asked to produce 40 nouns, verbs, or
second word fits the sentence better, press 2.” The children adjectives based on pictures on a computer screen (Worts-
indicated their judgments by pressing the appropriate key chatz-und Wortfindungstest [WWT]; Glück & Glück,
on a laptop. In total, there were 90 pairs (45 counterbal- 2011). The raw score was the sum of all correct responses.
anced pairs) organized into six lists of 15 trials, each con- The test provided a list of alternative correct responses for
taining each target sentence once, of which three lists (= each item, derived from the normative sample, to increase
blocks) were presented to each child in a Latin-square the objectivity of assessment. Criteria for item selection
Decoding Covariates
Participants were asked to correctly read pseudowords Phonological Working Memory
(1 min) from a list of 156 items (Salzburger Lese-und Participants completed a digit span task (Arbe-
Rechtschreibtest [SLRT-II]; Moll & Landerl, 2010). The itsgedächtnistestbatterie [AGTB]; von Hasselhorn
raw score was the number of correctly read pseudowords. et al., 2012) consisting of eight adaptive test trials: they
Parallel test reliability was reported as r = .90 for third would advance to the next level after correctly answering
grade. Standardized scores were reported for the middle of two consecutive items with the same digit span. The raw
third grade. score was calculated as the mean of the last two correctly
TABLE 1
Descriptive Statistics of Experimental and Standardized Assessments
Variable M SD Range Max
Thematic judgment accuracy (%) 81.32 9.06 56–98 43a
Note. N = 147; for standardized assessments the first row depicts raw scores, the second row depicts standardized scores (T).
a
Two items were excluded from the analysis.
b
Speeded tests.
Condition effects
Random effects
σ2 3.290 0.146
Marginal R /Conditional R
2 2
0.029/0.195 0.045/0.304
Random effects
σ2 3.290 0.146
Note. Accuracy models: 6321 observations; Latency models: 4673 observations; all models: 147 participants, 29 classes, 43 items (trials) and 15
reference words. Bold values represent significant effects.
FIGURE 3
Effects of Vocabulary Size and Reading Abilities on Thematic Judgment Responses. Note. Plot based on raw data;
arrows represent standard errors.
had a lower but acceptable fit (>.6). Both models for this Reading Comprehension on Text Level
score are shown in Figure S5. The mediation model of The mediation model for text-level reading comprehen-
the composite reading score had an acceptable fit sion had a good fit (CFI = .99; RMSEA = .06 [.00, .15];
(CFI = .96; RMSEA = .09 [.04, .14]; SRMR = .07; SRMR = .04; 𝝌2 = 6.43 (4), p =. 17; see Figure 4a). Vocabu-
𝝌2 = 25.72 (12), p =. 01). Vocabulary size did not signifi- lary size did not significantly predict reading comprehen-
cantly predict reading comprehension, either before or sion before, but after thematic judgment responses were
after accounting for thematic judgment responses (c: considered (c: β = .13, SE = .10, z = 1.28, p > .05; c’: β = .30,
β = −.02, SE = .09, z = −.21, p > .05; c’: β = 12, SE = .07, SE = .08, z = 3.76, p < .001). Vocabulary size predicted the-
z = 1.731, p = .08). However, vocabulary size predicted matic judgment (a: β = .48, SE = .07, z = 6.79, p < .001) and
thematic judgment (a: β = .47, SE = .07, z = 6.78, p < .001), thematic judgment in turn predicted reading comprehen-
and thematic judgment in turn predicted reading com- sion (b: β = .35, SE = .14, z = 2.41, p < .05). The mediation
prehension (b: β = .30, SE = .13, z = 2.27, p < .05), leading effect was significant, indicating that thematic judgment
to a significant mediation effect (a*b: β = .14, SE = .06, contributed to the significant relation between vocabulary
z = 2.20, p < .05). size and reading comprehension (a*b: β = .16, SE = .07,
The second model included decoding. The model also z = 2.32, p < .05).
had an acceptable fit (CFI = .97; RMSEA = .08 [.04, .12]; The model including decoding had an acceptable fit
SRMR = .10; 𝝌2 = 34.11 (17), p =. 01). However, including (CFI = .97; RMSEA = .09 [.02, .15]; SRMR = .07; 𝝌2 = 14.52
decoding changed the pattern of results. Vocabulary size (7), p = .04). However, the effect of thematic judgment on
now significantly predicted reading comprehension both reading comprehension and the mediation effect were not
before and after considering thematic judgments (c: β = 18, significant (b: β = .13, SE = .12, z = 1.11, p > .05; a*b: β = .06,
SE = .05, z = 3.29, p < .01; c’: β = 19, SE = .04, z = 4.5, p < .001) SE = .05, z = 1.10, p > .05; see Figure 4b). Vocabulary size
and still predicted thematic judgment (a: β = .47, SE = .07, predicted thematic judgment (a: β = .48, SE = .07, z = 6.77,
z = 6.76, p < .001). The mediation effect was not significant p < .001) and reading comprehension (c: β = 28, SE = .08,
(a*b: β = .01, SE = .03, z = .26, p > .05) as thematic judgment z = 3.34, p < .01; c’: β = .34, SE = .06, z = 5.31, p < .001). In
no longer predicted reading comprehension (a: β = .02, addition, decoding strongly predicted reading compre-
SE = .07, z = .26, p > .05). Expectedly, decoding strongly hension (β = .55, SE = .06, z = 8.56, p < .001).
predicted reading comprehension (β = .73, SE = .05, Despite a moderate correlation with text-level reading
z = 13.51, p < .001). comprehension (r = .31), thematic judgment no longer
Acknowledgments https://doi.org/10.1371/journal.pone.0019348
Braginsky, M., Yurovsky, D., Marchman, V. A., & Frank, M. C. (2019).
We thank our student research assistants for their support Consistency and variability in children’s word learning across lan-
guages. Open Mind, 3, 52–67. https://doi.org/10.1162/opmi_a_00026
in data collection. Furthermore, we thank Marcus Hassel- Buchanan, L., Westbury, C., & Burgess, C. (2001). Characterizing
horn, Garvin Brod, Leonard Tetzlaff, and all collaborators semantic space: Neighborhood effects in word recognition. Psycho-
for their role in supporting the iLearn project. Open nomic Bulletin & Review, 8, 531–544. https://doi.org/10.3758/BF031
Access funding enabled and organized by Projekt DEAL. 96189
Caravolas, M., Lervåg, A., Mikulajová, M., Defior, S., Seidlová-Málková,
G., & Hulme, C. (2019). A cross-linguistic, longitudinal study of the
foundations of decoding and reading comprehension ability. Scien-
Funding Information tific Studies of Reading, 23(5), 386–402. https://doi.org/10.1080/
10888438.2019.1580284
This research was funded by the German Federal Ministry Carroll, J. B., Davies, P., & Richman, B. (1971). Word frequency book.
of Education and Research, as part of the LONDI research American Heritage.
program and its subproject iLearn (grant number: Castles, A., Rastle, K., & Nation, K. (2018). Ending the Reading wars:
Reading acquisition from novice to expert. Psychological Science in
01GJ1710B). the Public Interest, 19(1), 5–51. https://doi.org/10.1177/1529100618
772271
Chiu, Y. D. (2018). The simple view of Reading across development:
Ethics Approval and Conflict of Prediction of grade 3 Reading comprehension from prekindergarten
skills. Remedial and Special Education, 39(5), 289–303. https://doi.
Interest Disclosure org/10.1177/0741932518762055
Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2013). Applied multiple
This study was approved by the Institutional Review Board regression/correlation analysis for the behavioral sciences. Routledge.
(Ethics Committee) of the DIPF | Leibniz Institute for Collins, A. M., & Loftus, E. F. (1975). A spreading-activation theory of
Research and Information in Education (reference num- semantic processing. Psychological Review, 82(6), 407–428. https://
doi.org/10.1037/0033-295X.82.6.407
ber: iLearn) and the Ethics Committee of the Hessian Dale, P. S., & Fenson, L. (1996). Lexical development norms for young
Ministry for Culture, Education and Opportunities (refer- children. Behavior Research Methods, Instruments, & Computers, 28,
ence number: GWU 695). Parents or legal guardians gave 125–127. https://doi.org/10.3758/BF03203646
written consent for their children to participate prior to Dornseiff, F. (2020). Der deutsche Wortschatz nach Sachgruppen. De
data collection. Children could also agree to participate in Gruyter. https://doi.org/10.1515/9783110457742
Duke, N. K., & Cartwright, K. B. (2021). The science of reading
the study with a simplified version of the consent form and progresses: Communicating advances beyond the simple view of
were informed before data collection about their right to reading. Reading Research Quarterly, 56, S25–S44. https://doi.org/10.
refuse or stop participation at any time without negative 1002/rrq.411
consequences. In addition, teachers gave written consent Duke, N. K., Ward, A. E., & Pearson, P. D. (2021). The science of
for their own participation in the study prior to data col- reading comprehension instruction. The Reading Teacher, 74(6),
663–672. https://doi.org/10.1002/trtr.1993
lection. The consent forms included permission to use the Elleman, A. M. (2017). Examining the impact of inference instruction
collected data in scientific research projects, including on the literal and inferential comprehension of skilled and less
publications, in an anonymized form. The authors have no skilled readers: A meta-analytic review. Journal of Educational Psy-
conflicts of interest to declare. chology, 109(6), 761–781. https://doi.org/10.1037/edu0000180