ISSN 1671-3710
CN 11-4766/R
Sponsored by: Institute of Psychology, Chinese Academy of Sciences
Published by: Science Press

Advances in Psychological Science ›› 2025, Vol. 33 ›› Issue (10): 1794-1804. doi: 10.3724/SP.J.1042.2025.1794 cstr: 32111.14.2025.1794

• Research Frontiers •

Audiovisual integration in infant language acquisition: Different patterns in typically developing infants and those at elevated risk for autism spectrum disorder

JIN Mengke1, YAN Linlin1, LIU Shaoying1, XIAO Naiqi2

  1. Department of Psychology, School of Science, Zhejiang Sci-Tech University, Hangzhou 310018, China
  2. Department of Psychology, Neuroscience and Behaviour, McMaster University, Hamilton L8S 4L8, Canada
  • Received: 2024-10-30  Online: 2025-10-15  Published: 2025-08-18
  • Corresponding author: YAN Linlin, E-mail: yanlinlin@zstu.edu.cn
  • Supported by:
    Major Humanities and Social Sciences Research Program of Zhejiang Provincial Universities, "A Comparative Study of Audiovisual Integration Abilities in Chinese and Canadian Infants" (2024QN133); Zhejiang Educational Science Planning Project, "Developmental Patterns of and Interventions for Early Audiovisual Perceptual Integration in Infants" (2024SCG329)


Abstract:

Multimodal perception is essential for language acquisition in infancy, and audiovisual integration plays a key role in the language development of typically developing infants. By contrast, infants at elevated risk for autism spectrum disorder (ASD) often face challenges in this integration process. The typical trajectory of speech perception development underscores the importance of facial-feature processing: attention to the eye and mouth regions facilitates language learning. Empirical studies show that infants as young as 4.5 months already possess audiovisual integration abilities, which reliably predict later language outcomes. Infants at elevated risk for ASD, however, show reduced social attention and impaired audiovisual integration, deficits that may disrupt the typical pathway of language acquisition. Early intervention strategies should therefore prioritize biologically based, sensory-guided approaches that strengthen multisensory integration rather than training attentional behaviors alone. Understanding these mechanisms not only deepens our knowledge of typical language development but also provides an empirical basis for designing early interventions that support language acquisition in infants at elevated risk for ASD.


Abstract:

Language development in infancy is fundamentally shaped by the dynamic integration of auditory and visual (AV) cues. This review examines the role of AV synergy in early language acquisition by contrasting developmental trajectories in typically developing infants and those at elevated risk for autism spectrum disorder (ASD).

In typically developing infants, AV integration progresses through stage-specific mechanisms. During the first three months after birth, infants prioritize eye gaze to establish social engagement. At this stage, infants rely on temporally synchronized cues, such as speech paired with facial expressions. Early cross-modal learning is mediated by primary audiovisual cortical responses, initially confined to narrow temporal windows. Between 3 and 6 months, attention shifts toward the mouth region, driven by the exaggerated articulatory movements and prosodic features typical of infant-directed speech (IDS). During this phase, infants also begin to show sensitivity to conflicting AV inputs, as demonstrated by the McGurk effect: infants integrate a mismatched visual /ga/ and auditory /ba/ into a fused /da/ percept. Adaptive mechanisms emerge during this period, with infants increasing mouth fixation to compensate for auditory ambiguity in noisy or unfamiliar linguistic contexts. Between 6 and 9 months, mouth-focused attention becomes dominant, facilitating precise phoneme-lip mapping. Bilingual infants exhibit adaptive plasticity, extending mouth-fixation durations to manage dual-language input. From 9 to 12 months, socio-cognitive maturation supports a dynamic rebalancing of attention: infants maintain mouth fixation during lexical acquisition to strengthen phoneme-semantic associations, while reinstating eye contact to support joint attention and intentional communication. Across all stages, IDS optimizes language learning through enhanced AV synchrony, such as slowed speech rates and amplified mouth movements, scaffolding each developmental milestone.

Infants at elevated risk for ASD show systematic deviations in AV integration that emerge early in life. A prominent feature is a progressive decline in social attention, particularly eye gaze, apparent as early as two months of age. Unlike typically developing infants, who maintain eye contact to foster social reciprocity, high-risk infants gradually reduce fixation on the eyes. This diminished attention disrupts foundational processes of joint attention, thereby limiting caregiver-infant interaction and linguistic input. Neural studies link these behavioral differences to reduced cortical activation in temporal regions during dynamic face processing, suggesting impaired encoding of social stimuli. Concurrently, high-risk infants display delayed attention to the mouth region, with significant increases in mouth fixation occurring only around 18 months, considerably later than the typical 6- to 9-month window. This delay degrades phoneme-lip mapping accuracy and leads to weaker phoneme discrimination; for instance, high-risk infants struggle to exploit visual speech cues in noisy environments, reflecting impaired AV integration. Neurophysiological evidence further points to impaired AV synchrony detection, including increased tolerance of asynchronous AV stimuli and the absence of McGurk responses by nine months. These behavioral deficits are underpinned by neural atypicalities, evidenced by attenuated event-related potentials (ERPs) such as diminished N290 responses to dynamic faces; such neural signatures predict later social and linguistic impairments. Additionally, sex differences reveal divergent compensatory strategies: female high-risk infants partially mitigate language delays by increasing mouth fixation, whereas male infants exhibit persistent deficits in social attention and AV integration. Collectively, these findings identify AV integration anomalies as early transdiagnostic markers, detectable months before overt behavioral symptoms of ASD, such as language delays or social withdrawal, emerge.

Intervention strategies aligned with developmental stages have demonstrated efficacy. Early interventions (0-6 months) leverage biofeedback to reinforce eye contact and enrich IDS-driven multimodal input. Mid-phase interventions (6-12 months) employ virtual reality training to enhance visual reliance in challenging auditory environments, alongside wearable eye-trackers to align gaze with auditory labeling. After 12 months, interventions incorporate emotional prosody and facial expressions to support socio-linguistic fluency. Preliminary studies indicate that multisensory integration training significantly improves language outcomes in high-risk infants, surpassing attention-focused approaches.

Critical challenges remain, including clarifying how prosodic cues influence phoneme discrimination, understanding neural mechanisms underlying consonant learning, and translating AV biomarkers into practical clinical tools. Future research should combine naturalistic observation with advanced neuroimaging techniques to develop multimodal risk assessment systems. Addressing these gaps will facilitate early, personalized interventions, leveraging neuroplasticity to reduce developmental impairments.

Key words: speech perception, audiovisual matching, multisensory integration, high-risk autism spectrum disorder (ASD) infants, language development, early intervention

CLC number: