Startseite From neural matter to rapid symbolic learning in brains and artificial neural networks: a brief overview and perspective
Artikel Open Access

From neural matter to rapid symbolic learning in brains and artificial neural networks: a brief overview and perspective

  • Rosario Tomasello ORCID logo EMAIL logo
Veröffentlicht/Copyright: 21. Oktober 2025
Linguistics Vanguard
Aus der Zeitschrift Linguistics Vanguard

Abstract

Advances in artificial neural networks (ANNs) have revolutionized the way we work, learn, and acquire information, achieving human-level capabilities. Yet, ANNs differ fundamentally from the human brain in how symbolic knowledge is acquired, typically requiring extensive training to form stable internal representations. In contrast, the human brain exhibits exceptional ability to instantaneously map new words to their referents, a process known as “fast mapping”, considered a fundamental mechanism underlying symbol acquisition in early ontogeny. This review provides an overview of neurocognitive research on rapid symbolic learning and examines recent advances in computational modeling approaches aimed at replicating this capability. Models constrained by neurobiological principles known to exist in the human brain are discussed, providing a first step toward neural- and cortical-level explanations of rapid symbolic learning and opening new venues for identifying the neural mechanisms underpinning rapid word acquisition. Archiving these advances may be particularly relevant for guiding the development of sustainable, energy-efficient architectures. A major desideratum from a linguistic and pragmatic perspective involves investigating the neural basis of fast mapping across diverse communicative and pragmatic contexts, an area where current models still fall short.

1 Introduction

One of the main features that sets humans apart is the surprisingly rapid acquisition of a vast vocabulary of meaningful symbols in early ontogeny. By around 12 months, most infants begin to produce their first word, and towards 18 months, when vocabulary reaches a certain threshold, growth often accelerates, with children acquiring on average up to 10 words per day, while their receptive vocabulary is likely even larger (e.g., Bloom 2002; Reznick and Goldfield 1992). By adulthood, vocabulary size can reach approximately 42,000 words or more (Brysbaert et al. 2016; see also Anglin et al. 1993).[1] How can such spectacular growth of a lexicon be explained?

While a range of linguistic, cognitive, and socio-pragmatic factors are known to contribute to early word learning (Goldstein et al. 2003; Samuelson and McMurray 2017; Tomasello 2003), one mechanism that has received considerable empirical attention is fast mapping. This process refers to the ability to form associations between a novel word and its referent after only minimal exposures or sometimes even after a single encounter. Fast mapping was first formally described by Carey and Bartlett (1978), who showed that children, when presented with a blue tray and an olive tray and instructed “Bring me the chromium tray, not the blue one, the chromium one”, successfully inferred that the novel label chromium referred to the olive-colored tray (for a review, see Carey 2010). Subsequent research has examined the robustness of rapid mappings over time in both children and adults, showing successful retention of a single new word even after one month (Markson and Bloom 1997). However, when 24-month-old children were exposed to multiple new words (eight in total), they performed well in the referent selection task but showed poor retention in a delayed recall task after just 5 min (Horst and Samuelson 2008). These findings suggest that while fast mapping may enable initial mapping under minimal exposure, its effectiveness diminishes when cognitive load increases, such as when multiple mappings are learned at once. To account for long-term retention, researchers have proposed “extended mapping”, a gradual refinement of memory traces through repeated exposure and use, as a critical mechanism (Kucker et al. 2015; Swingley 2010; see also Bion et al. 2013). Although repeated exposure plays a role, a central question regards how initial mapping differs between fast mapping, where novel meanings are inferred through contrast with familiar referents, and explicit instruction, which involves direct labeling without such contrast.

Addressing this question, a recent study found that although learning 10 novel word–meaning mappings through fast mapping resulted in lower initial mapping accuracy than with explicit instruction, it led to more stable memory consolidation after 1.5 years (Chen et al. 2023). This suggests that fast mapping may facilitate early integration of novel words into existing semantic and conceptual networks, supporting deeper encoding and more robust consolidation when novel items are contrasted with familiar objects (Atir-Sharon et al. 2015; Coutanche and Thompson-Schill 2014; Smith et al. 2014; Zaiser et al. 2022a, 2022b). Further evidence supporting this view has shown that fast mapping enables quicker semantic integration when the novel item shares multiple features with a familiar concept (e.g., Zaiser et al. 2022b), suggesting that such sharedness facilitates access to existing conceptual networks. Additionally, by the time infants begin to understand language at about the age of one, they have already accumulated rich observational experience with objects, actions, and phonological forms through babbling and imitation, further supporting their initial single-word acquisition (Macnamara 1972; see also Werker and Hensch 2015). Consequently, early exposure, or the contrast of previously formed semantic memory traces during word–referent mapping, appears not only to facilitate familiarity with sounds and forms but also to enable infants to rapidly associate new words with relevant concepts.

This human ability stands in contrast to standard artificial neural networks (ANNs), which, despite recent advances, generally require extensive training involving thousands of learning events and vast datasets to achieve stable internal information representations (e.g., Devlin et al. 2019; Lake et al. 2017). Numerous learning methods and neural network variants exist, with the most successful approach being the classic supervised backpropagation learning rule (Rumelhart et al. 1986) and its various adaptations. Although this algorithm in neural networks has proven highly efficient in language processing, such as speech recognition, and word and semantic processing (Chen et al. 2017; Kriegeskorte 2019; LeCun et al. 2015; Rogers et al. 2004), it is typically slow due to weight adjustments propagating through multiple deep layers and is criticized for its biological implausibility (Mazzoni et al. 1991; Reilly 1999). Furthermore, the human brain operates under neuroanatomical constraints that shape learning in ways that are often overlooked in artificial models, yet these constraints have been considered essential for simulating higher cognitive functions, such as language and symbolic learning and processing (Breakspear 2017; Pezzulo et al. 2013; Pulvermüller et al. 2021; van Albada et al. 2021).

2 Rapid symbolic learning in mind and brain

Since the discovery of fast mapping as a mechanism for rapid novel word acquisition, its neural underpinnings have received growing attention. Electroencephalography (EEG), with its millisecond-level temporal resolution, has revealed rapid neural changes following minimal exposure to novel word–referent pairings (de Diego Balaguer et al. 2007; Friedrich and Friederici 2011; Shtyrov 2011; Shtyrov et al. 2022; Torkildsen et al. 2008). For instance, by 6 months, infants exhibit distinct neural responses when mapping novel words to referent objects in a picture–word mismatch paradigm (Friedrich and Friederici 2011). The efficiency of this mapping seems to vary with vocabulary size in 20-month-old children, evidenced by neural activation differences in fast mapping (Torkildsen et al. 2008), suggesting that prior learning experiences shape fast mapping mechanisms. Similarly, an EEG study on artificial language acquisition demonstrated neural response changes within about 1 min of exposure, indicating rapid structural information extraction (de Diego Balaguer et al. 2007). Changes in brain responses due to rapid learning have also been consistently observed in adults. Several EEG studies have shown that novel phonological forms can be mapped onto neural representations after only brief exposure, with learning-related changes emerging in perisylvian language cortices (Kimppa et al. 2015; Partanen et al. 2017; Shtyrov et al. 2010). Extending this, other studies have examined the mapping of novel word forms onto their referent objects, revealing enhanced neural responses during word–referent association (Mestres-Missé et al. 2007; Shtyrov et al. 2021, 2022; Vasilyeva et al. 2019; for a review, see Leminen et al. 2023). In an EEG study, Shtyrov et al. (2022) had participants learn 20 unfamiliar auditory word forms paired with novel visual referents, each presented 10 times alongside familiar objects in a fast mapping condition. In contrast, an instructed explicit learning condition used additional unfamiliar word–referent pairs without familiar co-presentations. Fast mapping elicited stronger neural responses as early as ∼200 ms, whereas explicit learning was associated with later-occurring activation patterns. Notably, greater left-lateralization, particularly in language-related regions, has been documented in fast mapping scenarios compared to explicit learning (Shtyrov et al. 2021). These distinct activation patterns suggest that fast mapping engages early, automatic lexical-semantic processing and may rely on pre-existing conceptual representations.

Although substantial evidence has demonstrated neuroplastic changes underlying rapid memory trace formation during symbolic learning, the cortical regions involved in word–meaning mapping and their functional roles remain a matter of debate. Grasping the meaning of words and their relationship to the external world has been shown to rely on large-scale, distributed neural circuits spanning multiple cortical regions. These circuits encompass both domain-specific regions, including action and perceptual systems, and domain-general hubs that act as bridges between them (Binder and Desai 2011; Kuhnke et al. 2023; Pulvermüller 2013; Pulvermüller and Fadiga 2010). Importantly, these cortical regions are not uniformly engaged across all word types; rather, their activation patterns vary depending on the semantic content of the words. High-resolution functional magnetic resonance imaging (fMRI) studies have shown that different word types or symbols (action vs. object words, or animal vs. tool words) selectively activate visual and motor cortices, reflecting different aspects of their meaning (Buccino et al. 2005; Grisoni et al. 2021; Hauk et al. 2004; Meteyard 2012; Shebani et al. 2022; for work on patients, see also Dreyer et al. 2015). Behavioral studies showed motor interference effects during action word understanding or graspable object recognition (e.g., Buccino et al. 2017; Shebani and Pulvermüller 2013), further supporting the role of sensorimotor regions in word meaning processing. Importantly, these domain-specific activations are thought to interact with multimodal hub regions in frontal, temporal, and parietal cortices, functioning as convergence zones where different types of word meanings are equally processed (Binder et al. 2009; Binder and Desai 2011; Bookheimer 2002; Patterson et al. 2007; Pulvermüller 2013). Although it is clear that various cortical regions are involved in word meaning processing, debate persists over the exact role these regions play in facilitating rapid learning and symbolic processing. This debate is particularly relevant given that rapid plastic changes in neocortical regions in fast mapping conditions appear to occur independently of hippocampal learning processes (Atir-Sharon et al. 2015; Merhav et al. 2015). Lesions in semantic hubs, such as the anterior temporal lobe, impair rapid mapping, suggesting their crucial role in word learning-induced cortical plasticity (Atir-Sharon et al. 2015; Merhav et al. 2015; Sharon et al. 2011). The existence of this direct neocortical route challenges traditional views of the semantic memory system, which posit that it is first supported by the hippocampus and only later by slow, gradual consolidation of neural representations in the neocortex (Davis and Gaskell 2009; Frankland and Bontempi 2005; McClelland et al. 1995). Although the anterior temporal lobe has been implicated in fast mapping, word retrieval (Merhav et al. 2015), and enhanced retention after overnight consolidation (Mohan et al. 2022), inconsistent findings leave its role controversial and not yet conclusive (Smith et al. 2014; Warren et al. 2016; for a discussion, see Cooper et al. 2019), questioning the reliability of the underlying mechanisms and the methodological approaches of previous studies conducted with adults. Sensorimotor regions have also been implicated in rapid word learning, with transcranial magnetic stimulation over motor areas shown to modulate neural activity during brief learning sessions (Vukovic et al. 2021). Despite advances in understanding the cortical dynamics and plasticity underlying rapid symbolic learning, the precise neural mechanisms underlying these processes remain unclear. In particular, it is yet not fully clear how the brain fine-tunes vast numbers of synapses across distributed cortical networks, comprising thousands to millions of interacting neurons, to form rapid and stable memory traces.

3 Computational modeling for symbolic acquisition

Mathematical brain models have emerged as powerful tools for exploring the neural principles underlying language processing and its emergence. This approach is particularly fruitful, as it allows detailed examination of how internal representations emerge at the cellular and synaptic levels during learning. Standard ANNs have successfully achieved great performance in language and other tasks, as well as replicated aspects of their disorders (Chen et al. 2017; Kietzmann et al. 2019; Kriegeskorte 2019; Lake and Murphy 2023; LeCun et al. 2015; Mitchell and Krakauer 2023; Rogers et al. 2004; Rogers and McClelland 2004), ranging from speech and object recognition to translation, word acquisition, and semantic processing. Despite their power and human-like abilities, they require thousands of examples to form stable internal representations, thus relying on a slow, gradual learning process, in contrast to how human brains acquire novel knowledge. Previous modeling efforts have attempted to capture the rapid learning principle through a two-stage semantic learning process, involving an initial stage where concepts are encoded, followed by a second learning phase of symbolic learning that builds upon the previously established framework (Clay et al. 2024; Mayor and Plunkett 2010; Plunkett et al. 1992; Regier 2005). Although these models provide valuable insights into semantic and rapid learning, they differ significantly from realistic neurophysiological and anatomical structures of the human brain and employ biologically implausible supervised training methods like backpropagation (Mazzoni et al. 1991; O’Reilly 1998). Consequently, their applicability is limited when it comes to understanding rapid, cellular-level information storage and identifying key cortical regions involved in symbol acquisition. For instance, Mayor and Plunkett’s (2010) model demonstrated rapid word–reference learning after minimal exposure using a simple two-layer model architecture that directly linked visual and acoustic representations (e.g., the referent and acoustic word form dog). However, this model did not incorporate the broader network of cortical regions involved in word meaning processing in the human brain (see Section 2). As noted above, it is widely agreed that the biological realism of neural models is crucial for exploring the mechanisms underlying higher cognitive processing, as the structural properties of the human brain play a fundamental role in shaping distinct human capabilities (Breakspear 2017; Pezzulo et al. 2013; Pulvermüller et al. 2021; van Albada et al. 2021).

Following this approach, spiking brain-constrained neural models have been developed to specifically mimic the anatomical and physiological properties of 12 cortical regions of the frontal, temporal, and occipital cortices, as well as their connectivity structures relevant to language and symbolic processing (see Figure 1B; Tomasello et al. 2018; and early modeling architectures by Garagnani et al. 2008; Wennekers 2007). This approach has yielded an initial understanding of how different symbols (action and object-related words; e.g., run vs. sun) are instantiated in the human brain (Carriere et al. 2024; Garagnani and Pulvermüller 2016; Tomasello et al. 2017; Tomasello et al. 2018). These models were equipped with learning mechanisms based on Hebbian plasticity (Hebb 1949) including long-term potentiation and depression, which are well established biologically (Artola and Singer 1993; Tsumoto 1992). Acquisition of object and action words was simulated by co-activating neurons in primary visual and motor areas, to mimic key features of real-world situations, in which word forms are semantically grounded in information about their related objects and actions, consistent with reports from developmental studies (Tomasello 2003; Vouloumanos and Werker 2009). As a result, the model showed the spontaneous formation of cortical circuits (or the so-called cell assemblies) working as functional units (Braitenberg 1978; Garagnani et al. 2008; Palm et al. 2014; Pulvermüller 2013; Wennekers 2007). These circuits were scattered across areas exhibiting richer connections, so-called “convergence zones” (Damasio and Damasio 1994) or “connector hub areas” (Van den Heuvel and Sporns 2013), for general semantic processing and to modality preferential areas implicated for the processing of specific semantic categories, that is, temporo-visual regions for object words and frontal-motor regions for action words (Figure 1A and C; Tomasello et al. 2017, 2018). Intriguingly, neural circuits recruited additional neurons in primary visual (V1) and temporo-occipital (TO) areas differently when the model was deprived of visual input (i.e., in blindness) at either early or late in acquisition (Tomasello et al. 2024, 2019b). This set of simulation results replicated and provided a mechanistic explanation for a range of neurocognitive studies showing the involvement of sensorimotor and hub regions in word meaning processing (Binder and Desai 2011; Buccino et al. 2017; Carota et al. 2024; Hauk et al. 2004; Pulvermüller 2013; Pulvermüller and Fadiga 2010; Shebani et al. 2022) as well as visual cortex recruitment in cases of blindness (e.g., Amedi et al. 2003, 2004; Bedny et al. 2011). The key biological principles playing a role included connectivity structure and correlation learning, while neural circuit expansion mechanisms further explained the recruitment of visual neural matter in blindness. Recently, these neural network models were employed to explore abstract word processing (Dobler et al. 2024; Henningsen-Schomers and Pulvermüller 2022), language-driven concept formation (Nguyen et al. 2024), and the emergence of verbal working memory (Carriere et al. 2024), offering neuromechanistic insights into key aspects of linguistic phenomena. However, these modeling efforts, like standard ANNs using backpropagation, still rely on extensive training to establish internal representations. Biologically plausible learning rules, such as Hebbian learning, require numerous repetitions of co-activated neurons for stable neural representations to emerge due to their incremental synaptic changes (Froemke et al. 2010; Hebb 1949). This raises the critical question of whether a mathematical implementation capable of achieving rapid learning, similar to that observed in the human brain, is feasible at all. One hypothesis suggests that fast word–meaning mapping relies on pre-existing knowledge acquired through prior experiences (Macnamara 1972; see also Werker and Hensch 2015; and also Section 1). But how can this be explained at the neural matter level?

Figure 1: 
Schematic of the brain-constrained neural network and emergence of symbolic circuits. (A) Fronto-temporal-occipital areas relevant for language and semantic processing showing neural circuits emergence after word–meaning mapping. As a result of word learning, specific circuits of neurons distributed across the network were activated (one dot indicates one active model neuron; blue: object-word circuit; red: action-word circuit; yellow: both), shown in the black boxes representing areas. (B) Schematic display of the area structure and connectivity of the brain inspired model. (C) Bars show average numbers of neurons per area for object-word circuits (dark gray) and action-word circuits (light gray) averaged across multiple networks; whiskers show standard errors. Note the relatively stronger representation of object-word circuits in ventral-visual areas and that of action-word circuits in dorsolateral-frontal areas. Asterisks denote significant differences. (Figure adapted from Tomasello et al. 2017, 2018.)
Figure 1:

Schematic of the brain-constrained neural network and emergence of symbolic circuits. (A) Fronto-temporal-occipital areas relevant for language and semantic processing showing neural circuits emergence after word–meaning mapping. As a result of word learning, specific circuits of neurons distributed across the network were activated (one dot indicates one active model neuron; blue: object-word circuit; red: action-word circuit; yellow: both), shown in the black boxes representing areas. (B) Schematic display of the area structure and connectivity of the brain inspired model. (C) Bars show average numbers of neurons per area for object-word circuits (dark gray) and action-word circuits (light gray) averaged across multiple networks; whiskers show standard errors. Note the relatively stronger representation of object-word circuits in ventral-visual areas and that of action-word circuits in dorsolateral-frontal areas. Asterisks denote significant differences. (Figure adapted from Tomasello et al. 2017, 2018.)

Repeated interactions with the external world, through separate exposure to actions, objects, and word forms, activate specific neural populations in their respective cortical modalities motor, visual and lang´nge regions. According to Hebbian learning principles (Hebb 1949), this co-activation gradually leads to the formation of specialized neural circuits, distributed in distinct cortical brain regions (Braitenberg 1978; Palm 1982; Pulvermüller and Fadiga 2010; Pulvermüller et al. 2014; Wennekers et al. 2006). This process may enable efficient mapping between preformed neural circuits, such as those representing sensory concepts (e.g., visual features of an object) and those associated with linguistic representations (e.g., auditory and articulatory aspects of a word, see Pulvermüller 2023). For instance, a child frequently encountering different types of dogs would lead to the emergence of neural circuits across visual and temporal cortices that represent the concept of “dog”. Similarly, repeated exposure to the word dog, whether through hearing or babbling, leads to circuit formations in the perisylvian language cortex, reflecting auditory and articulatory features (see Figure 2A, left panel). In this scenario, the final step involves linking these preformed neural circuits through their co-activation when the child simultaneously encounters visual and linguistic input during symbolic learning (Figure 2A, right panel), thereby enabling rapid word–referent association.[2]

Figure 2: 
Rapid symbolic learning at the neural level and in a brain-constrained neural model. (A) Visual depiction of the pre-exposure phase and neural circuit formation for word form and visual object processing in separated instances, with rapid word–meaning mapping (symbolic learning) subsequently occurring post-exposure due to preformed circuits. (B) Brain-constrained neural network simulations depicting the pre-exposure phase and circuit formation within the network for object, action, and word form cell assemblies (CAs), with circuits primarily located in the visual, motor, and language cortices, respectively. Bar plots depict the formed neural circuit cell sizes across the different modeled regions, with asterisks indicating statistically significant differences. (C) Comparison of neural circuit formation across learning events in two scenarios: a two-stage learning model with pre-exposure to stimuli and a one-stage learning model without pre-exposure. The two-stage model demonstrates significantly faster mapping compared to the one-stage scenario. (Panels B and C adapted from Constant et al. 2023)
Figure 2:

Rapid symbolic learning at the neural level and in a brain-constrained neural model. (A) Visual depiction of the pre-exposure phase and neural circuit formation for word form and visual object processing in separated instances, with rapid word–meaning mapping (symbolic learning) subsequently occurring post-exposure due to preformed circuits. (B) Brain-constrained neural network simulations depicting the pre-exposure phase and circuit formation within the network for object, action, and word form cell assemblies (CAs), with circuits primarily located in the visual, motor, and language cortices, respectively. Bar plots depict the formed neural circuit cell sizes across the different modeled regions, with asterisks indicating statistically significant differences. (C) Comparison of neural circuit formation across learning events in two scenarios: a two-stage learning model with pre-exposure to stimuli and a one-stage learning model without pre-exposure. The two-stage model demonstrates significantly faster mapping compared to the one-stage scenario. (Panels B and C adapted from Constant et al. 2023)

Recent simulation work confirmed the neural hypothesis for rapid word–referent mapping (Figure 2A). By employing a brain-constrained neural model equipped with Hebbian learning, the model simulated a two-stage word learning process by learning object and action words grounded in the sensorimotor system. First, the model was pre-exposed to phonological and conceptual information; this was followed by symbolic associative learning that enabled rapid linking of preformed representations to their corresponding meanings. Intriguingly, in the pre-exposure simulations, all neural circuits extended into central hub areas within the network (Figure 2B), including the anterior temporal lobe (AT), a region identified as crucial for fast mapping (Atir-Sharon et al. 2015; see also Zaiser et al. 2022a). This finding suggests that such hub regions might serve as the primary site for rapid linking, as all neural circuits for objects, actions, and word forms extended into those regions. As a result, the network demonstrated rapid linking in the symbolic learning phase, with successful word-related circuit mapping occurring after just two to three exposures and most circuits fully established within 13 presentations in the model (Constant et al. 2023). Moreover, comparing a pre-exposed model with one that was not (mimicking direct associative learning) showed neural circuits emergence substantially later, requiring over 50 learning events (Figure 2C). Thus, this suggests that Hebbian learning is able to show rapid learning, especially when preformed neural representations are present, aligning with fast mapping theories (Atir-Sharon et al. 2015; Mayor and Plunkett 2010). While word–meaning mapping occurred rapidly, neural circuits became more stable and strongly mapped with repeated word–referent encounters, leading to more robust representations (Kucker et al. 2015), and potentially also explaining poorer initial retention shown in previous behavioral studies (e.g., Horst and Samuelson 2008). Notably, robust representations emerged much earlier in models with prior exposure compared to those without, underscoring the critical role of pre-exposure in facilitating rapid and stable learning.

Although the model was able to offer some key aspects of the underlying brain mechanism, much work is still needed. For instance, the pre-exposure of symbols and referents required up to 3,000 presentations to reach stable internal representations prior to associative learning, still relying on slow pre-acquired knowledge acquisition, which raises the question of how much pre-exposure is needed in this regard. Furthermore, object and action words were trained in the same manner, without accounting for their specific semantic features and properties. For example, the temporal and contextual dynamics involved in learning the word kicking, such as observing someone actively kicking a ball, differ significantly from learning the word dog, which often involves prolonged visual perception. Additionally, actions are often grounded in more contextual scenarios and typically do not occur in isolation. Future simulations should explore these different semantic features in action- versus object-word learning, potentially providing a neural explanation for the long-standing debate about the delayed acquisition of verbs (typically depicting actions) compared to nouns (object categories; Nelson 1973; for a discussion, see Waxman et al. 2013). Also in this regard, the model’s input structure did not consider shared semantic features between similar concepts (“dog” vs. “cat”) that might be represented by shared neural material in the cortex (Martin 2007; Pulvermüller et al. 2014). This is especially important, as previous studies have shown that highly shared features between familiar and novel referents can significantly facilitate rapid learning, as overlap in semantic characteristics helps the brain efficiently map new information onto existing knowledge (e.g., Himmer et al. 2017; Mak 2019; Zaiser et al. 2022a).

4 Conclusion and desiderata for artificial neural networks for rapid symbolic learning

Computational models have made significant progress in exhibiting human-like abilities across diverse domains. However, most models still lack the extraordinary ability of the human brain to acquire knowledge rapidly. While certain computational approaches have successfully achieved rapid symbolic learning, brain-constrained neural networks, designed to closely replicate the structure and functionality of the human brain, have been able to provide neuromechanistic insights at both cellular and cortical levels. Nevertheless, further research is essential to fully grasp the neural mechanisms underpinning this remarkable cognitive ability. As outlined above, key questions include the extent of pre-exposure to verbal and conceptual information, the role of shared semantic features, and the influence of existing vocabulary size. Each of these factors could impact rapid synaptic changes and the functional roles of cortical regions differently. Further work could also explore human-specific brain structures that may enable human-specific rapid learning capacities. Previous studies have shown that the connectivity within the perisylvian language cortices, particularly the arcuate fasciculus, is significantly more developed in humans than in nonhuman primates (e.g., Rilling 2014). Prior simulations (Carriere et al. 2024; Schomers et al. 2017) demonstrate that constraining networks with human-specific connectivity structures plays a pivotal role in the emergence of verbal working memory. Another important aspect not yet considered is the role of laminar structure and differentiation of distinct layers in the cortex, which has been proposed to vary between primary and higher-order cortical regions (Hilgetag et al. 2022), an important aspect that needs to be tackled in future studies regarding their role in the rapid formation of symbolic representations.

A breakthrough in understanding not only advances theoretical knowledge but can also guide modelers in developing more efficient learning strategies for neural network architectures. This effort is particularly critical nowadays, as high energy consumption remains a major criticism of current models, which require substantial computational power to achieve optimal performance (Cai et al. 2017; Tabbakh et al. 2024; Thompson et al. 2020). This energy-intensive demand necessitates extensive water usage for cooling data centers, further exacerbating the environmental impact. One of the most effective approaches to improving efficiency is parameter reduction, which involves decreasing the number of trainable weights and biases in neural networks (e.g., Frankle 2018). By reducing the model’s parameter count, it is possible to lower computational requirements and energy consumption, making neural networks more sustainable and suitable for deployment in resource-constrained environments. However, insights from neuroscience and language research could also be highly relevant. A valuable insight that could already inform current models is the importance of pre-exposure, driven by linguistic and cognitive theories, alongside designing architectures that closely mirror cortical structures involved in language processing, for example, the role of connector hub regions in linking distributed neural circuits and their connectivity patterns. Other factors may be critical as well, as mentioned above, which need to be examined first. All in all, examining critical biological principles with brain-constrained neural networks could further clarify which of these are most relevant for rapid learning.

A significant desideratum is that most current modeling approaches rely on web-based text or image processing, limiting their ability to accurately capture life-like language acquisition (for discussion, see Lake and Murphy 2023). Although efforts to integrate discourse-level information exist, such as communication patterns from forum discussions (see, e.g., Johns 2021), word meaning acquisition is inherently multimodal (e.g., Holler and Levinson 2019; Karadöller et al. 2024). Linguists and neuroscientists have long emphasized that word learning is fundamentally social (Tomasello 2000), unfolding within interactions where language functions as a tool for communication (Austin 1975; Grice 1957; Searle 1969; Wittgenstein 1953). In real-life language development, children receive a combination of verbal and nonverbal cues about referents and the surrounding communicative context. For example, children learn novel words like salt through varied interactions and modalities that may convey distinct communicative purposes: naming (This is salt) links words to objects, while questions (Is the salt on the table?) and requests (Pass me the salt) embed word meanings in contexts aimed at achieving specific goals. Prosodic contours (Villani et al. 2025) and accompanying gestures or facial expressions (Kelly 2006; Nota et al. 2021) further contribute to how such utterances are interpreted, offering crucial multimodal input during word learning. Yet another critical pragmatic feature in novel word–meaning acquisition is the “action sequence structure” associated with their function in communication: a request is usually followed by grasping and handing over an object, whereas naming is often followed by looking at the object and signaling recognition (Alston 1964; Fritz 2013). These action sequences appear to be a critical feature for understanding communicative functions (Pulvermüller 2018). Neuropragmatic research supports this view, showing that various speech acts conveyed through identical linguistic utterances across various modalities (e.g., prosodic, gestural) rely on distinct neural networks (for a review, see Tomasello 2023; Tomasello et al. 2025; Pulvermüller 2016). For instance, request and question elicit very early motor cortex activation during understanding, as compared to naming or statement functions expressed with the same utterance (Boux et al. 2021; Egorova et al. 2014, 2016; Tomasello et al. 2019a, 2022; van Ackeren et al. 2016; see Figure 5 in Tomasello 2023). This early motor cortex activation likely reflects the typical action sequences these communicative acts trigger, such as reaching to grasp an object or giving a verbal answer (Alston 1964; Fritz 2013; Levinson 1983), and may have an impact on the rate of novel rapid word–meaning mapping across different communicative contexts and the formation of their underlying neural circuits. Standard ANNs, are in general limited in capturing such rich, pragmatic, interactional dimensions and multimodality inherent in human language learning. Addressing these challenges at leastin part requires neural architectures capable of processing simultaneous multimodal inputs (visual, auditory, motor) to account for crucial pragmatic features of communicative functions, as well as, other highly relevant communicative features during conversation (e.g., Tomasello et al. 2025). Brain-constrained neural models designed for multimodal integration may offer a promising first step in this direction.


Corresponding author: Rosario Tomasello, Brain Language Laboratory, Department of Philosophy, Humanities, WE4, Freie Universität Berlin, Habelschwerdter Allee 45, 14195, Berlin, Germany; Cluster of Excellence “Matters of Activity. Image Space Material”, Humboldt Universität zu Berlin, Berlin, Germany; and Dahlem Center for Linguistics, Freie Universität Berlin, Berlin, Germany, E-mail:

Award Identifier / Grant number: EXC 2025-390648296

  1. Conflict of interest: The author declares no conflict of interest.

  2. Research funding: This work was supported by the Deutsche Forschungsgemeinschaft, Germany, via the Cluster of Excellence “Matters of Activity. Image Space Material” funded under Germany’s Excellence Strategy (EXC 2025–390648296), and the Open Access Funding provided by Freie Universität Berlin.

References

Aitchison, J. 2012. Words in the mind: An introduction to the mental lexicon. Oxford: John Wiley & Sons.Suche in Google Scholar

Alston, W. P. 1964. Philosophy of language. Englewood Cliffs, NJ: Prentice-Hall.Suche in Google Scholar

Amedi, A., A. Floel, S. Knecht, E. Zohary & L. G. Cohen. 2004. Transcranial magnetic stimulation of the occipital pole interferes with verbal processing in blind subjects. Nature Neuroscience 7. 1266. https://doi.org/10.1038/nn1328.Suche in Google Scholar

Amedi, A., N. Raz, P. Pianka, R. Malach & E. Zohary. 2003. Early “visual” cortex activation correlates with superior verbal memory performance in the blind. Nature Neuroscience 6. 758–766. https://doi.org/10.1038/nn1072.Suche in Google Scholar

Anglin, J. M., G. A. Miller & P. C. Wakefield. 1993. Vocabulary development: A morphological analysis. Monographs of the Society for Research in Child Development 58(10). i–186. https://doi.org/10.2307/1166112.Suche in Google Scholar

Artola, A. & W. Singer. 1993. Long-term depression of excitatory synaptic transmission and its relationship to long-term potentiation. Trends in Neurosciences 16. 480–487.10.1016/0166-2236(93)90081-VSuche in Google Scholar

Atir-Sharon, T., A. Gilboa, H. Hazan, E. Koilis & L. M. Manevitz. 2015. Decoding the formation of new semantics: MVPA investigation of rapid neocortical plasticity during associative encoding through fast mapping. Neural Plasticity 804385. 17. https://doi.org/10.1155/2015/804385.Suche in Google Scholar

Austin, J. L. 1975. How to do things with words. Oxford: Oxford University Press.10.1093/acprof:oso/9780198245537.001.0001Suche in Google Scholar

Bedny, M., Pascual-Leone, A., Dodell-Feder, D., E. Fedorenko & R. Saxe. 2011. Language processing in the occipital cortex of congenitally blind adults. Proceedings of the National Academy of Sciences 108. 4429–4434, https://doi.org/10.1073/pnas.1014818108.Suche in Google Scholar

Binder, J. R. & R. H. Desai. 2011. The neurobiology of semantic memory. Trends in Cognitive Sciences 15. 527–536. https://doi.org/10.1016/j.tics.2011.10.001.Suche in Google Scholar

Binder, J. R., R. H. Desai, W. W. Graves & L. L. Conant. 2009. Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cerebral Cortex 19. 2767–2796. https://doi.org/10.1093/cercor/bhp055.Suche in Google Scholar

Bion, R. A., A. Borovsky & A. Fernald. 2013. Fast mapping, slow learning: Disambiguation of novel word–object mappings in relation to vocabulary learning at 18, 24, and 30 months. Cognition 126. 39–53. https://doi.org/10.1016/j.cognition.2012.08.008.Suche in Google Scholar

Bloom, P. 2002. How children learn the meanings of words. Cambridge, MA: MIT Press.Suche in Google Scholar

Bookheimer, S. 2002. Functional MRI of language: New approaches to understanding the cortical organization of semantic processing. Annual Review of Neuroscience 25. 151–188. https://doi.org/10.1146/annurev.neuro.25.112701.142946.Suche in Google Scholar

Boux, I., R. Tomasello, L. Grisoni & F. Pulvermüller. 2021. Brain signatures predict communicative function of speech production in interaction. Cortex 135. 127–145. https://doi.org/10.1016/j.cortex.2020.11.008.Suche in Google Scholar

Braitenberg, V. 1978. Cell assemblies in the cerebral cortex. In R. Heim & G. Palm (eds.), Theoretical approaches to complex systems, 171–188. Berlin: Springer.10.1007/978-3-642-93083-6_9Suche in Google Scholar

Breakspear, M. 2017. Dynamic models of large-scale brain activity. Nature Neuroscience 20. 340–352. https://doi.org/10.1038/nn.4497.Suche in Google Scholar

Brysbaert, M., M. Stevens, P. Mandera & E. Keuleers. 2016. How many words do we know? Practical estimates of vocabulary size dependent on word definition, the degree of language input and the participant’s age. Frontiers in Psychology 7. 1116. https://doi.org/10.3389/fpsyg.2016.01116.Suche in Google Scholar

Buccino, G., B. F. Marino, C. Bulgarelli & M. Mezzadri. 2017. Fluent speakers of a second language process graspable nouns expressed in L2 like in their native language. Frontiers in Psychology 8. https://doi.org/10.3389/fpsyg.2017.01306.Suche in Google Scholar

Buccino, G., L. Riggio, G. Melli, F. Binkofski, V. Gallese & G. Rizzolatti. 2005. Listening to action-related sentences modulates the activity of the motor system: A combined TMS and behavioral study. Cognitive Brain Research 24. 355–363. https://doi.org/10.1016/j.cogbrainres.2005.02.020.Suche in Google Scholar

Cai, E., D.-C. Juan, D. Stamoulis & D. Marculescu. 2017. Neuralpower: Predict and deploy energy-efficient convolutional neural networks. Proceedings of Machine Learning Research 77. 622–637.Suche in Google Scholar

Carey, S. 2010. Beyond fast mapping. Language Learning and Development 6. 184–205. https://doi.org/10.1080/15475441.2010.484379.Suche in Google Scholar

Carey, S. & E. Bartlett. 1978. Acquiring a single new word. Papers and Reports on Child Language Development 15. 17–29.Suche in Google Scholar

Carota, F., H. Nili, N. Kriegeskorte & F. Pulvermüller. 2024. Experientially-grounded and distributional semantic vectors uncover dissociable representations of conceptual categories. Language, Cognition and Neuroscience 39. 1020–1044. https://doi.org/10.1080/23273798.2023.2232481.Suche in Google Scholar

Carriere, M., R. Tomasello & F. Pulvermüller. 2024. Can human brain connectivity explain verbal working memory? Network: Computation in Neural Systems 1–42. https://doi.org/10.1080/0954898X.2024.2421196.Suche in Google Scholar

Chen, L., M. A. Lambon Ralph & T. T. Rogers. 2017. A unified model of human semantic knowledge and its disorders. Nature Human Behaviour 1. https://doi.org/10.1038/s41562-016-0039.Suche in Google Scholar

Chen, S., Y. Wang & W. Yan. 2023. More stable memory retention of novel words learned from fast mapping than from explicit encoding. Journal of Psycholinguistic Research 52. 905–922. https://doi.org/10.1007/s10936-022-09921-4.Suche in Google Scholar

Clay, V., G. Pipa, K.-U. Kühnberger & P. König. 2024. Development of few-shot learning capabilities in artificial neural networks when learning through self-supervised interaction. IEEE Transactions on Pattern Analysis and Machine Intelligence 46. 209–219. https://doi.org/10.1109/TPAMI.2023.3323040.Suche in Google Scholar

Constant, M., F. Pulvermüller & R. Tomasello. 2023. Brain-constrained neural modeling explains fast mapping of words to meaning. Cerebral Cortex 33. 6872–6890. https://doi.org/10.1093/cercor/bhad007.Suche in Google Scholar

Cooper, E., A. Greve & R. N. Henson. 2019. Little evidence for fast mapping (FM) in adults: A review and discussion. Cognitive Neuroscience 10. 196–209. https://doi.org/10.1080/17588928.2018.1542376.Suche in Google Scholar

Coutanche, M. N. & S. L. Thompson-Schill. 2014. Fast mapping rapidly integrates information into existing memory networks. Journal of Experimental Psychology: General 143(6). 2296–2303. https://doi.org/10.1037/xge0000020.Suche in Google Scholar

Davis, M. H. & M. G. Gaskell. 2009. A complementary systems account of word learning: Neural and behavioural evidence. Philosophical Transactions of the Royal Society of London: Series B, Biological Sciences 364. 3773–3800. https://doi.org/10.1098/rstb.2009.0111.Suche in Google Scholar

Damasio, A. R. & H. Damasio. 1994. Cortical systems for retrieval of concrete knowledge: The convergence zone framework. In C. Koch & J. L. Davis (eds.), Large-scale neuronal theories of the brain, 61–74. Cambridge, MA: MIT Press.Suche in Google Scholar

de Diego Balaguer, R., J. M. Toro, A. Rodriguez-Fornells & A.-C. Bachoud-Lévi. 2007. Different neurophysiological mechanisms underlying word and rule extraction from speech. PLoS One 2(11). https://doi.org/10.1371/journal.pone.0001175.Suche in Google Scholar

Devlin, J., M.-W. Chang & K. Lee. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4171–4186. Minneapolis, Minnesota: Association for Computational Linguistics.Suche in Google Scholar

Dobler, F. R., M. R. Henningsen-Schomers & F. Pulvermüller. 2024. Verbal symbols support concrete but enable abstract concept formation: Evidence from brain-constrained deep neural networks. Language Learning 74(S1). 258–295. https://doi.org/10.1111/lang.12646.Suche in Google Scholar

Dreyer, F. R., D. Frey, S. Arana, S. von Saldern, T. Picht, P. Vajkoczy & F. Pulvermüller. 2015. Is the motor system necessary for processing action and abstract emotion words? Evidence from focal brain lesions. Frontiers in Psychology 6. https://doi.org/10.3389/fpsyg.2015.01661.Suche in Google Scholar

Egorova, N., F. Pulvermüller & Y. Shtyrov. 2014. Neural dynamics of speech act comprehension: An MEG study of naming and requesting. Brain Topography 27. 375–392. https://doi.org/10.1007/s10548-013-0329-3.Suche in Google Scholar

Egorova, N., Y. Shtyrov & F. Pulvermüller. 2016. Brain basis of communicative actions in language. NeuroImage 125. 857–867. https://doi.org/10.1016/j.neuroimage.2015.10.055.Suche in Google Scholar

Frankland, P. W. & B. Bontempi. 2005. The organization of recent and remote memories. Nature Reviews Neuroscience 6. 119–130. https://doi.org/10.1038/nrn1607.Suche in Google Scholar

Frankle, J. & M. Carbin. 2018. The lottery ticket hypothesis: Finding sparse trainable neural networks. arXiv preprint arXiv:1803.03635.Suche in Google Scholar

Friedrich, M. & A. D. Friederici. 2011. Word learning in 6-month-olds: Fast encoding–weak retention. Journal of Cognitive Neuroscience 23. 3228–3240.10.1162/jocn_a_00002Suche in Google Scholar

Fritz, G. 2013. Dynamische texttheorie. Gießen: Gießener elektronische bibliothek.Suche in Google Scholar

Froemke, R. C., D. Debanne & G.-Q. Bi. 2010. Temporal modulation of spike-timing-dependent plasticity. Frontiers in Synaptic Neuroscience 2. https://doi.org/10.3389/fnsyn.2010.00019.Suche in Google Scholar

Garagnani, M. & F. Pulvermüller. 2016. Conceptual grounding of language in action and perception: A neurocomputational model of the emergence of category specificity and semantic hubs. European Journal of Neuroscience 43. 721–737. https://doi.org/10.1111/ejn.13145.Suche in Google Scholar

Garagnani, M., T. Wennekers & F. Pulvermüller. 2008. A neuroanatomically grounded Hebbian-learning model of attention-language interactions in the human brain. European Journal of Neuroscience 27. 492–513. https://doi.org/10.1111/j.1460-9568.2008.06015.x.Suche in Google Scholar

Goldstein, M. H., A. P. King & M. J. West. 2003. Social interaction shapes babbling: Testing parallels between birdsong and speech. Proceedings of the National Academy of Sciences USA 100. 8030–8035. https://doi.org/10.1073/pnas.1332441100.Suche in Google Scholar

Grice, H. P. 1957. Meaning. Philosophical Review 66. 377–388. https://doi.org/10.2307/2182440.Suche in Google Scholar

Grisoni, L., R. Tomasello & F. Pulvermüller. 2021. Correlated brain indexes of semantic prediction and prediction error: Brain localization and category specificity. Cerebral Cortex 31. 1553–1568. https://doi.org/10.1093/cercor/bhaa308.Suche in Google Scholar

Hauk, O., I. Johnsrude & F. Pulvermüller. 2004. Somatotopic representation of action words in human motor and premotor cortex. Neuron 41. 301–307. https://doi.org/10.1016/S0896-6273(03)00838-9.Suche in Google Scholar

Hebb, D. O. 1949. The organization of behavior. New York: John Wiley.Suche in Google Scholar

Henningsen-Schomers, M. R. & F. Pulvermüller. 2022. Modelling concrete and abstract concepts using brain-constrained deep neural networks. Psychological Research 86. 2533–2559. https://doi.org/10.1007/s00426-021-01591-6.Suche in Google Scholar

Hilgetag, C. C., A. Goulas & J.-P. Changeux. 2022. A natural cortical axis connecting the outside and inside of the human brain. Network Neuroscience 6. 950–959. https://doi.org/10.1162/netn_a_00256.Suche in Google Scholar

Himmer, L., E. Müller, S. Gais & M. Schönauer. 2017. Sleep-mediated memory consolidation depends on the level of integration at encoding. Neurobiology of Learning and Memory 137. 101–106. https://doi.org/10.1016/j.nlm.2016.11.019.Suche in Google Scholar

Holler, J. & S. C. Levinson. 2019. Multimodal language processing in human communication. Trends in Cognitive Sciences 23. 639–652. https://doi.org/10.1016/j.tics.2019.05.006.Suche in Google Scholar

Horst, J. S. & L. K. Samuelson. 2008. Fast mapping but poor retention by 24-month-old infants. Infancy 13. 128–157. https://doi.org/10.1080/15250000701795598.Suche in Google Scholar

Johns, B. T. 2021. Disentangling contextual diversity: Communicative need as a lexical organizer. Psychological Review 128. 525–557. https://doi.org/10.1037/rev0000265.Suche in Google Scholar

Karadöller, D. Z., B. Sümer & A. Özyürek. 2024. First-language acquisition in a multimodal language framework: Insights from speech, gesture, and sign. First Language. https://doi.org/10.1177/01427237241290678.Suche in Google Scholar

Kelly, B. F. 2006. The development of constructions through early gesture use. Constructions in Acquisition 174. 15.Suche in Google Scholar

Kietzmann, T. C., P. McClure & N. Kriegeskorte. 2019. Deep neural networks in computational neuroscience. In Oxford research encyclopedia of neuroscience. (accessed 10 September 2025).10.1093/acrefore/9780190264086.013.46Suche in Google Scholar

Kimppa, L., T. Kujala, A. Leminen, M. Vainio & Y. Shtyrov. 2015. Rapid and automatic speech-specific learning mechanism in human neocortex. NeuroImage 118. 282–291. https://doi.org/10.1016/j.neuroimage.2015.05.098.Suche in Google Scholar

Kriegeskorte, N. & T. Golan. 2019. Neural network models and deep learning. Current Biology 29. R231–R236. https://doi.org/10.1016/j.cub.2019.02.034.Suche in Google Scholar

Kucker, S. C., B. McMurray & L. K. Samuelson. 2015. Slowing down fast mapping: Redefining the dynamics of word learning. Child Development Perspectives 9. 74–78. https://doi.org/10.1111/cdep.12110.Suche in Google Scholar

Kuhnke, P., M. C. Beaupain, J. Arola, M. Kiefer & G. Hartwigsen. 2023. Meta-analytic evidence for a novel hierarchical model of conceptual processing. Neuroscience & Biobehavioral Reviews 144. https://doi.org/10.1016/j.neubiorev.2022.104994.Suche in Google Scholar

Lake, B. M. & G. L. Murphy. 2023. Word meaning in minds and machines. Psychological Review 130. 401–431. https://doi.org/10.1037/rev0000297.Suche in Google Scholar

Lake, B. M., T. D. Ullman, J. B. Tenenbaum & S. J. Gershman. 2017. Building machines that learn and think like people. Behavioral and Brain Sciences 40. https://doi.org/10.1017/S0140525X16001837.Suche in Google Scholar

LeCun, Y., Y. Bengio & G. Hinton. 2015. Deep learning. Nature 521. 436–444. https://doi.org/10.1038/nature14539.Suche in Google Scholar

Leminen, A., E. Partanen & Y. Shtyrov. 2023. Electrophysiology of word learning. In M. Grimaldi, E. Brattico & Y. Shtyrov (eds.), Language electrified, 505–525. New York, NY: Humana.10.1007/978-1-0716-3263-5_15Suche in Google Scholar

Levinson, S. C. 1983. Pragmatics. Cambridge: Cambridge University Press.Suche in Google Scholar

Macnamara, J. 1972. Cognitive basis of language learning in infants. Psychological Review 79. 1–13. https://doi.org/10.1037/h0031901.Suche in Google Scholar

Mak, M. H. C. 2019. Why and how the co-occurring familiar object matters in fast mapping (FM)? Insights from computational models. Cognitive Neuroscience 10. 229–231. https://doi.org/10.1080/17588928.2019.1593121.Suche in Google Scholar

Markson, L. & P. Bloom. 1997. Evidence against a dedicated system for word learning in children. Nature 385. 813–815. https://doi.org/10.1038/385813a0.Suche in Google Scholar

Martin, A. 2007. The representation of object concepts in the brain. Annual Review of Psychology 58. 25–45. https://doi.org/10.1146/annurev.psych.57.102904.190143.Suche in Google Scholar

Mayor, J. & K. Plunkett. 2010. A neurocomputational account of taxonomic responding and fast mapping in early word learning. Psychological Review 117. 1–31. https://doi.org/10.1037/a0018130.Suche in Google Scholar

Mazzoni, P., R. A. Andersen & M. I. Jordan. 1991. A more biologically plausible learning rule for neural networks. Proceedings of the National Academy of Sciences 88. 4433–4437. https://doi.org/10.1073/pnas.88.10.4433.Suche in Google Scholar

McClelland, J. L., B. L. McNaughton & R. C. O’Reilly. 1995. Why there are complementary learning systems in the hippocampus and necortex: Insights from the successes and failures of connectionist models of learning and memory. Psychological Review 102. 419–457. https://doi.org/10.1037/0033-295X.102.3.419.Suche in Google Scholar

Merhav, M., A. Karni & A. Gilboa. 2015. Not all declarative memories are created equal: Fast mapping as a direct route to cortical declarative representations. NeuroImage 117. 80–92. https://doi.org/10.1016/j.neuroimage.2015.05.027.Suche in Google Scholar

Mestres-Missé, A., A. Rodriguez-Fornells & T. F. Münte. 2007. Watching the brain during meaning acquisition. Cerebral Cortex 17. 1858–1866. https://doi.org/10.1093/cercor/bhl094.Suche in Google Scholar

Meteyard, L. 2012. Trial shows only that practice varies. BMJ 345. e6022. Author reply e6023–e6022; author reply e6023. https://doi.org/10.1136/bmj.e6022.Suche in Google Scholar

Milton, J. & J. Treffers-Daller. 2013. Vocabulary size revisited: The link between vocabulary size and academic achievement. Applied Linguistics Review 4. 151–172. https://doi.org/10.1515/applirev-2013-0007.Suche in Google Scholar

Mitchell, M. & D. C. Krakauer. 2023. The debate over understanding in AI’s large language models. Proceedings of the National Academy of Sciences 120(13). https://doi.org/10.1073/pnas.2215907120.Suche in Google Scholar

Mohan, M. P., R. N. Menon, S. P. Goswami, S. V. Thomas, A. Cherian & A. Radhakrishnan. 2022. Exploring novel word learning via fast mapping and explicit encoding in persons with temporal lobe epilepsy. Annals of Indian Academy of Neurology 25. 1080–1086. https://doi.org/10.4103/aian.aian_222_22.Suche in Google Scholar

Nelson, K. 1973. Structure and strategy in learning to talk. Monographs of the Society for Research in Child Development 38(1). https://doi.org/10.2307/1165788.Suche in Google Scholar

Nguyen, P., M. Henningsen-Schomers & F. Pulvermüller. 2024. Causal influence of linguistic learning on perceptual and conceptual processing: A brain-constrained deep neural network study of proper names and category terms. Journal of Neuroscience 44(9). https://doi.org/10.1523/JNEUROSCI.1048-23.2023.Suche in Google Scholar

Nota, N., J. P. Trujillo & J. Holler. 2021. Facial signals and social actions in multimodal face-to-face interaction. Brain Sciences 11. https://doi.org/10.3390/brainsci11081017.Suche in Google Scholar

O’Reilly, R. C. 1998. Six principles for biologically based computational models of cortical cognition. Trends in Cognitive Sciences 2. 455–462. https://doi.org/10.1016/S1364-6613(98)01241-8.Suche in Google Scholar

Palm, G. 1982. Neural assemblies: An alternative approach to artificial intelligence. Secaucus, NJ: Springer-Verlag New York.Suche in Google Scholar

Palm, G., A. Knoblauch, F. Hauser & A. Schüz. 2014. Cell assemblies in the cerebral cortex. Biological Cybernetics 108. 559–572. https://doi.org/10.1007/s00422-014-0596-4.Suche in Google Scholar

Partanen, E., A. Leminen, S. de Paoli, A. Bundgaard, O. S. Kingo, P. Krøjgaard & Y. Shtyrov. 2017. Flexible, rapid and automatic neocortical word form acquisition mechanism in children as revealed by neuromagnetic brain response dynamics. NeuroImage 155. 450–459. https://doi.org/10.1016/j.neuroimage.2017.03.066.Suche in Google Scholar

Patterson, K., P. J. Nestor & T. T. Rogers. 2007. Where do you know what you know? The representation of semantic knowledge in the human brain. Nature Reviews Neuroscience 8. 976–987. https://doi.org/10.1038/nrn2277.Suche in Google Scholar

Pezzulo, G., L. W. Barsalou, A. Cangelosi, M. H. Fischer, K. McRae & M. Spivey. 2013. Computational grounded cognition: A new alliance between grounded cognition and computational modeling. Frontiers in Psychology 3. https://doi.org/10.3389/fpsyg.2012.00612.Suche in Google Scholar

Plunkett, K., C. Sinha, M. F. Møller & O. Strandsby. 1992. Symbol grounding or the emergence of symbols? Vocabulary growth in children and a connectionist net. Connection Science 4. 293–312. https://doi.org/10.1080/09540099208946620.Suche in Google Scholar

Pulvermüller, F. 2013. How neurons make meaning: Brain mechanisms for embodied and abstract-symbolic semantics. Trends in Cognitive Sciences 17. 458–470. https://doi.org/10.1016/j.tics.2013.06.004.Suche in Google Scholar

Pulvermüller, F. 2016. Language, action, interaction: Neuropragmatic perspectives on symbols, meaning, and context-dependent function. In A. K. Engel, K. J. Friston & D. Kragic (eds.), The pragmatic turn: Toward action-oriented views in cognitive science, 139–158. Boston, MA: MIT Press.10.7551/mitpress/10709.003.0011Suche in Google Scholar

Pulvermüller, F. 2018. Neural reuse of action perception circuits for language, concepts and communication. Progress in Neurobiology 160. 1–44. https://doi.org/10.1016/j.pneurobio.2017.07.001.Suche in Google Scholar

Pulvermüller, F. 2023. Neurobiological mechanisms for language, symbols and concepts: Clues from brain-constrained deep neural networks. Progress in Neurobiology 230. https://doi.org/10.1016/j.pneurobio.2023.102511.Suche in Google Scholar

Pulvermüller, F. & L. Fadiga. 2010. Active perception: Sensorimotor circuits as a cortical basis for language. Nature Reviews Neuroscience 11. 351–360. https://doi.org/10.1038/nrn2811.Suche in Google Scholar

Pulvermüller, F., M. Garagnani & T. Wennekers. 2014. Thinking in circuits: Toward neurobiological explanation in cognitive neuroscience. Biological Cybernetics 108. 573–593. https://doi.org/10.1007/s00422-014-0603-9.Suche in Google Scholar

Pulvermüller, F., R. Tomasello, M. R. Henningsen-Schomers & T. Wennekers. 2021. Biological constraints on neural network models of cognitive function. Nature Reviews Neuroscience 22. 488–502. https://doi.org/10.1038/s41583-021-00473-5.Suche in Google Scholar

Regier, T. 2005. The emergence of words: Attentional learning in form and meaning. Cognitive Science 29. 819–865.10.1207/s15516709cog0000_31Suche in Google Scholar

Reilly, R. C. O. 1999. Six principles for biologically-based computational models of cortical cognition. Trends in Cognitive Sciences 2. 455–462.10.1016/S1364-6613(98)01241-8Suche in Google Scholar

Reznick, J. S. & B. A. Goldfield. 1992. Rapid change in lexical development in comprehension and production. Developmental Psychology 28(3). 406–413. https://doi.org/10.1037/0012-1649.28.3.406.Suche in Google Scholar

Rilling, J. K. 2014. Comparative primate neuroimaging: Insights into human brain evolution. Trends in Cognitive Sciences 18. 46–55. https://doi.org/10.1016/j.tics.2013.09.013.Suche in Google Scholar

Rogers, T. T., M. A. Lambon Ralph, P. Garrard, S. Bozeat, J. L. McClelland, J. R. Hodges & K. Patterson. 2004. Structure and deterioration of semantic memory: A neuropsychological and computational investigation. Psychological Review 111. 205–235. https://doi.org/10.1037/0033-295X.111.1.205.Suche in Google Scholar

Rogers, T. T. & J. L. McClelland. 2004. Semantic cognition: A parallel distributed processing approach. Cambridge, MA: MIT Press.10.7551/mitpress/6161.001.0001Suche in Google Scholar

Rumelhart, D. E., G. E. Hinton & R. J. Williams. 1986. Learning representations by back-propagating errors. Nature 323. 533–536. https://doi.org/10.1038/323533a0.Suche in Google Scholar

Samuelson, L. K. & B. McMurray. 2017. What does it take to learn a word? Wiley Interdisciplinary Reviews: Cognitive Science 8. https://doi.org/10.1002/wcs.1421.Suche in Google Scholar

Schomers, M. R., M. Garagnani & F. Pulvermüller. 2017. Neurocomputational consequences of evolutionary connectivity changes in perisylvian language cortex. Journal of Neuroscience 37. 3045–3055. https://doi.org/10.1523/JNEUROSCI.2693-16.2017.Suche in Google Scholar

Searle, J. R. J. 1969. Speech acts: An essay in the philosophy of language, vol. 626, 203. Cambridge University Press.10.1017/CBO9781139173438Suche in Google Scholar

Sharon, T., M. Moscovitch & A. Gilboa. 2011. Rapid neocortical acquisition of long-term arbitrary associations independent of the hippocampus. Proceedings of the National Academy of Sciences 108. 1146–1151. https://doi.org/10.1073/pnas.1005238108.Suche in Google Scholar

Shebani, Z., F. Carota, O. Hauk, J. B. Rowe, L. W. Barsalou, R. Tomasello & F. Pulvermüller. 2022. Brain correlates of action word memory revealed by fMRI. Scientific Reports 12. 1–15. https://doi.org/10.1038/s41598-022-19416-w.Suche in Google Scholar

Shebani, Z. & F. Pulvermüller. 2013. Moving the hands and feet specifically impairs working memory for arm- and leg-related action words. Cortex 49. 222–231. https://doi.org/10.1016/j.cortex.2011.10.005.Suche in Google Scholar

Shtyrov, Y. 2011. Fast mapping of novel word forms traced neurophysiologically. Frontiers in Psychology 2. https://doi.org/10.3389/fpsyg.2011.00340.Suche in Google Scholar

Shtyrov, Y., M. Filippova, E. Blagovechtchenski, A. Kirsanov, E. Nikiforova & O. Shcherbakova. 2021. Electrophysiological evidence of dissociation between explicit encoding and fast mapping of novel spoken words. Frontiers in Psychology 12. https://doi.org/10.3389/fpsyg.2021.571673.Suche in Google Scholar

Shtyrov, Y., M. Filippova, E. Perikova, A. Kirsanov, O. Shcherbakova & E. Blagovechtchenski. 2022. Explicit encoding vs. fast mapping of novel spoken words: Electrophysiological and behavioural evidence of diverging mechanisms. Neuropsychologia 172. https://doi.org/10.1016/j.neuropsychologia.2022.108268.Suche in Google Scholar

Shtyrov, Y., V. V. Nikulin & F. Pulvermüller. 2010. Rapid cortical plasticity underlying novel word learning. Journal of Neuroscience 30. 16864–16867. https://doi.org/10.1523/JNEUROSCI.1376-10.2010.Suche in Google Scholar

Smith, C. N., Z. J. Urgolites, R. O. Hopkins & L. R. Squire. 2014. Comparison of explicit and incidental learning strategies in memory-impaired patients. Proceedings of the National Academy of Sciences 111. 475–479. https://doi.org/10.1073/pnas.1322263111.Suche in Google Scholar

Swingley, D. 2010. Fast mapping and slow mapping in children’s word learning. Language Learning and Development 6(3). 179–183. https://doi.org/10.1080/15475441.2010.484412.Suche in Google Scholar

Tabbakh, A., L. Al Amin, M. Islam, G. M. I. Mahmud, I. K. Chowdhury & M. S. H. Mukta. 2024. Towards sustainable AI: A comprehensive framework for Green AI. Discover Sustainability 5. 408. https://doi.org/10.1007/s43621-024-00641-4.Suche in Google Scholar

Thompson, N. C., K. Greenewald, K. Lee & G. F. Manso. 2020. The computational limits of deep learning. arXiv preprint arXiv:2007.05558.Suche in Google Scholar

Tomasello, M. 2000. The social-pragmatic theory of word learning. Pragmatics 10. 401–413. https://doi.org/10.1075/prag.10.4.01tom.Suche in Google Scholar

Tomasello, M. 2003. Constructing a language: A usage based theory of language acquisition. Boston: Harvard University Press.Suche in Google Scholar

Tomasello, R. 2023. Linguistic signs in action: The neuropragmatics of speech acts. Brain and Language 236. https://doi.org/10.1016/j.bandl.2022.105203.Suche in Google Scholar

Tomasello, R., I. P. Boux & F. Pulvermüller. 2025. Theory of mind and the brain substrates of direct and indirect communicative action understanding. Philosophical Transactions of the Royal Society B 380. https://doi.org/10.1098/rstb.2023.0497.Suche in Google Scholar

Tomasello, R., M. Carriere & F. Pulvermüller. 2024. The impact of early and late blindness on language and verbal working memory: A brain-constrained neural model. Neuropsychologia 196. https://doi.org/10.1016/j.neuropsychologia.2024.108816.Suche in Google Scholar

Tomasello, R., M. Garagnani, T. Wennekers & F. Pulvermüller. 2017. Brain connections of words, perceptions and actions: A neurobiological model of spatio-temporal semantic activation in the human cortex. Neuropsychologia 98. 111–129. https://doi.org/10.1016/j.neuropsychologia.2016.07.004.Suche in Google Scholar

Tomasello, R., M. Garagnani, T. Wennekers & F. Pulvermüller. 2018. A neurobiologically constrained cortex model of semantic grounding with spiking neurons and brain-like connectivity. Frontiers in Computational Neuroscience 12. https://doi.org/10.3389/fncom.2018.00088.Suche in Google Scholar

Tomasello, R., L. Grisoni, I. Boux, D. Sammler & F. Pulvermüller. 2022. Instantaneous neural processing of communicative functions conveyed by speech prosody. Cerebral Cortex 32(21). 4885–4901. https://doi.org/10.1093/cercor/bhab522.Suche in Google Scholar

Tomasello, R., C. Kim, F. R. Dreyer, L. Grisoni & F. Pulvermüller. 2019a. Neurophysiological evidence for rapid processing of verbal and gestural information in understanding communicative actions. Scientific Reports 9. 16285. https://doi.org/10.1038/s41598-019-52158-w.Suche in Google Scholar

Tomasello, R., T. Wennekers, M. Garagnani & F. Pulvermüller. 2019b. Visual cortex recruitment during language processing in blind individuals is explained by Hebbian learning. Scientific Reports 9. https://doi.org/10.1038/s41598-019-39864-1.Suche in Google Scholar

Torkildsen, J. K. von, J. M. Svangstu, H. F. Hansen, L. Smith, H. G. Simonsen, I. Moen & M. Lindgren. 2008. Productive vocabulary size predicts event-related potential correlates of fast mapping in 20-month-olds. Journal of Cognitive Neuroscience 20. 1266–1282. https://doi.org/10.1162/jocn.2008.20087.Suche in Google Scholar

Tsumoto, T. 1992. Long-term potentiation and long-term depression in the neocortex. Progress in Neurobiology 39. 209–228. https://doi.org/10.1016/0301-0082(92)90011-3.Suche in Google Scholar

van Ackeren, M. J., A. Smaragdi & S. A. Rueschemeyer. 2016. Neuronal interactions between mentalising and action systems during indirect request processing. Social Cognitive and Affective Neuroscience 11. 1402–1410. https://doi.org/10.1093/scan/nsw062.Suche in Google Scholar

van Albada, S. J., A. Morales-Gregorio, T. Dickscheid, A. Goulas, R. Bakker, S. Bludau, G. Palm, C.-C. Hilgetag & M. Diesmann. 2021. Bringing anatomical information into neuronal network models. In M. Giugliano, M. Negrello & D. Linaro (eds.), Computational modelling of the brain, 201–234. Cham: Springer.10.1007/978-3-030-89439-9_9Suche in Google Scholar

Van den Heuvel, M. P. & O. Sporns. 2013. Network hubs in the human brain. Trends in Cognitive Sciences 17. 683–696. https://doi.org/10.1016/j.tics.2013.09.012.Suche in Google Scholar

Vasilyeva, M. J., V. M. Knyazeva, A. A. Aleksandrov & Y. Shtyrov. 2019. Neurophysiological correlates of fast mapping of novel words in the adult brain. Frontiers in Human Neuroscience 13. https://doi.org/10.3389/fnhum.2019.00304.Suche in Google Scholar

Villani, C., I. P. Boux, F. Pulvermüller & R. Tomasello. 2025. The time course of speech act recognition conveyed by speech prosody: A gating study. Language, Cognition and Neuroscience 40. 1065–1084. https://doi.org/10.1080/23273798.2025.2506641.Suche in Google Scholar

Vouloumanos, A. & J. F. Werker. 2009. Infants’ learning of novel words in a stochastic environment. Developmental Psychology 45. 1611–1617. https://doi.org/10.1037/a0016134.Suche in Google Scholar

Vukovic, N., B. Hansen, T. E. Lund, S. Jespersen & Y. Shtyrov. 2021. Rapid microstructural plasticity in the cortical semantic network following a short language learning session. PLoS Biology 19. https://doi.org/10.1371/journal.pbio.3001290.Suche in Google Scholar

Warren, D. E., D. Tranel & M. C. Duff. 2016. Impaired acquisition of new words after left temporal lobectomy despite normal fast-mapping behavior. Neuropsychologia 80. 165–175. https://doi.org/10.1016/j.neuropsychologia.2015.11.016.Suche in Google Scholar

Waxman, S., X. Fu, S. Arunachalam, E. Leddon, K. Geraghty & H. Song. 2013. Are nouns learned before verbs? Infants provide insight into a long-standing debate. Child Development Perspectives 7. 155–159. https://doi.org/10.1111/cdep.12032.Suche in Google Scholar

Wennekers, T. 2007. A cell assembly model for complex behaviour. Neurocomputing 70. 1988–1992. https://doi.org/10.1016/j.neucom.2006.10.079.Suche in Google Scholar

Wennekers, T., M. Garagnani & F. Pulvermüller. 2006. Language models based on Hebbian cell assemblies. Journal of Physiology-Paris 100(1). 16–30. https://doi.org/10.1016/j.jphysparis.2006.09.007.Suche in Google Scholar

Werker, J. F. & T. K. Hensch. 2015. Critical periods in speech perception: New directions. Annual Review of Psychology 66. 173–196. https://doi.org/10.1146/annurev-psych-010814-015104.Suche in Google Scholar

Wittgenstein, L. 1953. Philosophical investigations. Oxford: Blackwell.Suche in Google Scholar

Zaiser, A.-K., R. Bader & P. Meyer. 2022a. High feature overlap reveals the importance of anterior and medial temporal lobe structures for learning by means of fast mapping. Cortex 146. 74–88. https://doi.org/10.1016/j.cortex.2021.07.017.Suche in Google Scholar

Zaiser, A.-K., P. Meyer & R. Bader. 2022b. High feature overlap and incidental encoding drive rapid semantic integration in the fast mapping paradigm. Journal of Experimental Psychology: General 151(1). 97–120. https://doi.org/10.1037/xge0001070.Suche in Google Scholar

Received: 2024-12-09
Accepted: 2025-07-08
Published Online: 2025-10-21

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Heruntergeladen am 5.11.2025 von https://www.degruyterbrill.com/document/doi/10.1515/lingvan-2024-0249/html?lang=de
Button zum nach oben scrollen