Preprint of:

Thelen, E., Schöner, G., Scheier, C. & Smith, L., B. (2000) The Dynamics of Embodiment: A Field Theory of Infant Perseverative Reaching. Behavioral and Brain Sciences 24 (1): XXX-XXX.

Paper also available in PDF FORMAT


This is the unedited final draft of a BBS target article that has been accepted for publication (Copyright 2000: Cambridge University Press) and is currently being circulated for Open Peer Commentary.

This preprint is for inspection only, to help prospective commentators decide whether or not they wish to prepare a formal commentary.

Please do not prepare a commentary unless you have received a formal invitation indicating that it has been possible to include you in the final list of invited commentators.

For information on becoming a commentator on this or other BBS target articles, write to bbs@soton.ac.uk

For information about subscribing or purchasing offprints of the published version, with commentaries and author's response, write to:

journals_subscriptions@cup.org (North America)
journals_subscriptions@cup.cam.ac.uk (All other countries).


The Dynamics of Embodiment:

A Field Theory of Infant Perseverative Reaching
 

Esther Thelen1

Gregor Schöner2,

Christian Scheier3

and

Linda B. Smith1





1 Department of Psychology and Program in Cognitive Science,
Indiana University,
Bloomington,
IN 47405,
USA
thelene@indiana.edu
smith4@indiana.edu
 
2Centre de Recherche en Neurosciences Cognitives,
C.N.R.S.,
Marseille,
Cedex 20,
France.
gregor@lnf.cnrs-mrs.fr
 
3 Department of Biology,
California Institute of Technology,
Pasadena,
CA 91125, USA
scheier@neuro.caltech.edu

 

 

Esther Thelen has a Ph.D in Biological Sciences from the University of Missouri and has been Professor of Psychology and Cognitive Science at Indiana University-Bloomington since 1985. Her research has centered on the acquisition of motor skills in infants, the relation of movement to cognition, and developmental theory. She is the author of over 85 papers and, with Linda Smith, the book, A Dynamical Systems Approach to the Development of Cognition and Action. She recently served as the President of the International Society for Infant Studies.
 
 
 
 
 
 

Gregor Schöner is a Directeur de Recherche at the Centre de Recherche en Neurosciences Cognitives of the French Centre National de la Recherche Scientifique. Originally trained as a theoretical physicist he has developed ideas from dynamical systems theory for such diverse fields as perception, movement,cortical neurophysiology, and autonomous robotics, with a recent interest in cognition. Over the last 15 years he published over 85 papers in experimental psychology, neuroscience, and engineering. He has lectured on dynamical systems ideas to many different audiences, with the goal of contributing to interdisciplinary interchange.
 
 
 
 
 
 

Christian Scheier is a postdoctoral fellow at the California Institute of Technology. He is the author of over 40 publications in the areas of time series analysis, autonomous agents, crossmodal psychophysics and infant development. He has recently published, together with Rolf Pfeifer, a book on embodied cognitive science with MIT Press. He did his Ph.D. on category learning in autonomous agents at the Artificial Intelligence Laboratory at the University of Zurich.
 
 
 
 
 
 
 
 

Linda Smith is a Chancellor's Professor of Psychology and Cognitive Science at Indiana University - Bloomington. She received her B.S. degree in 1973 from the University of Wisconsin - Madison and her Ph.D. in psychology from the University of Pennsylvania in 1977. She joined the faculty at Indiana University in 1977. Her research is directed to understanding developmental processes especially at it applies to cognition and word learning. She has published over 80 research articles and is co-author with Esther Thelen of A Dynamical Systems Approach to the Development of Cognition and Action. Her research is supported by grants from the National Institutes of Child Health and Development and the National Institute of Mental Health.
 
 
 
 
 
 


Abstract:

The overall goal of this target article is to demonstrate a mechanism for an embodied cognition. The particular vehicle is a much-studied, but still widely debated phenomenon seen in 7-12 month-old-infants. In Piaget's classic "A-not-B error," infants who have successfully uncovered a toy at location "A" continue to reach to that location even after they watch the toy hidden in a nearby location "B." Here we question the traditional explanations of the error as indicator of infants' concepts of objects or other static mental structures. Instead, we demonstrate that the A-not-B error and its previously puzzling contextual variations can be understood by the coupled dynamics of the ordinary processes of goal-directed actions: looking, planning, reaching, and remembering. We offer a formal dynamic theory and model based on cognitive embodiment that both simulates the known A-not-B effects and offers novel predictions that match new experimental results. The demonstration supports an embodied view by casting the mental events involved in perception, planning, deciding and remembering in the same analogic dynamic language as that used to describe bodily movement, so that they may be continuously meshed. We maintain that this mesh is a pre-eminently cognitive act of "knowing" not only in infancy but also in everyday activities throughout the life span.
 

Keywords:

Cognitive development, dynamical systems theory, embodied cognition, infant development, motor control, motor planning, perception and action.



 

"It is far too little recognized how entirely the intellect is built up of practical interests" (William James 1897 p.36 ).

1. Introduction

A century after William James created a psychology based on the primacy of experience, the ideas of embodiment are again entering the debate in the mind sciences. To say that cognition is embodied means that it arises from bodily interactions with the world. From this point of view, cognition depends on the kinds of experiences that come from having a body with particular perceptual and motor capabilities that are inseparably linked and that together form the matrix within which reasoning, memory, emotion, language and all other aspects of mental life are meshed. The contemporary notion of an embodied cognition stands in contrast to the prevailing cognitivist stance which sees mind as a device to manipulate symbols and is thus concerned with the formal rules and processes by which the symbols appropriately represent the real world. There is now converging interest in embodiment from scholars in philosophy, cognitive science, psychology, linguistics, robotics, and neuroscience (Almássy et al. 1998; Ballard et al. 1997; Brooks 1991; Chiel & Beer 1997; Clark 1997; Damasio 1994; Edelman 1987; Fogel 1993; Glenberg 1997; Gibson 1969; Harnad 1990; Johnson 1987; Lakoff 1987; Lakoff & Johnson 1990; Merleau-Ponty 1963; Pfeiffer& Scheier 1999; Sheets-Johnstone, 1990; Talmy 1988; Thelen 1995; Thelen & Smith 1994; Varela et al. 1991.)

In this target article, we contribute to this multidisciplinary effort by focusing in considerable detail on a particular, and controversial, phenomenon seen in human infants, the so called, "A-not-B error." We present theory, a model, simulations, and experiments that recast this phenomenon in embodied terms, using assumptions and the formal language of dynamic systems. We believe this offers an attractive starting point for an embodied cognition for several reasons:

1. We cast the mental events involved in perception, planning, deciding, and remembering in the analogic language of dynamics. This situates cognition within the same continuous, time-based, and nonlinear processes as those involved in bodily movement, and in the large-scale processes in the nervous system (Freeman & Skarda 1985; Kelso 1995; Koch & Davis 1994; Port & van Gelder 1995; Singer 1990; Turvey 1990; van Gelder, 1998). Finding a common language for behavior, body, and brain is a first step for banishing the specter of dualism once and for all.

2. Because perception, action, decision, execution, and memory are cast in compatible task dynamics, the processes can be continuously meshed together. This changes the information-processing flow from the traditional input-transduction-output stream to one of time-based and often shifting patterns of cooperative and competitive interactions. The advantage is the ability to capture the subtle contextual and temporal influences that are the hallmarks of real life behavior in the world.

3. We address specifically the developmental origins of cognition. Since Piaget (1952, 1954), it has been widely acknowledged that all forms of human thought must somehow arise from the purely sensorimotor activities of infants. But it is also generally assumed that the goal of development is to rise above the "mere sensorimotor" into symbolic and conceptual modes of functioning. The task of the developmental researcher, in this view, has been to unearth the "real" cognitive competence of the child unfettered by performance deficits from immature perception, attention, or motor skills. This division between what children really "know" and what they can demonstrate they know has been a persistent theme in developmental psychology (Gelman 1979; Spelke 1990). We argue here that these discontinuities are untenable. Our message is: if we can understand this particular infant task and its myriad contextual variations in terms of coupled dynamic processes, then the same kind of analysis can be applied to any task at any age. If we can show that "knowing" cannot be separated from perceiving, acting, and remembering, then these processes are always linked. There is no time and no task when such dynamics cease and some other mode of processing kicks in. Body and world remain ceaselessly melded together.

The burden of this larger agenda rests on our dissection and reinterpretation of a classic infant perseverative reaching phenomenon, the well-known "A-not-B" error (Piaget 1954). The dynamic field model formalizes a new approach to this error first suggested in conceptual form in Thelen & Smith (1994) and subsequently extended and supported by a series of experiments described in Smith, et al. (1999b). The model is an adaptation of Erlhagen & Schöner's (1998) dynamic neural field theory of motor programming (Schöner et al. 1997) and thus offers a bridge between the more general processes of motor planning and execution and the developmentally specific effects revealed by the A-not-B task. The model offers a powerful and parsimonious, yet biologically plausible, account of the many contextual influences on A-not-B tasks that have puzzled developmental psychologists for two generations. More generally, it demonstrates the elegance and usefulness of dynamic systems principles and language for understanding the intertwined processes of perceiving, deciding, acting, and remembering, and their changes over time.

The article proceeds as follows: First, we describe the A-not-B error in its canonical form and the variations that constitute the data to-be-explained. We show how previous explanations each capture some of the phenomenon but fail to account for all the known effects. Next we lay out the broad outlines of the new approach put forth in Thelen & Smith (1994) and Smith et al. (1999b) and the empirical support for that approach. Then we introduce the major assumptions of the dynamic field model and discuss why the model is so well-suited to explaining the A-not-B error. This is followed by a description of the model, and a series of simulations that capture the main A-not-B effect as well as the known contextual variations. We evaluate the strengths and shortcomings of this model in relation to other explanations. And finally, we offer some speculations about the model's more general usefulness for integrating multiple, time-based processes of human cognition and action.

2. The A-not-B error

The A-not-B error was first described by Jean Piaget (1954) in the context of his life-long quest for the developmental origins of knowledge. Piaget was particularly concerned with the question of when infants understand the properties of objects, and especially that objects continue to exist even when they cannot be seen or directly acted upon. Through a series of clever hiding games he played with his own children, Piaget discovered that such object knowledge arises gradually and rather late in infancy. Before 7 or 8 months of age, infants refuse to search for a toy hidden under a cover, as though the toy simply ceased to exist. After about 12 months, they search robustly, even after the toy is hidden in several places in succession. But between 7-12 months, infants display a peculiar kind of "partial knowledge" where they search at one location, but cannot switch their search if the toy is then switched to a second or subsequent hiding place. Piaget labeled this behavior as "Stage IV" in his series of stages in the development of adult-like object permanence: Infants act as if the toy had lasting existence only where it first disappeared.

In the decades since Piaget's first descriptions, the hiding task has been repeated countless times, in laboratories all over the world, and with myriad variations (see reviews by Acredolo 1985; Bremner 1985; Diamond 1990a, 1990b; Harris 1987, Markovitch & Zelazo, 1999; Munakata, 1998; Wellman et al. 1986, among others.). The classic, canonical version goes like this: An investigator hides a small, attractive toy under one of two identical hiding places, usually cloth covers or hiding wells with lids, and allows the infant to search and recover the toy. After a number of hidings and recoveries from the first location, "A," the investigator hides the toy in the second location "B" in full sight of the baby. If there is a few second delay between the hiding event and when the infant is permitted to search, eight-to-ten-month-old infants reliably make the "A-not-B" error, that is they reach back to the original location "A," even though they saw the toy hidden at "B."

It is both surprising and endearing to see infants so determined to make a mistake. But the reason that the A-not-B error has intrigued developmentalists for nearly fifty years is not just the phenomenon itself, but the questions raised by the many variations of the task studied over that time. Here is the puzzle: while the A-not-B error is entirely robust in the canonical form we described above, even seemingly small alterations in the task conditions can disrupt it. Nearly every aspect of the event matters: the visual properties of the hiding locations, including the distinctiveness, distance, number, and transparency of the covers (e.g. Bremner 1978b; Butterworth 1977; Butterworth et al. 1982; Horobin & Acredolo 1986; Sophian 1985), the delay between hiding and search (e.g. Diamond 1985; Gratch et al. 1974; Harris 1973), whether search involves reaching or just looking (e.g., Hofstadter & Reznick 1996) whether there are landmarks in the environment (Acredolo 1979), whether infants search for objects, food treats or people (Diamond 1997), whether the task is done at home or in the laboratory (Acredolo 1979), whether the infant or the hiding places have been moved (e.g., Bremner 1978a) , and the infants' amount of crawling experience (e.g. Bertenthal & Campos 1990; Horobin & Acredolo 1986). Such diverse context effects pose a serious challenge to Piaget's original interpretation. If the A-not-B error is a true measure of the status of infants' representations of objects, how can it be that what they know depends on so many seemingly irrelevant factors? How can it be that infants have a more mature object concept at home than in the laboratory or when the object is a cookie rather than a small toy?

2.1 Some explanations

The contemporary consensus is that Piaget's account is incorrect, but opinions differ on why the classic explanation is insufficient. One group of developmentalists argue that Piaget was asking the right question, but that he simply chose the wrong behavioral task to answer it. (Baillargeon & Graber 1988; Baillargeon & DeVos 1991, Bertenthal 1996; Diamond 1990b; Munakata et al. 1997). These theorists focus on the striking decalage between what infants seem to know about hidden objects when they manually search for them compared to when they just watch hiding events. Experiments using visual violation-of-expectancy measures have shown that many months before infants routinely make the A-not-B error when reaching for hidden objects, they seem to expect that objects will be retrieved from the place they were just hidden when no reaches are involved. They demonstrate this knowledge by looking longer at events where an object is plucked from a place other than the one where the baby watched it last disappear (e.g.,Ahmed & Ruffman 1998; Baillargeon & Graber 1988). Reaching is the problem, some argue, because it requires a "stronger" object representation than looking (Munakata et al. 1997), or because it involves additional means-ends performance demands (Baillargeon & Graber 1988), or because the "knowing" system is unable to control the "acting" system (Ahmed & Ruffman 1998; Bertenthal 1996). Infants, Diamond (1990b, p. 662) maintains, "really know where the [object] is even when they reach back to where they last found it."

One foundational assumption behind these dual-process (knowing vs. acting) accounts is that there lives, in the baby's head, a creature that is smarter than the body it inhabits; that there is a sharp partition between the mental events that precede the decision to act and the action itself. In this scenario, an object concept develops that is disembodied, timeless, and modular, and that may or may not actually motivate behavior. Moreover, there is the second assumption that because looking is motorically less complex than reaching, it will have a privileged access into the object knowledge module; it is a better measure of infants' "real" object concept. Bertenthal (1996) takes this further to suggest that knowing and acting are two dissociable systems, with different anatomical bases. One system represents objects, develops early, and is tapped by the visual expectancy studies. The A-not-B error comes from a second, "perception-action" system and has nothing to do with object representation. But if the A-not-B error is indeed not about the object concept at all, the intriguing question remains. What are the mechanisms that account, in the same baby, for accurate performance under some circumstances and perseverative reaching under other conditions?

One way to view the A-not-B error is simply as a reach to the wrong location. Indeed, a second group of theorists explain the A-not-B error as a manifestation of infants' immature abilities to direct their movements in space. Here the basic assumption is that young infants tend to represent space egocentrically, that is, based on their own bodies, rather than the objects' true positions in space, or an allocentric representation. For instance, infants who are trained at A and then moved around the table 180 degrees reach correctly to B at the B cue, which is still the A position from the baby's perspective (Bremner 1978a,b; Bremner & Bryant 1977). Indeed, studies show that conditions that provide clues in the environment that help disambiguate the two identical targets, A and B, tend to decrease perseverative reaching. These include making the hiding covers more distinctive, adding landmarks in the room, or testing infants in more familiar environments (see Acredolo 1985; Wellman et al. 1986). In addition, there is a strong association between infants experience in self-locomotion and correct responses (Acredolo 1985; Bell & Fox 1992; Bertenthal & Campos 1990; Kermoian & Campos 1988). Self-locomotion is believed to increase infants' visual attention to spatial locations and thereby, their ability to code them allocentrically. The spatial hypothesis itself is not sufficient, however, because it does not account for the delay effect (why would babies' spatial coding change from allocentric to egocentric in the three second delay?) nor for the looking-reaching decalage (allocentric when looking and egocentric when reaching?).

Another way to view the A-not-B task without invoking object permanence is as behavior requiring memory for a location and a motor response. In a series of influential papers, Diamond (1985, 1988, 1990a, 1990b) invoked these two processes to address two of the important context effects: the necessary delay between hiding and retrieval, and the ability of older infants to tolerate longer delays. Diamond proposed that two processes combine: the error results from infants' poor memories for the hiding place and their inability to inhibit strong motor responses. Having once reached to A, infants must inhibit this prepotent response in order to shift to the B place. The delay is important because in infants, both the memory for the hiding location and the ability to inhibit responses are short-lived. Thus, over the few second delay between hiding and retrieval both the memory of the hiding place and the ability to inhibit decline. At the B trial, this leads to a B reach with a short delay and a return to the old, prepotent A response with longer delays. With age, the persistence of these processes increases and infants do not err at short delays. The source of the developmental effect, according to Diamond, is maturational change in the dorsolateral prefrontal cortex, a brain area identified by lesion studies to be involved in both memory and response inhibition (Diamond & Goldman-Rakic, 1989). This account, while powerful, is incomplete. It does not offer a principled explanation of the spatial location or context effects described above, nor why locomotion would hasten the decline of perseveration. And it cannot account for the looking-reaching decalage.

Thus, each of these accounts captures some truths about infant perseverative reaching, but none has a full explanation of both the canonical error, and the richly-documented effects of context which are part and parcel of the same phenomenon. In our theory, we incorporate aspects of many of the explanations of our predecessors. First, we agree with some of our colleagues that the A-not-B error is not about an object concept per se. Smith et al (1999b) performed a critical experiment: infants were tested in the canonical task with one difference, there were no hidden objects at all. When simply cued to location A or B by waving the lids covering the wells, infants made the A-not-B error just as robustly as they did when objects were actually hidden and recovered. But we deeply disagree with the widely held assumptions that knowing and acting are modular and dissociable. Indeed the cornerstone of our dynamic model is that "knowing" is perceiving, moving, and remembering as they evolve over time, and that the error can be understood simply and completely in terms of these coupled processes.

Second, we further agree that the A-not-B error is about moving to a location in space. But it is also about remembering a cued location, and being unable to inhibit a previous response. What we will demonstrate with the model, however, is that there is no need to posit such individual and separate mechanisms such as egocentric or allocentric coding or memory or response inhibition deficits or incomplete object knowledge. Infants indeed sometimes act as though their responses are egocentric or as though they lack memory or inhibition. But they act that way because of the coupled interactions of the very same dynamic processes that make them appear to sometimes "know" where the object was hidden. Because all of the processes contributing to the behavior are coupled, continuous, and based in time, we can account in one model for both the error itself and for the decline in perserverative responding in different situations and at different ages.

3. A dynamic systems approach

The starting point of the dynamic model is with new assumptions. The A-not-B error is not about what infants have and don't have as enduring concepts, traits, or deficits, but what they are doing and have done. What they do is reach repeatedly to one location and then return to the original location when the goal has changed. From a dynamic perspective, this perseveration is emergent from the real-time dynamics of visually elicited reaching, the memory dynamics of repeating the same action several times in succession, and the intrinsic dynamics of these processes in infants. According to our view, the error arises from the same multiple processes that produce goal-directed reaching at any age and we indeed create the error using a model originally formulated to simulate the general process of motor planning for reaching. The age and context effects arise naturally from the parameters of the model, which in turn, we derive from a fine analysis of the actual task.

3.1 A task analysis

Thus, to begin, we describe the error task in the most simple behavioral terms. Then, we identify the factors in the task itself or in the baby that are known to affect the behavioral outcome, that is, the tendency to reach to A when cued at B. We report here the details of the canonical task used in Smith et al (1999b). There were two versions of this task, which differed only in whether a toy was actually hidden or whether there was no hidden object and the infant was cued simply with the cover to the hiding well (lids-only). If this account seems burdened with details, it is because these details matter profoundly, as we will further document below.

For both the hidden toy and lids-only versions, the infant sat on a parents' lap at a small table facing the experimenter, surrounded by neutral and unmarked walls or screens. Here are the events that transpired in a typical A-side hiding trial. The infants first saw the hiding box, which was 30 cm by 23 cm by 5.5 cm. It was painted brown and contained two circular hiding wells, each with a radius of 4.5 cm and depth of 4.5 cm. The centers of the wells were 12.5 cm apart. The wells were each covered by a circular wooden lid with a small round handle in the middle, all painted the same brown color as the box. The box and the covers were in view throughout the whole procedure and thus constituted a continuous visual task input, as depicted in Figure 1. The notable characteristic of the task input was its lack of visual specificity. The two lids were indistinguishable from each other and blended into the background of the box. No familiar or distinct environmental landmarks were evident. Once the two lids were in place, there was little else to demarcate them except their relative spatial position.

Figure 1: A task analysis of the canonical A-not-b error, depicting a typical A-side hiding event. The box and hiding wells constitute the continually present visual task input. The specific input comes from the transient cue of hiding the toy in the A well. A delay is imposed between hiding and allowing the infant to reach. During these events, the infant looks at the events, remembers the cued location and undertakes a mental planning process leading to activation of reach parameters, followed by reaching itself. Finally, the infant remembers the parameters of the current reach.

The trial began with the box well out of the infants' reach. The experimenter first called attention to the toy by waving it or tapping it on the box, always in the vicinity of the A side, and often calling the child's name. (All experiments were counterbalanced: half the infants had the A side on their right and half had the A side on their left.) When the infant was clearly looking at the toy, the experimenter hid the toy under the lid. This visual (and auditory) cue we call the specific input, a transitory indication of which lid is specified as the goal (Figure 1). (In the lids-only condition, no toy was hidden and only the lid to the well was waved and tapped.) We then imposed a short delay of 3 seconds for 8-month-olds and 5 seconds for ten-month-olds. During the delay, the infants most often looked at the cued location. Infrequently, they also glanced at the experimenter. During this delay, infants needed to remember the cued location in the face of the ambiguous task input. After the delay, the experimenter pushed the box into the infants' reach.

After infants saw the specific location cue, they needed to decide whether to reach to the A or to the B side, and plan the appropriate movement parameters to actually activate the muscles of their arms to go to the right or to the left. This planning was done largely in the absence of the well-specified goal. We have conceived this planning in Figure 1 as mental activation functions that precede the reach and are initially equally likely at A or B, but gradually shift to the A side. (We shall further justify this depiction below.) The reach itself was initiated once activation reached a certain threshold, sufficient to activate the muscles to move the arm in the specified direction. The infants continued to look at the A target throughout the planning and reaching.

Finally, the infants grasped the target toy or lid and shifted their glances back to the experimenter, who also withdrew the test box. A critical assumption on our part at this juncture is that the just executed reaching act was remembered for some unknown time after the reaching act was completed. Again, we will elaborate further on this memory process in a later section.

The A trials were repeated several times before infants were asked to switch to B for two trials. The events for the B trials were identical to those at the A side except that the experimenter hid the toy in or cued the previously unspecified location. Under these canonical conditions in both hiding and lids-only conditions, 70-80% of infants reached back to A on the first B trial, and an equal proportion continued to perseverate on the second B trial. This rate is significantly different from chance responding, that is, from an equal probability to go for either target after the B cue.

3.2. Events and processes in the A-not-B task

A foundational assumption of our theory and model is that both the error and correct reaching emerge from the coupled dynamics of looking, planning, reaching, and remembering within the particular context of the task: repeating a novel and confusing reaching action. Having described these events and processes in a canonical A-not-B trial, we now further analyze their contributions to the infants' behavior. It is important to note here that although we address these contributions one at a time, the point of our dynamic analysis is that they are continuously coupled and interactive. Everything counts! These detailed context and infant effects are the data upon which the model is based. We need not invoke any new constructs or traits.

3.2.1. Task input. As we mentioned earlier, the notable characteristic of the test scene was that the hiding box and wells were poorly specified as targets. Once the toy was hidden or the lid placed on the box, infants could easily confuse the targets, requiring them to decide on a reaching direction in the absence of well-marked locational cues. Indeed, there is ample evidence in the literature that perseverative responding depends on target ambiguity and that therefore, errors are reduced by manipulations that make the targets more distinct from one another (see Wellman et al. 1986). For instance, the error is decreased when the two hiding locations are visually very different, for example, when one cover is blue and the other cover is white (Butterworth et al. 1982). There is less clear evidence on the importance of the spatial separation of the targets, but for the most part, only relatively small differences in this distance have been manipulated (Wellman, et al. 1986; Sophian 1985). However, in the study with the largest target separation, only 6 of 56 9-month old infants made the A-not-B error, far less than would be expected in the canonical situation (Horobin & Acredolo 1986). Using multiple, rather than just two, hiding locations both reduces error and increases correct responding, meaning that infants return to the just cued location more than at just chance levels (Horobin & Acredolo 1986; Wellman, et al. 1986). The effect of multiple targets as part of the task input may be to add spatial landmarks to the scene, giving the infant other relational cues for reach direction once the specific input is completed. Indeed, the addition of the familiar landmarks of infants' homes dramatically reduced egocentric responding (Acredolo 1979).

In addition to the A-not-B task being visually confusing for babies, it is also entirely novel. Infants have been grabbing objects for four or five months before they typically do the A-not-B test, but during this time, they have been reaching for well-specified, single, often colorful and highly distinct toys. Before this, no one has asked them to repeatedly choose between identical, closely spaced, boring items such as the covers to the hiding locations.

Indeed, the task is so confusing and novel for infants that they have to be trained to do it. All A-not-B studies involve some, often unspecified, number of training trials. For example, in the canonical hidden object task used by Smith et al. (1999b) the training consisted of four trials in which infants were gradually led from a familiar task to the novel A-not-B test. To do this, the experimenter first placed the toy alone at the edge of the box at the A location and slid the box forward toward the infant, while verbally encouraging the infant to reach for the toy. This task was familiar and infants reached reliably. On the second training trial (A2), the toy was placed inside the well but left uncovered and on the third trial (A3), the toy was partially hidden in the well. In the last training trial (A4) the toy was completely hidden. In the lids-only version, the A-side lid was progressively moved back over the four trials from the front edge of the box to a position in line with the B side lid. The training was followed by two test trials at A with the toy completely hidden or with the lids lined up equidistant from the edge of the box and then the two B trials where the cue was switched, as described above.

Because the task input is novel and confusing, Smith et al. (1999b) predicted that without the training events to help disambiguate the A and B locations, infants would not be strongly disposed to reach to A even on the first A trial. They tested this by eliminating the four pre-training trials and commencing with only the two A test trials. With no training, only 35% of the infants reached correctly to A on the first A trial, compared to over 75% of infants who had been conventionally trained. Moreover, without training, a sizeable proportion of the infants did not reach at all, but just stared at the hiding wells, as though they could not figure out what to do, and they often refused to reach on the remaining trials as well.

In previous studies of A-not-B, training was justified as necessary to teach babies to reach for a hidden or unfamiliar object, and it does accomplish this. But this training, when done at the A location, also accustoms infants to repeatedly reach to A. In an effort to demonstrate these two effects, Smith et al (1999b) trained infants for the typical A not B task, not at the two-welled test box, but at a different box with only a single, center well. They reasoned that, given the confusion of the targets and the fewer reaches to A, training at this single "C" location should render infants less likely to consistently reach to A on both A and B trials. Indeed, infants trained at the single well were more likely to reach to A on the A trials than babies with no pretraining. Simple practice with the hiding and recovery events in one location made the two-well condition less novel. But the "train at C" infants were also less likely to reliably reach to A on both the A and B trials than those babies given the standard training regime. Stable behavior at A and perseveration back to A depended on previously reaching to the A side. (We will discuss the effects of repetition further below.) Thus, the training trials, thought to be merely "warm-up" by previous investigators, are also critical contributors to the dynamics that produce the A-not-B error, both in helping disambiguate the confusing targets--affecting A-side behavior-- and building up a tendency to stay at A when cued at B--affecting the appearance of the error.

Thus, target ambiguity alone is not the single cause of perseveration. But coupled with the cue, the delay, and the particular dynamics of infant reaching and remembering: The relative ambiguity of the task input is a critical parameter in the model.

3.2.2 Specific input. Given the largely tonic and confusing nature of the task input, infants receive transient clues to where to reach by the actions of the experimenter who waves and/or taps the toy or target lid. This activity invariably directed infants' visual attention to the target location, and they continued to look at that location throughout the hiding and reaching event (Smith et al. 1999b). The finding that waving the lid to the hiding well was as effective in producing perseverative reaching as was actually hiding a toy demonstrates that it is not hiding per se that is the critical stimulus. However, this does not mean that the nature of the specific input is unimportant. In our dynamic view, it is one of the critical events.

Recall that the specific input serves to demarcate the target location and then it disappears; the infant must remember it when it is no longer evident. Given that the specific input serves both to capture visual attention and to provide a spatial target in memory, it follows that some specific inputs may be more effective in either or both of these purposes than others. Some objects and events may be more interesting for babies and some may be more memorable than others. Specific inputs with more "punch," therefore should increase correct responding, both on A and on B trials.

We found some support for this assumption in the literature. For instance, inspired by Smith et al.'s (1999b) demonstration of perseverative responding with no hidden objects, Munakata (1997) tried the following variant: She presented infants with only lids on the A trials, but on the B trials, either hid a toy at B or cued just the lid. In the lids-only condition, infants reached perseveratively at A, replicating Smith et al.(1999b) The error was decreased, however, in the condition where the toy was shown for the first time at B. Munakata (1997) interpreted this as infants representing lids-hiding-toys differently from lids with no toys. Our interpretation of these interesting results is simpler: the novel and visually interesting toy captured visual attention more strongly than the lid alone and pulled infants away from their habitual response. This interpretation is further supported by Munakata's (1997) second experiment where first toys were hidden on A and then infants were cued with either toys or just lids on B. In this manipulation, infants made the error in both conditions. Of course, because both the lid and the toy were used on the A trials, neither the lid alone nor lid-plus-toy provided a sufficiently strong specific input to counteract the repeated reaches to A.

Our assumption that the strength of the specific cue makes a difference is further confirmed by experiments reported recently by Diamond (1997). Diamond coded infants' levels of interest in the hidden toys as they were hidden and then uncovered. Infants made significantly fewer A-not-B errors when the toy was different from that used on previous trials and the when infants' interest in that toy was high. More remarkably, when Diamond substituted pieces of cookie for hidden toys, all infants reached correctly, even at delays in which they had previously erred when toys were hidden.

Furthermore, when the "punch" of the specific input is enhanced by an event that increases infants' visual attention in one direction, it should be more potent in influencing reaching direction. Likewise, when the specific cue is diminished by competing claims on infants' visual attention, the power of that cue for directing the subsequent reach should be weakened. This is exactly what Smith et al. (1999b) found. In one experiment, these investigators manipulated the direction of infants' gaze after the A trials by simply tapping on a little blue rod placed either on the far right or the far left side of the testing arena. After 4 training trials, half the infants' had their visual attention directed to the just trained A side and half to the upcoming B side. Reliably more infants in the A-side-tap condition reached to A on the A trials than those in the B-side-tap condition, and although there were no further taps, these infants also reached more to A on the B trials. In a second experiment, the researchers provided the pulls to visual attention before the B trials. In this case, infants given the additional cue to A all stayed at A, committing the error. Conversely, and in line with predictions, infants seeing the tap in the direction of B reached correctly on the B trials,

The critical feature, then, of the contribution of the specific cue is not whether it is a toy or a lid or a cookie or just an event to look at, but the power of the stimulus for capturing infants' attention and/or for remaining in memory when it is no longer in view. When the specific input is strong it will have a powerful influence; when it is weak, it may be swamped by the other system dynamics.

Thus: The relative strength of the specific input is a second critical element in the model.

3.2.3. Delay. Infants do not perseverate at any age if they are allowed to reach immediately after the object is hidden. The error emerges in the delay between the cessation of the specific input and when infants are permitted to act by sliding the box forward (Figure 1). In addition, as they get older, infants require increasingly longer gaps before they perseverate. Eight- month-olds make the error with a 3 second delay, but 10-month-olds require 5 seconds to perseverate. The delay effect is robust (e.g. Diamond 1985; Harris 1973; Wellman, et al. 1986) and is especially hard to reconcile with a strictly Piagetian interpretation of infants' stage-specific object knowledge since there is no reason why infants would know less after a longer delay, when they presumably have more time to process the situation.

What sorts of mechanisms can account for this apparent switch from right-to-wrong in 3 seconds? One class of explanations invokes a shift of level of processing within the delay period. For instance, Gratch et al. (1974) proposed that at no delay, infants' actions are guided by a motor memory of the most recent event, but at the longer delay, the concept of the hidden object kicks in, so infants return to the habitual hiding location. Harris (1983, 1987) agreed that the no-delay performance was dominated by a simple motor response, but also, at longer delays, infants had problems with object identity in that they believed the object hidden at B was not the same one that was previously hidden at A. In contrast, Wellman, et al. (1986) saw the delay as revealing a conflict between two mechanisms for search. At short delays, infants relied on more immature, direct-search strategies: go where the object last disappeared. Longer delays, however, allowed activation of the conflicting, albeit more mature, strategy of an inferred search based on the movement of the object to its hiding position, leading to sometimes correct and sometimes errorful actions. Competing short and longer-term memories are also at the heart of the recent connectionist model of Munakata (1998).

As mentioned earlier, these accounts have several troubling aspects. Strategy-based theories are hard to reconcile with the age and delay interactions. If retrieving a correct strategy takes time, as infants get older and as they are given more time to retrieve a correct strategy (longer delay), they should be increasingly correct, but the reverse is true. Even more problematic, in our view, is the sharp distinction between knowledge (more mature, more conceptual) and action (more immature, less planful), especially in the face of compelling evidence that this "knowledge" comes and goes with each variant of the task.

Nonetheless, the heart of the explanation of the A-not-B task, we believe, lies in what happens during this delay when infants are faced with the ambiguous task input and yet must decide whether to go to A or to B based on a cue that is no longer present. Our model is similar to Diamond's well-known theory or to the recent connectionist model suggested by Munakata (1998), in postulating interacting dynamic processes that lead to one behavioral outcome at short delays and the probability of another outcome as time passes. But, specifically and uniquely, we place the locus of these dynamics in the motor planning process, envisioned as a continuous dynamic field that evolves under the influence of several input parameters, and whose behavior can be sampled during the delay.

In sum: What happens during the delay is also critical as it is a window on the natural dynamics of the contributing processes.

3.2.4. Reaching. Our focus on the A-not-B error as centered in the motor plan for reaching is well-supported by the developmental evidence. As we mentioned earlier, one of the most serious challenges for Piaget's original explanation was the discovery of the apparent decalage between infants' understanding of objects when reaching for them compared to when they were questioned by looking measures alone. Thus, even several months before they demonstrate reliable reaching errors, infants are not apparently confused about the last location of a hidden object when just looking. Likewise, 8-12 month-old infants tolerate a longer delay before erring in a visual violation-of-expectancy search task than when manually searching (Ahmed & Ruffman 1998). Moreover, when infants only watch the hiding events at A and B, they more often look correctly at B than they reach to B in the conventional search task (Hofstadter & Reznick 1996.) Thus, at the very same age, infants show that they "know" more about hidden objects in the visual modality than with manual action.

The apparent confusion of what infants "really know" in the wake of such conflicting stories is only an issue when the foundational assumption is that infants "really know" something in the absence of the processes that demonstrate, in the moment, what they do in light of what they have just done. As argued in Smith et al. (1999b), the A-not-B error is about the behavior of reaching in the context of a particular perceptual scene, specific task dynamics, and dynamics of reaching and remembering intrinsic to infants at particular stages of development. Perseveration or the lack thereof in looking tasks has its own contributing dynamics, which may or may not produce the same behavioral outcome (and clearly does not). Neither reaching nor looking is a better measure of the infant mind; both are very revealing windows into the complex coupled dynamics that produce goal-directed behavior. But whether infants do better at looking than reaching is somewhat of a side issue for our present goal which is to elucidate the dynamics of the reaching A-not-B error. Once these dynamics are understood, a similar analysis can be applied to looking tasks and comparisons made.

At the heart of our model, therefore, is the act of reaching, which requires that infants see and remember a target location as a goal, that they plan the appropriate movement parameters for a trajectory in space and time that will transport their hands to the desired object, and that they activate muscles that will carry out their intended movement. These processes are perceptual and they are motor and they are cognitive. Indeed, the model demonstrates very clearly the impossibility of making clear distinctions between these processes as they evolve in coupled and parallel fashion over the time of each trial and the time of the whole experiment.

In particular, we focus on the evolution of the reaching plan between the time the target is cued and infants actually move their arms forward to pick up the toy or lid. This plan, we contend, is the locus of both the error and correct responding because it integrates the perceptual input of both the tonic task conditions, the cuing of the location with toy or lid, the infants' visual attention to that cue, and the memory of previous actions in the same situation. The combined dynamics of these processes constitute the infants' decisions whether to move toward the A side or toward the B side. All of these are expressed in movement parameters because the ultimate behavior is an action requiring a correct movement. The fact that the visual, memory, and motor processes are coupled and continuous means that changes in any of their parameters can (and often does) affect whether infants reach correctly or perseveratively. Again, we will justify this feature of the model in more detail in the next section.

Thus, the error emerges in the context of the specific behavior of reaching.

3.2.5 Remembering. In an earlier section, we presented evidence that infants reliably produced the A-not-B error only after a number of training trials which both served to disambiguate the targets and to establish a repeated pattern of reaching to the A side. Indeed, in every version of the task reported, experimenters elicit the error only after several reaches to the A location. Nonetheless, in their comprehensive meta-analysis, Wellman et al. (1986) found no overall effect of the number of A reaches on the commission of the error. Does repetition matter? This is a critical question for understanding the task dynamics, and we believe the answer is unequivocally -yes, they matter profoundly.

In a number of recent studies, Smith, Thelen, and their associates have shown conclusively that commission of location errors with the B cue is strictly a function of the number of prior reaches to A. The strongest influence on where infants will reach on any trial is where they have just reached. This is true whether the target objects are hidden toys or just the lids.

As we described earlier, perseverative errors decreased when infants were not trained at all or trained at a neutral location. Both of these manipulations simply reduced the number of times infants have reached to A (Smith et al. 1999b). Furthermore, infants tended to perseverate even when no specific cue was offered. In a recent experiment, Smith et al. (1999b) gave groups of infants 6 trials in the lids-only task. One group was cued on one side before they reached at all and then allowed to reach spontaneously. These babies went to the cued side on their first reach and tended to stay there. The other groups were allowed to reach spontaneously on the first trial and then they were cued once, either after 1, 3 or 5 reaches. The cue was always opposite of infants' preferred side; a "B" trial to infants' personal "A" choices. The infants who were perturbed by a B trial after reaching five times to their A choices stayed at the A side. Without any hidden objects or even location cues, infants built a habit to go to A strong enough to counteract the new pull on visual attention. Infants perturbed after one or three reaches, in contrast, were more likely to switch to B when cued, and to switch sides more spontaneously thereafter.

In addition, in several studies where other parameters of the A-not-B task were manipulated, including aspects of the specific and task input as well as the dynamics of the reaching arm, the Smith/Thelen group always found a strong effect of the number of trials to A on whether infants stayed at A or switched. This effect could be detected because, as noted by previous researchers, infants' tendencies to go to A or B are never absolute, but probabilistic. That is, on any given cue to A, even the very first one, there is a chance that, spontaneously, infants will go to B. Even after several trials to A, when the probability to stick with A is high, a few infants will go to B.

Diedrich et al. (1999) captured this effect of spontaneous switches by a new measure of the growing effect of previous reaches--the relative memory strength to A. The index is predicated on the simple assumption that each reach in one direction creates a memory trace that increases the likelihood of subsequent reaches in that direction. Thus, each time an infant reaches to A or to B, the memory of that target increases. By subtracting the memory of B from the memory of A, we can express the probability the infant will return to A. The maximum memory strength for any direction is 1 (8/8) meaning that the infant will have reached to that side in, say, in all 8 of 8 trials. Likewise, the minimum memory strength is 0/8, when the baby never reached to that direction. Figure 2 shows the evolving memory strength to A in a canonical two-target, no-hidden object task. The infants in this study were divided into those that perseverated on both B trials and those that got at least one B trial correct. Note that, from the earliest trials, the infants who were more likely correct at B were those who, spontaneously, had reached previously to B even when cued to A, while those who strongly perseverated had stuck with A throughout.

Figure 2. The effects of repeated reaching on the A-not-B error. The measure " Relative memory strength to A" (explained in text) for a canonical no-hidden object task in 9 month-old infants. Shown for infants who made the error on both B trials (n = 9), on only the first B trial (n = 3) and who were correct on both B trials (n = 5). Correct B trial responding was preceded by spontaneous reaches to B even when A was cued.

How are these results reconciled with the conclusions of Wellman et al. (1986) that "number of A trials was consistently unrelated to performance" (p. 31). As mentioned previously, in all A-not-B studies, experimenters train infants to do the hidden object task, which may involve some number of reaches to the A side. These training trials were thought to be prelude to the task, and were not included in the actual count of repetitions to A. Infants thus may have actually reached to the A location three or four times before the first official A trial is counted. Furthermore, in some versions of the task, infants are cued to the A side until they reach a criterion of two or three successive "correct" reaches to A and then switched to B (e.g. Diamond 1985; Bell & Fox 1992). Correct in these tasks means not just touching the cover, but lifting the cover and touching the hidden toy. In this version of the procedure, therefore, the actual number of reaches to A before the switch is not reported and is unknown. An infant may make several "incorrect" reaches to the A side, or several "correct" reaches but not in succession. Without counting training and/or actual movements in the A direction, the full range of parametric effects for repetition was therefore not obtained, and indeed cannot be ascertained from the information provided in the published reports (Diedrich et al. 1999 ) When these parameters are fully explored as in Smith et al (1999b) the effect of repeating movements is overwhelmingly strong. Indeed, a recently published new meta-analysis of A-not-B reaches the identical conclusion: that perseveration is a function of the number of A-side reaches (Markovitch & Zelazo, 1999).

That the direction of reaching depends critically on where infants just previously reached means that the A-not-B task is a memory task on two time scales. First, it requires that infants remember the location of the specified target in its absence, as discussed above. But second, there must be memory dynamics between the time when the reach is executed in one trial and the sight of the target and decision to reach on the succeeding trial. We suggest that this time between retrieval trials was also of critical importance because infants remember the actions they just performed during the time between recovering the toy and the next specific cue. Thus, when the second cycle of cuing, deciding and reaching commences, it is initiated within the time span of the memory of the previous cycle. The third cycle builds upon the memory of the first two and so on, such that when the baby is finally cued to B the memory traces from the first six reaches to the A side may be very strong. This memory is a motor memory whose content can be examined, at least in part. Critical to the model, therefore, is that the memory of one action is in the same space of movement parameters as the plan for the subsequent actions and can thus, influence the evolution of the next movement. Again, we substantiate these claims in a later section.

Infants make perseverative location errors because the motor memory of one reach persists and influences subsequent reaches.

3.2.6. Development. In the canonical form of the task, only infants between 7 and 12 months of age consistently make the error. Before this age span, infants do not search at all for hidden objects, and after a year, infants search successfully where they saw the object last disappear, even after it has been displaced several times. Thus, in traditional interpretations, perseveration is seen as a distinct stage in cognitive development: that of incomplete object representation.

It is incorrect, however, to assume that perseverative reaching responses are unique to a particular stage in infancy. Try moving an article in your kitchen from a long-established location and start cooking dinner! Indeed, adults with no perceptual or motor impairments can be trained into perseverative or biased responding within an experimental session in the laboratory (e.g., Ghilardi et al. 1995). Especially relevant is a recent study where Smith et al (1999a) elicited strong perseverative reaching responses in 2 year-olds, at an age at which the Piagetian object concept should be strongly established. These authors asked toddlers to retrieve a toy hidden in a long narrow sandbox , a task that differed from the classic A-not-B exercise primarily in the less well-delineated targets. (Once the toy was hidden in the sand, no covers to the hiding places helped mark the possible locations.) After several recoveries from the A side, toddlers continued to search in the vicinity of the A location even after the toy was hidden on the other side.

We contend here that the processes that create perseverative responding in infancy are not special, but are the very same processes that lead to correct responding at this age, and also to correct and perseverative responding in individuals at any age. Nonetheless, it is incumbent upon us to explain which aspects of the processes can account for the developmental effects: why location errors occur at particular ages and under certain conditions and not at others.

We believe that all of the above-mentioned contexts and parameters contribute in coupled and perhaps nonlinear ways, and that they all may have age- and experience-specific dynamics. In short, the age differences reside in the particulars of the environment and timing demands of the tasks. Consider, for instance, the problem infants face in distinguishing the two identical lids. There are good developmental reasons for infants to have difficulties at say, nine months, and to improve their abilities in this regard by 12 months, but still to be confused at 24 months when the task becomes more ambiguous. In the last part of the first year, infants have limited experience with spatial localization, especially before they have moved themselves around. Understandably, self-locomotion may focus infants' attention on where things are in the environment because these objects and places become relevant in ways they were not when babies are still transported by others (Horobin & Acredolo 1986). Increasing visual attention alone may change the impact of the various manipulations of the task. In addition, infants may learn to pay attention to the relationships of objects in the environment to each other and thus form a better conception of right side versus left side when faced with an otherwise ambiguous visual scene. These experiential effects are co-evolving with infants' increasing skill at reaching. As infants move about by walking, cruising, and crawling, they reach for different items, at different levels and locations, and from a variety of postures. Thus, their action capabilities expand and become more flexible. At the same time, we may suppose, these interrelated perception-action experiences also impact memory processes, as babies now have reasons to remember where things once were when they are presently out of sight. However, and what is critical here, is that these changes are gradual and not all-or-none: given good reaching skills and adept use of landmarks, toddlers and even adults may still become confused when the target locations are transient or not well-marked or when the delay between target cue and go signal for reaching is extended.

Given these multiply-determined, dynamic, and cascading effects, the power of the model is to offer entry points for actually probing how the process parameters may change over developmental time. As a starting assumption, we focus on one of the likely sites for developmental change, the mental processes involved in the motor planning and decision to reach to either the A or B side. We will spend considerable time in the next sections justifying why it is appropriate to express the evolution of the A-not-B error dynamics in terms of a motor field, and therefore, how developmental events may impact upon the behavior of that field under certain task conditions.

Thus, and finally, age-related changes in the likelihood of perseverative reaching may result from parameter changes in multiple contributing processes; one candidate is a change in the properties of the integrative motor planning process.

4. A dynamic field model: Overview and rationale

Previous accounts have assigned the cause of the A-not-B error to infants' deficits in object knowledge, spatial localization, memory, or inhibition. In contrast, we center our attention on reaching, and in particular, the processes that lead to a directional reach to A or B. The challenge is to explain, therefore, in terms of the normal processes involved in reaching, behavioral phenomena that look like infants really do have problems with object permanence, and that they cannot escape their body-centered understanding of space, and that there is something lacking in their memories or inhibitory mechanisms. Thus, our formulation is justified as far as we can show that the well-documented context- and age- effects described above can be manifest in the domain of the action of reaching.

The model describes the mental events that constitute the decision to reach to A or B as activations in a dynamic field expressed as directional movement parameters (Erlhagen & Schöner 1999; Schöner et al. 1997). The field, which has nonlinear properties, evolves continuously under the influence of input dynamics from three sources, also expressed in compatible directional parameters. These include the specifications of the task environment, which establishes the decision field, the specific cue to reach to A or B, which is transient and must be remembered, and, after the first reach, a memory dynamics which biases the field for the subsequent reach. Both the properties of the field itself and the input dynamics can be assigned parameters that are derived from data, and we simulate, using such parameters, the robust effects documented in the literature. We also use the model to generate novel predictions.

Note here three critical aspects of the model. First, although we maintain that the model is biologically plausible, and discuss this further below, it is an entirely abstract model, and not a neuroanatomically specific account of looking, reaching, and remembering. In its present form the model also does not incoporate specific bodily parameters such as muscle anatomy, segment masses and centers of gravity, or joint configurations. Rather,the model captures the abstract, collective dynamics of multiple processes, which are likely happening in parallel in many integrated sites in the brain and the body. Second, and related, the concrete mathematical functional form of the model is not unique (see for instance, Grossberg, 1980; Wilson & Cowan, 1973). What is important here are the assumptions of blended, continuous performance and how closely these assumptions both match the previous experimental results and generate good, testable new experiments. That this precise formulations works validates our assumptions but does not preclude other theoretical instantiations using similar assumptions. Thirdly, we put forth this model as only the first step towards a fully embodied account by casting the mental events that precede the movement into the same dynamic language appropriate for movement itself: continuous and time-based. The important step of integrating the actual movement dynamics with the motor plan dynamics remains to be done. It is a difficult challenge, given the complex nature of the coordination and control of limb movements (see Erlhagen & Schöner 1999; Kopecz & Schöner 1995, also Jordan 1990; Bullock & Grossberg 1998; Houk et al, 1995 for neural network accounts of the motor control problems.).

4. 1 Integration in a motor planning field

Reaching in infants, like reaching in adults, begins when individuals see objects they want. We know from a vast literature that 150-300 milliseconds elapse between the visual fixation of the target or some other "go" signal and the actual movement of the arm and hand toward it. During this time, conventionally called the reaction time, persons are processing the visual stimuli, effecting some sort of transformation from visual space to body-space and establishing the specific movement parameters for the execution of a movement that will attain the goal. After this "motor program" is set, the traditional explanation goes, then the reach is triggered and the actual movement occurs. In terms of the cartoon of Figure 1, the series of events can be imagined as the diagonal row of icons from left to right: look, plan, reach.

The first part of the process involving attention, vision, and planning are the mental or cognitive events usually studied by psychologists. The prevailing theoretical frame of reference has been that of information processing: the vocabulary revolves around notions of programs, codes, and representations, and processes such as feature extraction, response choice, and serial stages. These mental constructs are accessed through manipulating the attentional demands of the task, the nature of the stimuli or the memory load and by looking at reaction times and error rates. Investigators cared about mental events; the movement itself was of little or no concern. In contrast, movement scientists have been more focused on the actual control and execution of the reach itself. In this case, they focused on kinematic variables such as trajectories and velocities, and the biomechanical and neuromuscular contributions to the transport of a physical entity-- a real limb-- in space and time. They might vary the load on the limb, speed and accuracy requirements, work space demands, postures, and so on, and measure actual movements, forces, and muscle patterns. People plan in order to move; yet, in the traditional formulations, there was no common currency between the activities of the mind and those of the limb (for discussions see Allport 1987; Poulton 1981; Keele 1981; but also Georgopolous 1986; Hommel 1996; Prinz 1997; Rosenbaum et al. 1995; Schöner 1995).

Despite this historical dualism, it is increasingly recognized that the processes of action planning and execution cannot, in reality, be so neatly parceled and expressed in such incommensurate dimensions. Empirical advances over the last decade or so have gone far in breaking down the strict serial processing assumptions of "first preparation, then execution," in favor of much more parallel, mutually influential mechanisms. There is growing evidence, for instance, that the visual input is not just important at the start of the process, but is continuously and intimately influential throughout the planning and execution of the reach trajectory. And that adults are proficient at producing on-line corrections to movements, indicating that the programming process is an ongoing dynamic, not rigidly fixed by the initial target specifications (see, for instance, Goodale et al. 1986; Prablanc & Martin, 1992). Relatedly, perception itself is not isolated from action: the very act of perceiving is enmeshed with actions that accompany it (Muessler & Hommel 1997). Indeed, we believe that it is just this amalgam of processes that gives rise to infant perseveration.

Because we take the unusual stance that the A-not-B error arises from a motor planning process that is part of a dynamic perception-action loop, we take some time at this point to substantiate this fundamental assumption. Specifically, we give evidence for 1) Actions are planned in movement parameter space; 2) The plans are continuous and graded in nature; 3) Plans evolve under continuous perceptual influence of both task and cue; and 4) The system has history.

4.1.1. Actions are planned in movement parameters. The most compelling and direct evidence that what has been traditionally called "cognition" can be effected in movement parameters comes from the pioneering work of Apostolos Georgopoulos and his colleagues (for reviews, see Georgopoulos, 1986, 1990, 1991, 1995). In their now-classic experiments, these investigators trained monkeys to reach for different targets in space while they recorded simultaneously from many neurons in the motor cortex. They discovered populations of neurons that together code for the direction of movement. This code is a product of an ensemble of neurons, each of which is only broadly directionally tuned. Together, however, they provide a unique population vector that points in the direction of the movement to the target. There are several aspects of these findings that are especially relevant for our model. First, the code is body-centered: the population vector points in the direction of the target despite the monkey starting from various positions. Visual and body information are in the same parameters. Second, the vector emerges gradually and continuously in the planning period between cue and reach. As illustrated in Figure 3 (top panel), in the approximately 150 msec between the presentation of the target and the actual start of the movement (indicated by the velocity vectors of the movement) neuronal activity gradually builds and predicts the upcoming movement. Third, the vectors mirror the direction of the velocity of path of the hand, suggesting that the population vector carries information about the instantaneous velocity of the hand. Again, this is strong evidence of the compatibility and continuity of the planning/action code. Fourth, the population vector predicts the direction of reaching during a delay. In a good analogue of the infant A-not-B task, researchers presented the target briefly, then instructed the monkey not to move until a cue was given 450-750 ms later. Figure 3 (middle panel) shows the vectors in the direction of the cued signal held in memory during the delay, again as a continuous and graded signal. Indeed, when the light was not turned off during the delay, requiring no memory, the vector length is somewhat decreased (Smyrnis et al. 1992). Finally, specific task-related cognitive manipulations learned by the monkeys can be detected in the evolution of the directional vectors. In one such study, Georgopolous and colleagues trained a monkey to sometimes move a handle 90 degrees and counterclockwise from a reference direction and sometimes move directly to it. During the reaction time--before actual movement--this directional rotation could be detected in the population vectors (Figure 3 bottom panel), indicating that the animal was performing a mental rotation from the accustomed to the novel motor direction. Again, this rotation was gradual and continuous and involved activation of cells tuned not just to the stimulus and rotated direction, but those in the directions passed through during the rotation. This, according to Georgopoulos, "provided for the first time a direct visualization of a dynamic cognitive process" (1992, p. 514). Similarly, in a recent paper, Kettner et al. (1996) showed that in monkeys trained to move to two locations in a sequence, activation of both sets of directionally specific neurons--to the first and then the succeeding movement--could be detected in motor and premotor cortex, again before the movement actually began.

Figure 3: Top panel: cortical population vectors point in the direction of the movement before the movement begins. Middle panel: population vectors in the direction of the cued signal are held in memory during the delay. Bottom panel: Rotational vector detected during the reaction time when animal was performing a mental rotation. (A) Rotation task showing direction of stimulus (S) and movement (M). (B) Neuronal population vectors calculated from the onset of S until the onset of M. (From Georgopoulos, 1995).

The A-not-B error is an error of reach direction. Directional vector coding has been described in several areas of the brain, including primary motor cortex (Georgopoulos et al. 1982), dorsal premotor cortex (Caminiti et al. 1991), areas 2 and 5 of the parietal cortex (Kalaska et al. 1983), and in the cerebellum (Fortier et al. 1989). Moreover, in several of these areas, the neural response dynamics are congruent with the visual-motor integration we propose in the model, as we discuss further in a later section (Pelligrino & Wise 1993). Thus, it is tempting to conceptualize activation in the model to be direction-specific. However, there is also considerable debate among motor physiologists over which movement parameters are actually encoded in the brain. For instance, researchers have discovered neurons in motor, premotor, and parietal cortex that are specific to a body-centered code, and change with the postures of the arm, before, during, and after the movement (e.g. Caminiti et al. 1991; Lacquaniti et al, 1995; Scott & Kalaska 1997; Scott et al. 1997). It is possible and likely that the CNS uses multiple frames of reference. The precise coordinates of the movement parameters are not critical for our model, however, as the activation in parameter space is purely abstract, and could therefore be topographic in respect to body segments as easily as metric in respect to extrinsic coordinates.

4.1.2. The plans are continuous and graded in nature. The analogue character of motor planning is clearly manifest in the direct recordings from monkey cortex. But there is similarly compelling evidence of the continuous and graded nature of this process from behavioral studies of adult humans done by Claude Ghez and his collaborators (Hening et al. 1988a, 1988b). Their task, like the one used in the monkey studies, is similar to the infant A-not-B in that the target was ambiguous and that there was delay between preparation and execution. Participants were asked to match forces generated by isometric muscle contractions to one of three target amplitudes shown on a computer screen (Think of contracting your biceps with your arm held still-- a little, some more, and lot). The experimenters instructed the participants to respond on the last of four successive, equally-spaced tones. The manipulation was that the target specification flashed on the screen at varying intervals between the third and fourth tone (see Figure 4). When the interval between target and go signal was long, individuals had a long time to prepare their response. When the interval was short, participants had to respond presumably before the planning process was complete. Indeed, what the Ghez group found was that the distribution of responses varied systematically with the time available for preparation. Figure 4 also illustrates these results. At short intervals and without knowledge of the upcoming target, individuals responded with a middle, "default" response. Gradually, as the time between target and response signal increased, these force amplitudes shifted toward the target values. (In contrast, when participants knew well ahead what their target would be, this interval did not matter). By this clever experiment compelling participants to move before they were "really ready," these researchers sampled the response preparation time and showed that the planning gradually evolves--it is not an all-or-nothing trigger for response initiation. Equally remarkably, and also relevant, is that the default response, the middle amplitude, was prepared in advance, presumably specified from prior trials in the task.

Figure 4: The effect of response preparation time on target accuracy. Top panel: participants were given variable times between the appearance of the target for an isometric force amplitude (three levels of force, indicated by the filled bars in the top panel) and the "go" signal indicated by the fourth of 4 tones spaced 500 msec apart (S-R interval). The diagonally striped bar is the center or default value, which is what the participants prepare ahead of the specific target cue. The effect is shown on the bottom panel. When the S-R interval is short, participants respond with the middle default response. Target accuracy increased continuously with increased planning time. (Redrawn from Henning et al.1988a).

Together, the Ghez and Georgopoulos studies point to planning for action as specified in movement parameters and as having real dynamics, that is, a time course of activation and decay. The dynamics can be viewed directly by neural recording or indirectly through behavioral manipulations. We contend that the A-not-B error is, much like the Ghez studies, a window on these planning dynamics, with the ambiguous targets and delays providing the necessary manipulations.

4.1.3. Plans evolve under continuous perceptual specification of task and cue. Having established the likelihood that the plan for action is expressed in the same dynamic variables that control action--that planning and reaching are meshed and continuous--we now add the third parallel channel, looking. There is a large and growing body of evidence that visual behavior is intimately linked to every aspect of the reaching task, and therefore, that we are justified in assimilating the visual dynamics into the reaching field. Looking affects reaching and reaching affects looking.

Consider the following adult experiments which link direction of eye movements with motor responses. Fisk and Goodale (1985) asked participants to reach with right and left hands to targets that were both ipsilateral and contralateral to the hand used. Latency to reach onset was shorter when participants moved to targets on the same side as the reaching hand and also when using their dominant hand. This effect was precisely echoed in their eye movements. Although eyes moved toward the targets 50 msec sooner than hands did, eye movements latencies were also decreased when the hand was reaching to the ipsilateral side and/or was the preferred hand. Now there are reasons to expect that neuromuscular and biomechanical factors would make it easier to prepare movements on the same side or with the accustomed limb. Eye movements to the right or left, however, if independently specified, should be equally facilitated; there should be no biases in the system. But in the dynamics of this task, there are. One conclusion is likely: "The relationship between ocular and manual latencies suggests that production of these two motor responses is far from independent and that programming within the two systems must be integrated (at least temporally) at some level of the central nervous system" (p. 170).

A second study of looking and reaching closely parallels the A-not-B task: the target is only briefly presented and must be remembered and the task situation provides ambiguous cues. This experiment, conducted by Enright (1995), demonstrates that momentary changes in gaze affect reaching accuracy. Participants initially fixated a central target. After a few seconds, a peripheral target lit up and then was turned off. The central target remained on for another 2 seconds--the delay-- and when it went off, participants had to reach toward the remembered target. Three different conditions were used. In one, individuals kept their gaze on the central focus and pointed in the dark. In the second, they were instructed to look at the missing target immediately after the fixation light went off, and in the third, to shift their gaze to the target during the time it was lit and to keep it there. Participants had more accurate aiming when they shifted their eyes toward the target, whether the target was visible or not. It is not so surprising that people would be better when they immediately shifted their gaze and held it at the target. But the difference between those who kept their gaze at midline and those who shifted just as they reached suggested that it was the eye orientation during the pointing process itself (even though the target was memorized and not visible in either case) that had the impact on the movement outcome. This must mean, according to the author, that information about the direction of the gaze and not the visual information per se, combines with the spatial specification of the upcoming movement which, in this case, is stored in memory. Where one looks matters in how one reaches.

These experiments showed the ongoing influence of looking at the specific targets on reaching, but other studies also show that even seemingly irrelevant distractors in the visual field perturb reaching performance, and most importantly, that the disturbance is action-centered. These experiments are pertinent because the A-not-B task can be seen as one where the target (the cued location) is made ambiguous by distractors in the field. Tipper et al. (1992) presented adults with a 3 x 3 matrix of buttons on a sloped board. Buttons in the middle row, when lit by a red light, were the targets and those in the front and back rows, when flashed yellow, acted as distractors in some trials. Latency to reach was consistently longer when distractors were present, but asymmetrically so. Distractors interfered more with the reach plan when they were below and in front of the targets, and thus acted as visual perturbations to the path of the hand, than when they were above and behind them, out of the hand path. In addition, and consistent with the Fisk and Goodale studies described above, right-handed participants were slower when the distractor was on the right side, again in the path of hand. The visual pull for attention was again manifest in motor dimensions.

Another particularly clear demonstration of the influence of the entire visual task environment, in interaction with the target dynamics, comes from a study by Jackson et al. (1995). These authors asked participants to reach for a wooden block, sometimes in the presence of a wooden dowel placed either midline or peripheral to the target. They found a significant distortion of both reach and grasp kinematics when the distractors were in the visual field, but only in the condition when participants closed their eyes. There was an effect of flanker position as well, whereby distractors at midline affected only those reaches that crossed midline whereby those at the periphery distorted reaches to the ipsilateral side. The effect of vision is especially relevant: when the targets were continually in view, the authors suggested, participants had time to attentively select the correct target and prepare motor response. When eyes were closed, however, the target location had to be remembered for about 2 seconds. As in the A-not-B task, during this gap, memory of the distractors must have influenced the memory of the specified target location --in reach kinematic parameters.

One further line of evidence of the reach-look synergy comes from studies where limb proprioception is manipulated as well as vision of limb and target. For example, when participants were instructed to move to either visible or memorized targets without seeing their limbs, the directional biases in target accuracy depended on how far the hand starting position was from the middle of the body (Ghez et al 1995; Ghilardi et al. 1995; see also Prablanc et al. 1979; Rossetti et al. 1994; Desmurget et al. 1995, among others). When both the hand and the target were visible, no such biases appeared. Presumably, given the relatively less practiced task of reaching from either one side or the other, people needed to update their hand-to-target motor plan with additional visual information about the position of their limbs.

4.1.4. The system has history. The A-not-B task is about looking and reaching, but it is also about remembering on two time scales. The target position must be remembered during the reach. But also critical to our model assumptions is that the memory of one-just completed reach is retained to bias subsequent movements. As documented in the previous section, this memory likely includes the entire task set-up, which "pre-shapes" and biases the field in the same movement parameters as the reach plan (see also Mitz et al. 1991; Spencer 1999). But we also propose that the motor memory of the just completed movement is also retained and integrated into the next plan.There is evidence for this assumption in the adult literature. For example, in the Ghilardi, et al (1995) study described earlier, participants showed biased directional errors when reaching from unusual starting positions such as the far right or left of their bodies. With extensive training in one area of this novel work space, say to the right or to the left, these biases were eliminated. However, the repeated movements in the trained area then skewed performance in the untrained areas toward the trained position, even creating systematic errors in midline where none were found before the experimental training. Directional accuracy, therefore, was a direct consequence of repeated moving in that task. When people are accustomed to reaching from a middle position, they make errors in other areas of the workspace. This visual-motor map must be highly dynamic however, as it could be changed through altered experience. The system retains a memory of previous movements that incorporates the feel of the arm in relation to the target and uses the memory to plan future responses.

Equally dramatic evidence of these movement field memories was provided by studies of Shadmehr and Mussi-Ivaldi (1994) (see also Lackner & Dizio 1994) who created novel reaching conditions by subjecting participants to an artificial force field. At first, participants could not make straight reaches to a target; when they tried their habitual reach parameters, the novel forces created curved trajectories. With extensive training, however, participants learned to adjust their arm dynamics to the new environment, and performed straight reaches again. When the force field was then unexpectedly turned off, the learned adaptation remained, and participants now produced curved movements, as though they were still compensating for the unusal field. These "after-effects" again demonstrated, that even in adults, repetition of an action changes subsequent movements, indicating that the system retains a memory for the movement parameters from trial to trial.

4.2. Motor plans in infants

Are the fundamental dynamics that produce these coupled interactions of looking, reaching, and remembering in adults the same as those that lead to perseverative reaching in infants? Although direct behavioral evidence is sparse, a few recent experiments increase our confidence that model assumptions chosen from adult studies are also good for infants.

We discuss first the issue of looking and reaching, and especially that the direction of the gaze is mutually coupled to the direction of movement. This was directly tested by Smith et al. (1999b) using the canonical A-not-B hiding, but adding a simple manipulation of the direction of visual attention, as we described earlier. Infants whose visual attention was pulled in the direction of their original movement training stuck with their ongoing motor habit; conversely, infants whose glances were in the opposite direction of their movements were more likely to switch to the new target. In infants, as well as adults, goal-directed reaching is coupled to the direction of visual attention.

Second, infants code reach direction in a manner that incorporates both postural and trajectory information, as suggested by the neurophysiological evidence in primates mentioned previously. Furthermore, these parameters are held in memory and influence subsequent reaches. For instance, in an another experiment, Smith et al. (1999) reasoned that if the memory built up of repeated reaches to A is based specifically on the position of the hand and arm in relation to the target, then shifts of posture that disturb the remembered hand-target trace should also disrupt the perseverative pull to A, much like a glance in the B direction competes with the activity pulling the baby to A. To test this idea, the researchers gave infants the standard hidden object A-not-B task. For the training trails and A -side test trials infants sat on a parent's lap, as is customary. However, between trials A2 and B1, parents stood their infants up so that the baby had to reach down to uncover the toy. Control group infants were distracted visually with a colorful, noisy toy shown to them at midline. Infants who saw the centered visual distraction perseverated when cued at B. However, when their posture was shifted, infants' tendencies to return to A were dramatically reduced. Indeed, infants who reached from a standing position tended to reach correctly to B, a level of performance not seen in any previous manipulation. (In other experiments, even when perseveration is reduced, infants are not normally correct, but at chance levels of going to either A or B). Body memory, therefore, was whole-body memory, incorporating the trace of the reach from a specific arm-to-body position to a specific location. Disrupting that memory through a bodily perturbation was especially powerful in interrupting the influence of previous reaches.

Recently, Diedrich et al. (in press) provided equally compelling support to our assumption that the A-not-B error is generated from motor memories. These investigators, for the first time, actually tracked the path of infants' hands while they engaged in a no-hidden-objects version of the task, using computerized motion analysis equipment. At nine-months -- the age of the participants-- infants are not yet fully skilled reachers, as evidenced in their hand trajectories. Although they go quickly and rather accurately to the target lid, their hands trace a somewhat bumpy course, speeding up and slowing down several times (Hofsten 1991; Thelen et al. 1996). Each reach of each infant, therefore, has a distinctive speed signature, which normally varies from reach to reach.. However, when infants reached repeatedly to the A side in the two-target task, Diedrich et al (in press) discovered a remarkable result. In those infants who perseverated to A on both B trials, and presumably had built up a strong motor memory of the direction of their reaches, their trajectories converged in form. That is, the speed bumps became increasingly alike, as evidenced by increasingly strong pairwise correlations. This is illustrated in Figure 5 which is an example from a single infant reaching to a single target "C" (left panel) and in an A-not-B lids-only task where the baby perseverated on both B trials (right panel). Note that although the repetition at the single lid produced some trajectory resemblance in C3 and C4, there was a strong tendency for trajectory convergence in the two-lid task. For the group of infants, convergence was less strong in infants who spontaneously reached occasionally to B during the A trials, who reached correctly to B, and in infants reaching to only one target. It is unlikely that several reaches will have exactly the same time-space signature repeatedly in the absence of some memory of the previous reach. We can conclude that, in the conditions that produce perseverative reaching--repetition in the face of the novel and confusing two targets-- both the direction and the pattern of changes of forces producing hand accelerations are held in memory from one trial to the next. And this memory is a powerful influence on the movement parameters generated for the next reach. In sum, there are good behavioral and neurophysiological reasons to seat the reach decision in a movement parameter field that integrates the visual characteristics of the task and the memory of the previous actions. Also critical is our assumption of a graded and continual field where these integrated dynamics evolve.

Figure 5. Speed profiles of an individual nine-month-infant in two reaching tasks. Successive trials are shown on the Y axis. Left panel: hand speeds while reaching to a single, centered target 6 times. Right panel: hand speeds while reaching in a no-hidden object A-not-B task where the infant reached to A on both B trials, although the kinematic data from trail B1 was missing. (See Diedrich, et al,(in press)).

5. The dynamic field model

The model presented herein is an extension and modification of a dynamic field theory of motor programming originally formulated by Erlhagen and Schöner (1999) that is based on mathematical models formulated by Amari (1977), Grossberg (1980), and Wilson & Cowan (1973) (see Grossberg, 1988 for reviews). Erlhagen and Schöner developed the model to formulate the processes of movement planning in dynamic language that ultimately may be reconciled with the dynamics of movement execution. Kopecz and Schöner (1995) and Schöner, Kopecz, & Erlhagen (1997) have applied a similar model to the planning of eye movement saccades. Readers are referred to these papers for additional technical details concerning the models.

5.1. The movement planning field

We begin by describing the dynamics of the movement planning field, the site of the integration of visual input and motor memory, and the generation of the decision to reach to A or B. As we have argued earlier, this field must be able to generate and maintain specific activation states denoting the directional parameters of the reach in a continuously evolving manner that simulates the gradual specification of motor plans seen in experiments. Thus, the dimensions of the field in this case are the movement parameters appropriate to planning and executing a reach in a specific direction to the right or to the left. The field represents the relative activation states of those parameters. At this point, we conceptualize this field only in abstract terms as a site where visual input and memory are integrated into movement parameters supporting movement amplitude, direction, or time. Later in the discussion, we will speculate further as to possible neuroanatomical areas where such a field might evolve.

Although the model dynamics result in a dichotomous choice-- A or B--it is important to emphasize again that the behavioral dimensions supporting the choice are continuous directional parameters where A and B are locations on this continuum. Unlike classic symbolic models where either A or B constitute the universe of choices, the field model allows metric specification of particular parameters from a continuum of possible actions (see Lewin, 1946, for an early version of choice behavior on a continuum.) The activation field then assigns an activation variable to each site on the dimension. The specification of the movement is thus a function of the amount of activation at particular values representing, in this case, direction. Thus, the field literally has a shape that reflects different possible movement states: a sharp local peak indicates a well-specified motor act. Activations that are more graded and distributed imply that the movement parameters are less-well specified, resulting in more random responding or less accurate actions toward the targets.

In Figure 6, we depict the dynamic field in terms of direction to the A or B side as a continuous space spanning the infants' visual and reaching field and locating the targets at A or B. A reach to A requires strong, above-threshold activation at the A target in movement parameters: infants need to activate whatever combination of muscles are needed to get their hand to the cued target. In the upper left panel we depict the specification of a reach to A. In contrast, without any cues to A or B and when both are visible, infants may have equally relatively low activation at both locations, and may not reach at all. The upper right panel illustrates this condition. Graded information about the target choices is reflected in the activation space, but not in sufficient strength to trigger a movement. The bottom panel represents another possibility where the distributed activation is asymmetric, favoring the A site, but where the activation has still not reached threshold for movement generation.

Figure 6. Activation in continuous movement parameter fields at values corresponding the to A or B direction. Top left: activation passes threshold (dotted line) for a reach to A. Top right: subthreshold activation leads to no reach at all. Bottom: Graded, and asymmetrical, but still subthreshold activation at both sites.

5.1.1. Dynamics. Given our extended justification for choosing the parameters of the field, we begin by defining the movement parameter, x, and its dynamic field u(x) as representing those motor values the baby can continually specify to move in the direction anywhere from right to left. Our foundational assumption is that this dynamic field changes continuously with time, thus u(x,t). The state of the field depends, however, not just on the dimension x, but also with respect to its own level of activation, u. This means that u(x,t) itself has continual dynamics, where the next state depends on the previous one. The level of activation, therefore, cannot jump, but must build up gradually instead.

This build-up depends on the nature of the field and the inputs to the system, in this case the information about the task structure itself, the specific cue to A or B, and after the first reach, the memory of previous reaches. These inputs, as we stated earlier, are expressed in the dimensions of the field: S(x,t), here, again, in a reach direction specifying A or B. (We explain the contributions to S(x,t) in detail in the next section.) So the most simple case where the inputs are added together to the field can be expressed mathematically as

(1)

where we have also added a linear decay term, -u(x,t) which together with together with the constant defines the time scale over which the field gradually builds up or decays.

5.1.2. Time scale. Because the field is dynamic, its time scale of activation is critical. We illustrate the operation of , the time scale parameter, with a simple case where each site on the field evolves independently, that is, without influencing its neighboring sites. Under this condition, the activation at each site relaxes over time to the level of the input, and the stationary solution, u(x) = S(x), directly reflects the input. However, when the input changes, the activation in the field does not change with the input instantaneously, but has a certain inertia. Specifically, when the input changes in a step-like manner, say from S(x) = 0 to S(x) =S0 (x) , then the field changes exponentially according to:

(2)

where u(x) is the initial deviation of u(x,0) from S0 (x). Thus, is the amount of time during which the distance between the current activation level and the input level is reduced to 1/e, or about 37% of its initial value. Because expresses a percentage change, the temporal evolution of a site proceeds independently of its current activation, although its activation level itself is strictly a function of its previous activation. This is a characteristic of any dynamic system close to its stationary state. Thus, without cooperativity, the field evolves over time to assume the shape of the specific input and then it decays.

5.1.3. Cooperativity. We used the simple case of site independence to illustrate the temporal properties of the field. This limit case is unrealistic, however, because if all the sites were independent, the field would only reflect the exact parameters of the inputs and could not reach a decision in the face of several competing inputs. A mechanism for integrating graded information is needed: the sites must be coupled so that a single decision can evolve (Amari & Arbib, 1977). To produce such a self-sustaining peak from multiple inputs, the field is endowed with interactions such that sites that are close together are mutually excitatory, whereas more distant sites are inhibitory. We refer to these interactions within the field itself that can enhance (or inhibit) activations as cooperativity. (Note that, strictly speaking, these interactions contain both excitatory and inhibitory activations. Typically only the former are called cooperative connections, while the inhibitory interactions are usually considered competitive. We adopt the term cooperativity to stand for the combined effects of the interactions as both mutually produce (or inhibit) self-sustaining peaks.)

Again, cooperativity, gintra-field, like the other contributions, is a function of the state of the field itself and so may be expressed as an added term to equation (1);

(3)

More specifically, gintra-field is composed of two functions: an interaction kernel, and a threshold function. The interaction kernel w(x-x') allows the model to generate self-sustaining solutions by the balance of local and global excitation and inhibition. The interactions may arise from any point, x' in the field. An additional assumption is that these cooperative interactions are homogeneous within the field, so that no point is privileged over others and the interaction thus depends only on the difference, x-x', the distance between the sites. The interaction kernel has the form:

(4)
 
 

where wi > 0 and we > 0 are the strengths of the inhibitory and excitatory components and w > 0 is the size of the excitatory region, which establishes the size of the localized activation patterns. This is illustrated in the top panel of Figure 7.

Figure 7. Cooperative interactions within the dynamic motor field. Top panel: Interaction kernel w(r) consists of a local excitatory zone of width w and strength wexcite and a global inhibitory contribution of strength winhibit. Bottom panel: The contribution of any location in the neural field to the cooperative interaction is determined by a threshold function f(u) to the neural activation. The slope(beta)of this function determines the degree to which subthreshold values of activation contribute to the interaction.

Not all sites in the motor parameter field can contribute to the interaction at all times, however, as this would lead to a single, inflexible solution. As is true in real neural systems, only those sites that are activated communicate to other sites. Thus, a threshold function, f(u), allows only certain levels of activation to enter into the interaction:

(5)

where is the slope of the sigmoid function, set to a zero-to-one range, where the system becomes activated, depicted in the bottom panel of Figure 7. (For a discussion of the different types of nonlinearity involved, see Grossberg, 1973).

Thus gintrafiield is the product of the interaction kernel and the threshold function which is then integrated over all sites of the field:

(6)

When the cooperative functions are added to the other contributions to the field, as sketched in Equation (3), the dynamics take the form of Equation (7), reflecting the continual evolution of the motor field with a particular time scale, cooperativity, and inertia, in the presence of sensory inputs.

(7)

We now add two additional parameters to the equation. The first, h, sets a resting level to the field. Recall that we are able to set a threshold for entering into the interaction kernel. At any given threshold, however, the number of sites that actually participate depend upon the resting level of the field. If this level, h, is small, only sites with strong input contribute to the interaction because the threshold is relatively greater. In contrast, if the sites are already somewhat active, and the threshold is effectively lower, the interaction is much less localized and more widely distributed.

This resting level has profound implications for the resulting field dynamics. When h is low and only strong inputs predominate, the system is driven largely by inputs and less by the local interactions. The field behaves more like the one shown in Figure 7 where the output reflects the input. When h is large, however, many sites contribute to the interaction and the localized excitation becomes amplified by the many excitatory connections of its neighbors and the corresponding surrounding inhibition. In this regime, excitation can become self-sustained even without continual input, and the field can express a decision in the face of multiple inputs (Amari 1977). This is the critical mechanism for the integration of perception and memory into a decision field as it allows inputs of different relative types, strengths, and degrees of specificity to contribute to an integrated motor outcome.

Finally, we add a term, q(x,t), for Gaussian noise, giving the field dynamics the overall form:

(8)

As in any dynamic system that exhibits multiple states, these fields are sensitive to noise near instabilities, and thus noise is justified in the equation. In reality as well, infants' behavior is noisy. The A-not-B decision is always a probabilistic one; sometimes infants spontaneously reach to either A or B whatever the input or history of the system.

As we show in the simulations, below, these intrafield dynamics capture both the age and the delay effects in the A-not-B error, as well as their interactions. In principle, one might imagine developmental changes in any of these interaction parameters: in h, the ability of the field to generate a localized solution; in the strengths of wi, we, and w which determine the relative basins of inhibition surrounding peaks of excitation, and/or in the threshold function. For the purposes of our simulations, however, we will assume that the main developmental effect lies in h, the ability of the field to enter the cooperative, self-sustaining regime.

5. 2. Inputs to the field

With these intrinsic properties, the action decision evolves in the motor planning field under the specification of three sources of input:

(9)

The first two, Stask and S specific, are the parameters of the persisting task environment and of the cuing event that are both present in the very first reach to A. The third, Smemory, is the contribution to the current motor decision from the system's history and thus effectively enters into the model after the first reach. Critical to this formulation is that the inputs to the decision field are expressed in movement parameters so that they may be mutually coupled. The assumption here is that the inputs, like the movement parameters, are not discretely specified, but are identified as locations on a continuous field.

5.2.1 Task input. People move within an immediate spatial environment which usually remains stationary. This environment provides persistent visual (or tactile or auditory) input that specifies the task space-- what objects and surfaces delineate the continuous targets and supports for action. These are the features of the world which constitute the behavioral alternatives within the intentions of the actor: the possibilities to move in one direction or another (or forcefully or gently, etc). The task layout is thus prespecified and independent of immediate signals to act. In experimental situations, the task parameters typically do not change during the performance of a single trial, although they will often be varied for different experimental contexts. Without additional specific attentional cues or memory traces, the strength and symmetry of the task input determine the contours of the decision field. When specific attentional or memory inputs are added, they combine with the existing and tonic activation patterns set by the task environment.

Consider the infant reaching task with the two possible target locations, A and B, always in view. In Figure 8, we illustrate the task input (before any additional cues are provided) as activations in motor parameter space. The first way that the task input can be characterized is by the target locations along the decision field. These are represented as probability distributions in the field centered on two locations, xA, xB. In the upper left graph of Figure 8, we represent the activation distributions around two similar targets that are well-separated and provide two clearly specified goals. The upper right panel, in contrast, shows these distributions when A and B are close together, their distributions overlap, and there is a greater probability that the baby would, by chance, reach to either A or B. Indeed, target confusion as evidenced by A-not-B errors is reduced when the covers are relatively farther apart (Acredolo 1985).

Figure 8. Inputs to the decision field at the first reach to A, showing possible configurations of the visual stimuli. Task input. T1: two clearly specified, separate targets; T2: two specified targets close together; T3: two, boring, identical targets; T4: one attractive and one boring target. Specific input. S1: well-specified cue to A; S2: poorly specified cue to A, for example, a quick wave of the hand over the target; S3: well-specified cue to B. The bottom panel shows the time evolution of T3 and S1, where a strong cue to A interacts with two weakly specified targets.

The second way in which the task environment can be parameterized is by the distinctiveness of the targets, and hence, their relative attractiveness. Imagine as an extreme case, a hungry infant faced with two targets, a cookie and a familiar toy . Without any additional cuing, the baby would likely spontaneously reach for the cookie and would persist in going for the cookie despite being repeatedly enticed with the toy. This is a measure, therefore, of the strength of the task environment to compel reaching to one side or another. Thus, Stask,0 is the dimension of the field expressing the tendency of the infant to go to B when A is cued or to go to A when B is cued, in other words, to "spontaneously" be pulled to one side or another. Finally, parameter cA/B is used to express any asymmetry in the task arrangement. When the targets are alike and placed side-by-side, cA/B = 1.

To illustrate this, refer again to Figure 8. The bottom left panel represents the typical A-not-B situation used in Smith et al (1999b) where the task input specifies two identical covers or targets at A and B. Since the targets are indistinguishable from each other and not highly distinct from the background, there is no strong incentive for infants to reach to either A or B or even to reach at all. The task contribution to the motor decision field is centered around the target areas, but not biased to either location, and not very strong. Next, imagine that the usual A-not-B targets are replaced by one plain brown lid and one colorful and attractive toy. As in the illustration on the lower right of Figure 8, the decision field would then be biased even before the experimenter called attention to the A or B side because of the increased possibility that infants will go for the toy rather than the lid. A third possibility --two attractive toys side-by-side-- is shown in the illustration at the top left of Figure 8. Here, as in the first situation, the task environment does not offer a basis for choice of A or B, although both targets are attractive and the probability for a reach to either is high. Finally, a parameter task characterizes the spatial spread of the activation function, similar to that used in the field dynamics. For our simulations, this parameter is fixed.

Together, then, the specification of the task input for the A-not-B paradigm has the

following Gaussian form:

(10)
 
 

We chose the Gaussian because the function conveniently expresses the forms of the three parameters of the task input (location, strength, and width).

As discussed in an earlier section, because the targets are typically similar and relatively close, without other input, infants have no compelling reason to reach to preferentially to A or B and their performance is at chance levels (Smith et al. 1999). To overcome this, infants must be trained to go consistently to the A side by making that location more salient. Experimenters do this by moving the object-to-be-hidden, or the lid, forward so that it is closer to the baby, providing a visually more distinctive target.(1) This adds asymmetry to the task input field by making cA/B not equal to one, and thus biases the first reach--and subsequent reaches-- to one side. Changing the distinctiveness of the lids, that is making one lid a different shape or color, also biases the task field, so that once one side is cued, infants may have continual reminders of the differential targets. We simulate both the training and distinctive target effects.

Experiments have shown that other features of the task environment such as visual landmarks or background colors and surfaces can interact with the distinctiveness and placement of the targets to determine whether infants perseverate or not. As summed up by Butterworth et al. (1982) whatever the perceptual basis of this task, "it is extremely sensitive to variations in context" (p. 447). The task input field can be adjusted to simulate these variations.

5.2.2. Specific input. While looking at the task scene, the infant is cued to one target, A or B, and this cue also contributes in a graded way to the movement decision. Thus, the model has a second source of input, Sspec, the phasic visual cue of the experimenter waving, tapping, or otherwise calling attention to the target object. The specific input is similar to the task input as it is characterized as a location in motor parameter space, with a particular strength, and activation spread. It differs because it is a time-limited input to the field to simulate the transient visual cue which then must be held in memory. Thus, xspec represents in the equation the location where the attentional cue is delivered, in this case either at A or B. The strength of the input, Sspec,0, can be varied to capture the saliency of the cue. For instance, waving or hiding a bright, glittery or noisy object, a brightly colored toy, or a cookie will be more attention-grabbing than a plain colored cloth or lid, as is further illustrated in Figure 8. In addition, as in the task input, the specific input can have different values for its activation spread, spec. The cue duration is entered into the model as a step function. Thus, before the cue is given, the specific input is zero. During the cue, the specific input is non-zero at a constant level and then it instantaneously returns to zero at the termination of the specified time. A cue of longer duration provides more input to that location in the field. The form of the specific input is:

(11)
 

during the time interval of length T when such input is present and zero otherwise.

5.2.3. Memory input. The third source of bias to the field adds its influence and is in the form of a memory field in movement parameters retaining the shape of previous decisions to go to A or B. The memory field gets its input from the motor planning field, and thus encodes the history of all previous reaches. Because the planning field itself has integrated the two visual inputs, that of the task and specific cue, the memory also indirectly reflects the whole perceptual aspects of the repeated task. As it builds, the memory input contributes not only to the probability of a reach to A or B after cueing, but also to the likely direction of a spontaneous reach. As reported in Smith et al. (1999b), once infants have made a decision to go to one target or another in the absence of a specific cue, they are likely to stick with their choices. The memory input becomes stronger after each repeated reach to one location such that it may swamp the task and the transient cue. This is the heart of the A-not-B error.

The memory field itself has dynamics: it evolves continuously in time. These dynamics occur on two time scales, one related to its growth and one to its decay. First, the memory field grows in parallel with the motor planning field in the few seconds between cue and reach. More specifically, the planning field enters the memory field whenever the planning field is activated above a certain threshold, u0. The memory field has its own time scale, specified by mem :

Here mem governs the growth process of the memory, and

 

(u(x,t,)-u0)=1 if u(x,t,) >u0

and zero otherwise. The memory field reaches a maximum level after approximately 6 trials: mem =6.

This is where we conventionally test for the error with a cue to B.

The second time scale is that of the decay of the memory field. In theory, we assume that the memory field decays slowly in the absence of activity in the motor plan field in the time that elapses between one reach and the cue for the succeeding trial. For the purpose of these simulations, we have assumed that this time scale of this decay is much slower than the inter-trial interval, and therefore does not enter into the model as a parameter.

We believe this to be reasonable because perseveration, and hence, the persistence of the memory, is robust despite variability in the inter-trial intervals: delays between the end of one reach and the initiation of the succeeding reach ranged from 20 to 50 seconds. This means that within the time scale of the typical A-not-B task, this decay may not be critical, although we have not empirically tested these limits.

In summary, the contribution of the memory input takes the form

where Smem,0 is the strength of the memory input.

5.3 Output

The field represents a parametric movement plan, but is not linked to a model of the actual motor control of the arm (see Kopecz & Schöner, 1995, for integration with motor control of eye movements). Here we approximate such control b a simple read-out procedure: when the movement is elicited after the dealys, we assume that the locaiton in the field with the maximul activation describes the movement that is actually performed. This approximation is reasonable as long as no other manipulations or perturbations intervene at the control level.

6. Simulations

Before we discuss the simulations of the model and compare them to experiments, a few comments about the model parameters may be useful. As with any model, we have constructed a mathematical abstraction--one of many possible abstractions-- of complex and multi-determined behavioral events. Some aspects of the events, such as the timing of cues and trial length, can be assigned realistic parameter values. For others, such as the relative separation of the targets or the strength of the cues, parameter values are less directly mapped onto experimental factors. In principle, the basic effects are described by the model over a wide range of values of these parameters. However, the constellation of parameters is strongly constrained by the experimental results. The model is successful if two conditions are met in parameter assignment. First, the ensemble of parameters must produce the primary experimental effects when values are fixed such that all the orders of magnitude are reasonable in relation to one another. For instance, to produce the canonical error, the specific input must be sufficient to produce an A-side decision on the first reaches, but not so strong as to dominate the field interactions as the memory strength builds. Within such constraints, the precise parameter values are not critical: the qualitative effects are robust within a range of parameter values. But having determined these values on the primary result, the test of the model is to keep certain parameters fixed--the characteristics of the field, for instance--and to simulate different experimental results, as well as generate new testable predictions. Here, for example, we use the settings of the canonical effect and manipulate different values of the visual input. Together, these conditions show that the model is both internally consistent and externally valid. Note that the internal structure of the dynamic field model precludes simulation of any arbitrary input-output relationship through judicious choice of parameters. Strong theoretical assumptions of the mechanisms involved are incorporated into the equations, such as localized input with uniform width, homogeneous symmetrical interaction kernel, and superposition of inputs. Many connectionist models, in contrast, have been shown to be universal approximators under some conditions (Hornik, Stinchcombe, & White, 1989). The question addressed by the model, then, is how a decision to reach to A or B evolves under varying conditions, represented by the parameters of the model. To reach a decision, the dynamics of the inputs are thus coupled to the dynamics of the movement field. The model is integrated in time for 10 seconds, which realistically represents the time interval that begins when the infant looks at the display, the experimenter waves the target, the delay is imposed, and ending with the decision to go to one target to another. Evolution in time results from the dynamic equation, the solutions of which provide values of activation at all sites in the field as a function of time. These solutions can be visualized as activation landscapes representing the relative strengths of various values of the movement parameter signifying direction.

In the simulations, we solved the dynamic equations on a digital computer using the Euler procedure, in which one time step represents 50 msec. The results do not depend on the time step, which we chose to be sufficiently small so that the numerically obtained solutions approximate the real solutions of the dynamical system. An individual simulation run of 10 s realistically models a single reaching act. At the end of each run, the activation in the movement field is reset to zero, readying the system for a new trial. By contrast, we continuously update the memory field to reflect the build up of a history of reaches. A sequence of such individual trials reflects the experimental paradigm (6 reaches with cues at A, two reaches with cues at B). Such entire trial sequences are then repeated 500 times to obtain statistics. The outcomes vary from repetition to repetition due to the presence of stochastic forces. We store the value, x, at which the field has maximal activation during the sequences so that across the 500 repetitions, we can generate histograms that estimate the probabilities for moving to A or to B.

In the following series of simulations, we fixed the following parameter values of the motor field throughout: = 500 [msec], wi = 1, w e = 2, = .05, = -1.5. We changed the resting level, h = -6 for cooperative regime and h = -12 for the non-cooperative regime. Moreover, in the task field Stask,0 =1 remained constant and the spatial spreads of the inputs, w = task = speci= 10 throughout. Likewise, Smem,0 = 3 was fixed throughout. Thus, experimental effects were generated only through manipulation of h and the inputs, cA/B and Sspec,0.

The field was supplied with the task input for 1 sec, presented with the specific and memory inputs for 4 sec, and the delay was always also 4 seconds. These values are consistent with the real task timing.

6.1. Results

6.1.1. Properties of the movement field without specific input. The first simulation emulates the condition where the infant looks at the two identical targets, but no cues are given and there has been no training to A or to B. In these simulations, we have set the cooperativity of the field to be weak so that the local interactions cannot lead to a self-sustaining peak, as we would assume in an 8-10 month old infant. Figure 9 shows the evolution of a typical field at the first reach to A under these conditions, while (b) illustrates the corresponding growth of the memory field, in this case, nearly flat. The bottom histogram gives the probablities of A or B decisions from the repeated simulations shown over 17 time steps within the trial. The main influence on the field is the rather diffuse cues given by the task input; the field is noisy and converges to the weak specification of the task input. As can be seen in the histogram, the infant is equally likely to go to either A or B and because both are noisy and weak, possibly not to reach at all. Additionally, the memory dynamics generated by this initial condition are themselves weak and poorly differentiated, as can be seen in Figure 9.

Figure 9. (a) Motor field dynamics in non-cooperative regime with task input only (no specific input) at the first reach to A (A1). Parameters: Sspec = 0, Stask = 1, Smem = 3 (a). Motor planning field evolution (b) Corresponding evolution of memory field. In this figure and subsequent figures, x axis denotes field location, y is time, z is activation. On the y axis a letter code indicates the input present at different moments in time: T, task input, S, specific input (none added here), and D, the delay where no specific input is added. (c). Histograms of decisions to A or B from an ensemble of 500 simulations per trial showing the read-out of the field as a function of time. The decision to reach to A or B is probablistic; in this case, A or B is equally likely at any point in time.

The dynamic approach to the A-not-B error offered in Smith et al (1999b) made the novel predictions that without training that highlighted the A side, reaches to A and even reaching at all would be fragile. This is formally simulated by the model as well, and has been confirmed in experiments (Smith et al. 1999b). Identical targets, especially without additional cues, are confusing for infants.

6.1.2. Properties of the motor field at the first cued trial to A. In the second simulation, we illustrate the effects of adding just the transient specific input on the very first trial to the A side. Note in Figure 10 (a) under the non-cooperative regime (e.g.,younger infant) that this causes a rapid increase in activation to the A side and also a rapid decay as the peak is not self-sustaining. In the cooperative regime (e.g., older infant, Figure 10 (c)), the nonlinear properties of the field result in a more sustained above-threshold activation. This is further reflected in the evolution of the memory dynamics which are stronger at the end of the motor planning process in the cooperative vs. non-cooperative regime (Figure 10 (b & d)). This simulation captures an important developmental effect: the ability to remember the location of the cue when it is no longer there. The novel contribution of the model is to embed that memory in movement parameter space where it is behaviorally expressed only as part of the evolution of the other contextual and field dynamics (see Rizzolatti et al. 1997).

Novel predictions: This simulation also predicts that for younger infants, simply cuing at A without any previous training would lead to an increase of correct responding with shorter delays. This is currently being tested: however, age and delay interactions have been amply demonstrated on the B test trials (e.g. Diamond 1985; Clearfield, et al. 1999).

Figure 10. Motor field dynamics with task and specific input at A at A1. (a) Non-cooperative regime (h = -12). (b) Memory field evolution for (a). (c) Cooperative regime (h = -6) (d) Memory field evolution for (b). Parameters: Sspec = 4, Stask = 1, Smem = 3, cA/B = 1.

6.1.3.Motor field dynamics after repeated reaches to A. What happens when the memory of the previous reach is added to the task and specific input? In Figure 11, we report the results of a typical field evolution after 6 A trials, in both non-cooperative (a and b) and cooperative (c and d) regimes. But to more realistically simulate the actual experimental design, we also included an effect for the training trials. Recall that in the typical A-not-B task, infants were trained to A by gradually hiding the toy, or in the lids-only condition, progressively moving the A lid back over several trials. The effect of this training was to make the A side more salient at first, and then gradually more similar to B. This asymmetry in the training trials was implemented by decreasing the relative influence of the A location in the tonic task input over the first 4 trials by setting parameter cA/B to 2(A1), 1.75(A2), 1.5(A3), and 1.25(A4). In subsequent trials, when the task field was symmetrical, cA/B remained at 1.

In both regimes, there is a strong tendency to reach to A and a persisting memory of that location after the field evolves. The differences are in the relaxation of the field, which is more pronounced in the non-cooperative state and the strength of the memory, which is stronger in the cooperative regime. The simulations suggest that when cued at A repeatedly, both younger and older infants should be strongly attracted to that location. Why, then, the A-not-B error in younger, but not older infants?

Figure 11. Motor field dynamics after 6 reaches to A. (a) Non-cooperative regime. (b) Memory field evolution for (a). (c)Cooperative regime. (d) Memory field evolution for (c). Parameters:Sspec = 4, Stask = 1, Smem = 3, cA/B = 1.

6.1.4. The A-not-B effect: The B trials. We now add the critical test trial-- the cue at B after repeated reaches to A. The exemplar trial simulations illustrated in Figure 12 compare the motor fields in non-cooperative and cooperative regimes and show the A-not-B error (a) and correct responding (c), respectively, at the first B trial. In the simulation of the younger infant, the field evolves within the residual memory of A. The cue to B causes strong activation at first, but it cannot be sustained and decays rapidly. If the resulting field evolves over the full time segment, the movement parameters for an A reach predominate. If however, the infant were allowed to reach without the delay, the B activation would swamp the memory of the A reaches. In the cooperative regime, in contrast, the ability to hold the B side activation for the full extent of the field evolution leads to a stronger tendency reach to B both with shorter and longer delays, despite the influence of the A memory.

Additional simulations predicted that even after the second cue to B, infants should still be making the A-not-B error in the non-cooperative state and they should continue to be correct in the cooperative state. Indeed, in the classic version of the task, most infants perseverate to A on both the first and subsequent cues to the B side, although the number of infants making the error on both B trials is less than those erring on trial B1 (Smith et al. 1999b, Diedrich et al, 1999). That is, the memory of A persists through the second indication to reach to the B side. The simulation of the older infants, in contrast, indicates continued correct spatial choices. These differences are shown again in Figures 13 and 14 , which are the probability histograms generated by the full set of simulations.

Figure 12. Motor field dynamics at trial B1. (a) Non-cooperative regime. (b) Memory field evolution for (a). (c)Cooperative regime. (d) Memory field evolution for (c). Parameters:Sspec = 4, Stask = 1, Smem = 3, cA/B = 1.

Figure 13. Distribution of decisions to A or B from 500 runs per trial in trials A1-B2. Field is in non-cooperative regime; infants make A-not-B error. Note in B1 and B2 that the error emerges over the delay. At no delay, infants are correct, as shown experimentally. Other parameters same as in Fig. 12.

Figure 14. Distributions of decisions to A or B in Trials A1-B2. Field is in cooperative regime; infants reach correctly on B trials before and after the delay, despite the lingering memory of the A reaches. Other parameters same as in Fig. 12.

Thus, the model captures the important and well-documented developmental and delay effects of the classic A-not-B error: the error is reduced with age (and the ability to retain the memory of the target stimulus parameters in the motor field) and with short delays. But we can also address two related phenomena not usually considered in the traditional accounts: the effects of repeated reaching and the production of "spontaneous errors" or "incorrect" reaches to B on the A trials. This is best depicted in the relative memory strength to A measure that shows the evolving attraction for each location (Figure 15). The simulation in the non-cooperative regime closely modeled the experimental data of infants who made the error. But-and importantly- the simulation also shows that infants who are presumably older (i.e. cooperative) are correct on the B trials for a different reason than those who are younger (i.e. non-cooperative). The former are largely correct on all A trials and on all B trials. The latter are reaching to B on the A trials and thus form a stronger B attraction. There is not only an increased likelihood of A-not-B error (reaching to A when B is cued) in the non-cooperative regime, but also increased variation in the relative strengths to each side, suggesting more spontaneous errors (reaching to B when A is cued), as indicated in Figure 16

Thus, two processes are working in the noncooperative state. First, the A memory builds and without the ability to sustain the B-specific input the field relaxes to the A memory, creating the error. But the same dynamics also can produce correct responding typically seen in about 20% of the infants tested in the classic task. In some infants, the decaying A activation field allows for occasional noise-induced spontaneous reaches to B. Once an infant has reached to B during the A trials, the cumulated memory strength for A is reduced, generating less of pull to A and more tendency to go to B. If the A strength is sufficiently diminished by fewer A reaches, then the probability of the returning to A on the B cue is also decreased. This is precisely what was discovered in experiments (Diedrich et al. 1999; Smith et al. 1999b).

The appearance of spontaneous errors depends on noise, but also on the relative confusability of the targets. Note in Figures 15 and 16, as in Figure 2, that spontaneous errors become noticeable not in the earliest trials, but at A3 and A4, after the infant has been cued several times to A. Why would spontaneous errors arise more often after several reaches to A, when the perceptual motor habit for the target location presumably has been strengthened? Recall that in the first few trials, the task input is asymmetrical because the A target is made more salient. This asymmetry is strong enough to keep infants' attention directed toward the A side. However, as the targets become spatially closer and presumably less distinct, spontaneous mistakes to B begin to accumulate.

Figure 15. Relative memory for A in non-cooperative and cooperative regimes for simulations shown in Figures 14 and 15 plotted along with experimental data reported in Figure 2 for 9-month old infants who made errors on B1 or B1 and B2 and infants who were correct on B1/B2. The non-cooperative regime simulations closely fits the data for the infants who made the errors. However, the cooperative regime represents older infants who are correct on both A and B. The 9-months who were correct did so for a different reason: they made spontaneous errors to B on the A trials. Correct performance arises from different mechanisms in younger and older infants.

Figure 16. Standard deviations of the memory to A measures of Figure 16.

Novel predictions. So far, the model and the dynamic assumptions underlying it, generate several additional testable predictions based on the simulated effects in the movement field. First, the error resides, we maintain, in the ability to generate a reach in a specified direction and to retain a memory of that action that is sufficiently strong to swamp a new visual input. The model assumes that a critical contribution to the error is the inability of young infants to maintain a self-sustained activation peak in the movement field once the input disappears. One new prediction is that reaching location perseveration may be evident in infants younger than 8 months. This has never been tested because in the traditional view, eliciting the A-not-B error requires that infants can be trained to uncover hidden objects, and infants under 8 months will not do so. But in a no-hidden object task, perseveration may be seen earlier. More specifically, we suggest an inverted U-shaped developmental function: before about 5 months, infants do not reach consistently to even single targets. They will find the A-not-B task especially daunting and may not reach at all, or if they do, in such a sluggish and inconsistent manner as to not build up the memory to the A place. In addition, very young infants have unstable posture while reaching and their own postural wobble may act as perturbations to the growing motor memory. However, as soon as infants begin to stabilize their reaches and posture(Thelen et al. 1996), A-not-B errors should be evident.

In the simulations, we assigned the developmental shift from errors to correct responding to the parameter of the field, h, that determined the balance of activation widths and strengths in the interaction kernel, and we gave it two discrete parameter values to shift it between the non-cooperative and the cooperative regime. But we know from developmental evidence that there is an interaction between age and delay between hiding and search such that even 12-month old infants will make the error if the delay exceeds 5 seconds or so. This would lead us to believe that the developmental changes in the motor field's ability to sustain a peak of activation are gradual, not switch-like as in the simulations. Indeed the exact point at which the planning field switches from a non-cooperative (and thus perseverative) regime to a cooperative (non-perceverative) regime depends on all contributions to the resting level, not only on h. Different amounts of task input, controlled, for instance, by the attractiveness of both lids, affect this transition. Less attractive lids corresponding to less task input lead to lower resting levels in the field, which shifts the transition to larger values of h.

If so, we should be able to produce A-not-B type errors in infants and children much older than even a year, if we manipulate either the delay or the strength of the inputs. In particular, A-not-B errors should be evident with increasing age as the delay increases, the task environment provides fewer tonic specifications, and/or the specific input is less salient (i.e. shorter in duration, less attention-grabbing, etc.). This prediction is borne out in two recent studies. Smith et al.'s (1999a) use of the sandbox with two-year olds was described previously. In addition, Spencer (1999) showed systematic response biases in 6, 8, and 11-year olds and adults in a "space-ship" task where the task field was completely homogeneous and the delay between target and reach initiation was increased up to 20 seconds.

6.1.5. The role of target distinctiveness. As we stated earlier, there is suggestive experimental evidence that perseveration is reduced when the babies can more easily tell one target from another. In the following set of simulations, we systematically increased the distinctiveness of the targets: such a manipulation has not been done before experimentally. Then we report a test of the model predictions.

Figure 17 reports the relative memory strength to A measure from simulations generated as cA/B has been increased from 1 (both targets contribute equally) to 3 (targets contribute asymmetrically) in the non-cooperative regime. When the targets are alike, a clear A-not-B error emerges on both B cues. At moderate distinctiveness, there are more spontaneous errors, that is, more incorrect reaches to B on A trials, seen especially in early trials, and a decreased tendency to make errors on the B trials. Indeed, infants are likely to be correct on the second B trial. This effect is highly pronounced in the highest degree of distinctiveness, where infants continue to respond at chance levels throughout the early trials, and because of their many spontaneous reaches to B, more likely to be correct on both B trials.

Model predictions and tests. These parameter effects of target distinctiveness were specifically tested by Diedrich et al. (1999). One group of nine-month-old infants reached for one of two red targets on a red background. Three other groups performed the task with a B target that varied in its distinctiveness from the A target and background, which were always red. The B targets ranged from moderately distinctive (orange), to more distinctive (yellow) to highly distinctive (striped yellow and blue with red polka-dots and smiley faces). The percentage of A-not-B errors on the first B trial depended on target distinctiveness: 78% made the error in the red condition, 83% in the orange condition, 61% in the yellow condition, and 22% in the striped condition. Infants confused the red and orange lids, the yellow lid pulled the error rate down to chance responding, but infants were still making errors. In the striped condition, infants were likely to be correct-they could tell the A lids on the A trials and the B lid on the B trial.

Examination of the experimental memory strength measure shows that, as indicated in the simulations, the target distinctiveness may produce the discovered effects through two processes. First is through increasing the spontaneous error rate and thus, reducing the number of reaches to A (Figure 18). This is seen with the striped lids--the most distinctive condition, where there is a dramatic diversion of pulls beginning by the second or third trial. Recall that in trials A1-A6, the infant is being cued with the red lid, but even when that lid is made more salient by pushing it forward, the striped lid attracts sufficient visual attention to cause many reaches to that target. With more reaches to B, there is a strong tendency to reach again for B on the B trials. In contrast, in the other three conditions, when infants are trained to the A side they tend to stick there; just the presence of the B lid is not sufficiently compelling to pull them away. And in the red and orange conditions, the A habit predominates even when they are specifically directed to B. However, the yellow lid does exert some counterbalancing cue on B1 because more babies reach to B than when B was red or orange. The specific input cue partially overcame the perceptual-motor memory built up at A.

Figure 17. Relative memory for A for the effects of target distinctiveness. cA/B = 1-3. Parameters:Sspec = 4, Stask = 1, Smem = 3. Shown with standard deviations.

Figure 18. Experimental results on target distinctiveness. Relative memory for A measure from Diedrich et al. (1999).

The ability to "know" whether to go to one location or another is thus exquisitely influenced by the nature of the tonic visual field. These influences are dynamic, however, and depend both upon the relative differences between the targets as well as the system's immediate history of looking and reaching. We show by the final set of simulations that similar complex and coupled dynamics are predicted for the remaining input parameters, those of the specific input.

6.1.6. Strength of the specific input. In contrast to the task input, the specific input is time-limited. The experimenter hides the toy or waves the lid and then this stimulus is gone. One assumption of the model is that the relative strength of the specific input should also contribute to the A-not-B error. A plausible prediction from the model is that, with stronger specific input, there should be less A-not-B error and less spontaneous error as well, because the salient cue would more strongly specify the target.

The simulations revealed, however, more complex and dynamic effects. Figure 19 shows the memory strengths from four simulations produced by increasing Sspec from 3 to 6. For these simulations, we increased specific input the same amount for both A and B trials. Consider first the two extremes. When specific input at A and at B was weak, the model made many spontaneous errors to B and was thus more likely to be correct on B trials. This is also shown by the high standard deviations in Figure 20. This simulates the course of events for the 20% of the infants who are correct in the canonical task, and suggests that one factor in their "spontaneous" errors may be the relative, transient salience of the particular cueing to A or B. In contrast, when the cues at A and B were highly salient, the model predicted correct responding on both A and B trials, as attention remained riveted to the cued side, revealed also by low standard deviations in Figure 20. This suggests that even younger infants should not err on either A or B when the cueing event is highly attention-grabbing. Behaviorally, younger infants with highly salient cues look like older infants who can retain the cue longer in memory.

The two intermediate values revealed a nonlinear effect. The canonical A-not-B effect is seen at Sspec = 4, where there are some number of spontaneous errors on the A trials, but sufficient A responding to build a strong memory to that location such that it cannot be overridden by the B cue. The simulation at Sspec = 5 demonstrates well the dynamic competition between inputs and memory. Here the cue is strong enough to keep A responding mostly correct, but the B cue is just not quite attractive enough to counter the growing A memory and the model predicts errors on B1 and fewer errors on B2, where variability was especially increased.

Figure 19. Relative memory for A measure for increasing specific input, Sspec = 3 to 6.

Figure 20. Standard deviations of the A memory shown in Fig. 20

We can account for these complex results by considering more closely the changing dynamics, where several interacting processes are likely at work. First, recall that the motor field continues to evolve over the delay period under the influence of the three inputs, task, specific, and memory (e.g. Figures 13 and 14). But the impact of the specific input is most most potent at the very beginning of the trial. This generates a clear empirical prediction: with stronger specific input and with no delay on any of the trials, infants should be more often correct on the A trials and on the B trials as well because their decisions will be dominated by the specific input. The specific input has other, less direct, effects, as well. Weaker specific inputs at A create a comparatively weaker pull to that side and more noise-induced spontaneous errors.Additionally, and in concert, as the field evolves, the memory trace left by strong specific inputs exerts a stronger influence to the A side, also decreasing the spontaneous errors. However, these effects would tend to increase A-not-B responding, which is not what the simulations produced.

The simulations suggest that the effect of the stronger A memory built in the A trials is counteracted by the also stronger B cue at the B trials at the higher levels of Sspec. In other words, the effects of the specific input are nonlinear: at low and moderate levels, and with delay, the memory input dominates and the B cue cannot overcome it. However, at higher levels of specific input, and despite the clear good performance at A, the cue at B will be sufficient to cause correct responding. This is certainly consistent with the experiment of Smith et al (1999) described earlier, where diverting infants visual attention at the B trials could reinforce or perturb the A-side habit. The B cue here clearly acted to increase the specific input to one side or the other.

These simulations reveal intricate dynamics whose plausibility we are currently testing by empirical studies. The studies involve manipulations of the relative strengths of the specific inputs by varying the saliency of the stimuli (e.g. highly colored flashing lights vs. brown bottoms of the lid), their duration (long versus short presentations) and the delays between terminating the specific input and allowing the motor field to evolve (1 vs. 3 seconds). The logic of the experiments and the predictions are summarized in Table 1. Preliminary results strongly support the predictions (Clearfield, et al., 1999). Such experiments can be followed by another class where the strengths of the specific input are directly pitted against the memory dynamics, such as was done by Munakata (1997) where a toy at the B trial was tested against a plain lid on the A trials.



Table 1

Predicted interactions between specific input and delay

Levels of specific input at A-not-B errors

A and B, from lowest (1)

to highest (4) No delay Delay
 
1 R E
2 R-C E
3 C R
4 C C

R= random responding on B1

E= errors on B1

C= correct on B1
 
 

7. Discussion

7.1 Strengths of the model.

7.1.1. Realistically simulates all A-not-B results. The model simulates the known age, delay, and context effects of the well-studied Piagetian Stage IV error in a very powerful way. But it also captures additional, new effects discovered by Smith et al. (1999b) in experiments designed from a qualitative dynamic perspective. These include 1) hidden objects are unnecessary; 2) the task is novel and perceptually confusing; 3) reaching and looking are coupled 4) the reach memory is body-coded, and 5) repetition is essential. The model accomplishes all this without invoking constructs of "object representation," or other knowledge structures. Rather, the infants' behavior of "knowing" or "not-knowing" to go to the "correct" target is emergent from the complex and interacting processes of looking, reaching, and remembering integrated within a motor decision field. Moreover, the model accounts for the contextual variations discovered in dozens of studies within a single set of dynamic equations. This does away with the need for invoking new cognitive strategies for the discontinuities between performance, for instance, at no-delay versus delay or between the A and the B trials (i.e. "direct" vs. "inferred" search strategies). The nonlinearities inherent in dynamical interactions produce overt behavioral switching as a result of small changes in parameters and as a function of the time-evolution and previous history of the system.

Likewise, the model shows explicitly how behavior that looks like inhibition or lack of it can arise from processes where no explicit inhibitory mechanisms are switched on. Inhibition plays an important role in Diamond's (1985) influential account. In particular, Diamond maintains that the ability to inhibit reaches to A increases with the age-related maturation of the prefrontal cortex. The puzzle, however, is in the A-not-B task, infants look like they are unable to inhibit their reaches to A at some ages, with some delays, and with certain task features, but can inhibit A-responses at the same ages, under different circumstances. It is not clear in Diamond's account, why, if prefrontal maturity alone is the critical element, the number of A reaches or the distinctiveness of the targets should matter so profoundly or what indeed switches inhibition on or off at any given age. In the dynamic field model, inhibitory connections are embedded in the interactions in the field. But the behavior of either reaching or not reaching is emergent entirely from the dynamic interactions between activation processes and their changing strengths over time. We need not manipulate inhibition per se. Both age- and context-effects are simulated in the very same regime.

7.1.2. Reaching errors come from general processes. A second important strength of the model, therefore, is to situate this particular developmental phenomenon--perseverative reaching under particular conditions-- within the continuum of normal processes. We demonstrate that the error can be understood in both its "classic" form and in all its previously perplexing variations entirely by the dynamics of these processes. We are able to account for known and quite subtle interactions between, say, delay timing and age, but we also make entirely novel predictions about the relative contributions of the visual input, number of reach repetitions and delay as well. These complex interactions would have been difficult, if not impossible, to understand without the formal properties of the model. In addition, the model accomplishes these matches with relatively realistic parameters based on known experimental effects, for example, the trial, inter-trial, and delay times and specific input parameters.

Although we have devoted this paper to the developmentally-specific A-not-B error, the model was adapted from one originally formulated to account for much more general issues of movement programming. (The primary modification here was to take into account the task input, to deal with delays, and to vary the degree of cooperativity leading from an input-driven to a self-sustained mode of operation.) In its original form, the Erlhagen and Schöner (1999) model captures an impressive number of important and well-researched phenomena associated with motor planning, but conventionally described in information-processing terms. These included not only the Ghez effects we described in an earlier section, but also many basic reaction time findings such as increases of reaction time with increased number of choices, decreased probability of choices, and increased distance between targets, as well as many different priming effects, stimulus-response compatibility, and pre-cueing experiments.

That the very same dynamics that produce these fundamental aspects of adult perceptual- motor functioning also give rise to the developmentally-constrained A-not-B error gives strong support to our contention that the error is not one of object representation in the accepted Piagetian sense. Rather, the error appears as part of continuum of developmental changes affecting reaching, looking, and remembering and the planning for action. One assumption of the infant model is that there are changes in the ability of the movement planning field to sustain an activation peak in the absence of specific input. Once the cue--either waving a lid or hiding a toy-- has disappeared, infants must hold the location of that cue "on-line" for a few seconds until the target is reachable. Clearly, this ability to keep relevant, but no longer visible, stimuli in memory is a critical developmental achievement. The degree of cooperativity of the field may have other potentially powerful developmental effects as well. For instance, limited cooperativity leads to an increase of the effective time scale over which the field builds up, as the excitatory action contributes relatively little to the growth process. This results in a relative "sluggishness" of the field and thus of infants' abilities to reach a threshold activation sufficient to generate a reach. Informal observations have indicated dramatic differences between new reachers and one-year-olds in their ability to generate a reach once the target is presented. Young infants intently stare at the atttractive toy, open their mouths in anticipation of the toy, and yet they may take many seconds to actually begin arm movements directed to it. Older infants, in contrast, rapidly produce an appropriate movement (Thelen, unpublished observations). We believe that the parameter changes that lead to this decrease in infant "reaction time" may be part and parcel of the same processes that produce perseverative reaching errors.

But other parameters may be undergoing developmental change as well. The contribution of developmental changes in the perceptual differentiation of the targets (Gibson 1969) is unknown, for instance. However, it is reasonable to assume that 6-month-olds view two brown lids on a brown box differently than do 12-month olds. Experience interacting with many different objects and, as locomotion develops, viewing the world from many different perspectives must produce the ability to distinguish more action-relevant features. The model allows us to simulate and then empircally test not just the presumed developmental changes in the field cooperativity, but also changes in the relative activation spread in the input fields, a measure of the relative distinction of the targets.

Finally, we believe that the A-not-B-type task can provide a sensitive assay for infants' changing abilities to form perceptual categories, and therefore, a way to investigate the functional integration of the "what" and "where" visual streams. Based on the model, we understand that graded differences in both the task and specific input are coupled together in the decision field for action and that these inputs are retained in memory along with parameters of movement. Thus, if we ask infants to repeatedly reach for one of two brown, circular lids several times, what would happen if we then substituted, say, a brown star- shaped lid for the first B stimulus? By asking when they "notice" a difference in a second test stimulus after they have become accustomed to the first, we can probe the nature of the stimulus memory in the context of reaching in a specified location. Is a star-shape sufficiently distinct as an object to-be-reached-for to pull infants away from the original habit? Could we get infants to generalize to a class of any novel brown-colored lids by switching shapes on each A trial? Could we get them to ignore novelty completely by appropriate training or, conversely, might training on variable targets retard the specific spatial perceptual-motor memory altogether? Many other variations are possible. This procedure is logically similar to the frequently used habituation-dishabituation paradigm, where infants are allowed to look at a target until they lose interest and then they are tested with a second more-or-less different target. The attractive difference is that with A-not-B type techniques (for infants who can reach) the infant remains keenly interested in the task throughout, which eliminates a motivational confound.

7.1.3. Biological plausibility. The model as it stands is neutral as to an anatomical instantiation in the central nervous system; it is a model of the behavioral dynamics. The leap from behavioral dynamics to neural anatomy must be undertaken with great caution. Nonetheless, we believe that there are two ways in which the behavioral dynamics are biologically realistic and fully compatible with processes occurring at the neural level.

First, dynamic field concepts can provide a plausible depiction of behavior of populations of neurons representing simple movement parameters such as movement direction (Georgopoulos, et al, 1988) or extent (Fu, Suarez, & Ebner (1993) (see Erlhagen & Schöner, 1999, for further discussion.). The usefulness of such concepts was recently demonstrated by Bastian, et al (1998) in an experiment where monkeys were given varied amounts of prior information about upcoming movement tasks. Recorded activity from populations of neurons in motor and premotor cortex showed that this information biased the dynamic population field before the movement task itself was cued, much like the memory and task inputs preshape the field in the current model.

Having suggested this potential link to neural population activity, we want to reiterate that at this point, our model is not anatomically specific, but is best conceptualized in terms of the entire ensemble of processes involved in the A-not-B task. These coupled processes-- attention, motivation, visual processing, planning, moving, and remembering over several times scales--surely require the entire, densely connected and multiply re-entrant circuits of the brain. For instance, at least eight distinct motor fields have been identified in primate frontal cortex and each receives varying inputs from frontal and parietal visuospatial cortical regions as well as multiple connections from basal ganglia and cerebellum, all of which are critically implicated in motor sequencing, timing, planning, memory, and learning (see, for example, Jeannerod 1997; Schwartz 1994). In the last few years, studies in awake, behaving monkeys have clearly demonstrated that the kind of integrative processes we propose in the model can be detected widely in the brain, and especially in areas long thought to be primarily motor. Using tasks that have much in common with the A-not-B error, these studies suggest that motor cortical area may be deeply involved in both sensorimotor transformation and a retention of the stimulus in memory. Table 2 summarizes a sample of such studies in four regions of cortex. In each study, investigators recorded activation of single neurons or neural populations in different reaching tasks. In some, delays were imposed between stimulus and response to probe the maintanence of the stimulus over the delay. In others, monkeys were given instructional sets to disassociate the actual movement direction from the stimulus direction. An entry in the table means that researchers found significant neural response to the identifiable aspects of the task.



Table 2

Neural activity in four brain areas related to sensorimotor functions in A-not-B task

Neurons encoding: Motor Premotor Parietal Prefrontal
Visual stimulus
cue-based 12, 16 4, 10, 17 1, 11 4, 5, 13
set-based 16 4, 17 1, 11 4, 5
location-specific 16 10, 17 1, 11 5
object matching 13
Look
eye position or

saccades

2 1 5, 13
Delay
cue held over

delay

12 4, 10, 12, 17 1, 11 4, 5, 13
Plan
active before

move. onset.

7, 8, 16 3, 4, 17 1, 11 4, 5
Reach
direction 6, 16 3, 17 1, 11 5
force 9 15
arm posture 5, 17 10, 15 11, 15
somatosensory 10, 17
Remember
memory of

previous acts

16 17 13

1. Anderson, et al (1979); 2. Boussaod (1995); 3. Caminiti, et al (1991); 4. diPelligrino & Wise (1993); 5. Fuster (1989); 6. Georgopoulos, et al (1988); 7. Georgopoulos, et al (1989a); 8. Georgopoulos, et al (1989b); 9. Georgopoulos, et al (1992); 10. Graziano, et al (1997); 11. Kalaska & Crammond (1995); 12. Kettner, et al (1996); 13. Miller et al (1996); 14. Scott & Kalaska (1997); 15. Scott, et al (1997); 16. Shen & Alexander (1997a). 17. Shen & Alexander (1997b).

Table 2 makes clear that the kinds of integrated neural activity assumed by the infant model is widely distributed, even in brain areas traditionally believed to be primarily "sensory" or "motor." The traditional distinctions do not hold (see Jeannerod 1997, for a full review). Each area has neurons responsive to nearly all aspects of a reach-over-delay-type task: cells in the motor cortex respond to the visual stimulus and hold a cue over delay and the prefrontal cortex has neurons that respond to movement direction. Likwise, neuroscientists have revised their views of function of parietal cortex. This area, long thought to be a site of visual integration, is now believed to both abstractly represent space and to form motor plans and intentions (Andersen 1995; Snyder, Batista & Andersen 1997).

7.2. What develops?

7.2.1 The role of the prefrontal cortex. The issue of what brain regions support the sensorimotor and memory integraton suggested by the model is important because, based on the work of Diamond (1990a,b; Diamond & Goldman-Rakic 1989), the A-not-B error is believed to be a hallmark task for dorsolateral prefrontal cortex maturation. The heart of Diamond's argument is the similarity between infant perseverative behavior and the kinds of errors made by monkeys and adult human patients with prefrontal lesions in similar, but not identical tasks. Both groups have difficulty remembering a stimulus over a delay and seem unable to inhibit prepotent responses. (Human prefrontal patients have many additional problems with attention, sequencing of actions, planning, and emotional modulation. Damasio 1995; Fuster 1989.) We believe that the issue is not whether the prefrontal cortex is involved short-term memory and action planning, for the evidence is incontrovertible. Rather, we question whether the developmental changes seen between say 8 and 12 months in the A-not-B task can be confidently attributed to maturational changes in prefrontal cortex alone, such that it is now assumed to be a marker of such maturation.

There are several reasons why we think such a conclusion may be premature. First, it is true that infants perseverate, and so do lesioned monkeys and humans. But there is little evidence to show that infants make perseverative errors using the same underlying neural mechanisms as adult primates with the loss of a piece of adult brain. It is seductive, but dangerous, to assume that an animal lesioned after thousands of trials of training equals an inexperienced infant.The experience-dependent and epigentic processes of brain development cannot be reversed by lesioning. Moreover the evidence from lesioning infant monkeys is equivocal (e.g. Malone 1994). For all of these reasons, we believe it is theoretically inappropriate to try and "lesion" the model to simulate prefrontal patients and monkeys.

Second is the problem of circularity. The evidence that prefrontal cortex "matures" between 8 and 12 months is performance on A-not-B type tasks at the same time that A-not-B tasks are taken as the hallmark of prefrontal maturation. Indeed, when measured by a host of other tasks (which are also deficient in frontal patients) such as Tower of Hanoi problems as well as by anatomical, electrophysiological, and imaging techniques, the progression of prefrontal cortex towards adult-like states is very gradual, extending into adolescence (Malone 1994). What task and what aspects of that task, then, can be independently labeled as containing the essential elements of prefrontal maturation?

Diamond (1997) identifies two critierion responses that are both necessary and sufficient to mark prefrontal involvement: remembering the hiding location over a delay and the ability to inhibit the previous responses. But our model and experiments suggest that both of these seemingly fixed properties of the task are not fixed at all, but entirely dependent upon the way the experimenter structures the targets, the cues, and trial sequences. This is a particularly critical issue because Diamond's version of the A-not-B task seriously conflates the two aspects of memory identified in the model. Diamond's procedure is complex (see Diamond 1997), but essentially presents the infant with two trials at A with a certain delay, after initial training to uncover a hidden toy. If the infant correctly reaches twice to A, the infant gets a switch trial, where the toy is hidden at B. If the infant is correct on all three trials, then the delay is increased with the 2 A trials now positioned at the former B location. If the baby was incorrect the delay is decreased. This procedure is repeated until experimenters find the delay at which infants were correct on A trials, but made the error at B, and this delay is what determines their prefrontal functional level. Diamond adds elaborate rules for determining how many reversals, how many correct trials and how many perseverative strings to one location are allowed before the experimenter changes the procedure. The problem with this procedure is that, while providing multiple reversals to test perseveration, it is completely uncontrolled as to the number of actual reaches to either A or B before any particular "test" reversal, and these data are not reported. As we have seen in both model and experiments, the number of elicted and spontaneous reaches to each target is absolutely critical in building up the memory input, and there is a complex relationship between the delay, the cue and the preshaping of the field with the history of reaches. Say, for instance, an infant is tested at a 3 s delay and reaches correctly to both A and to B. The experimenter would increase the delay to 5 s and test with two more reaches to B and a reversal to A. If the infant then makes the error, Diamond would interpret this as a prefrontal deficiency involving inhibition or delay. But, on average, given the strict relationship between number of reaches and tendency to stick (Smith et al. 1999b), the baby would make the error at 5 s because he or she would have reached more recently 3 times to B. But the baby also might have made the error at 3 s, because the delay and the number of repetitions are not independently controlled. So it is impossible to tell whether failure is due to delay or to repetition, or success is due to chance. As these procedures commonly make 12 or 15 target switches to establish the criterion delay, each involving 3 or more reaches, the infant ends up reaching many times to both A and B. The tendency of the infant to reach randomly to A or B is greatly increased. In addition, to keep the infant's interest in the procedure, Diamond (1997) sometimes changes the toy that is hidden. If our model and experiments are correct, this changes the strength of the specific input, which itself has a complex and nonlinear interaction with repetition and delay. All of these manipulations matter, and thus, the experimenter's ability to interpret either switches or perseverative strings as caused by either delay or inhibition is seriously compromised.

7.2.2 Modeling developmental change. Infants will reach correctly over longer delays as they get older. This fact is not at issue. What is still open for discussion is by what mechanisms this change comes about. The lesson from the model, the experiments, and from Table 2 is that there are multiple, parallel, integrated mechanisms contributing to this performance at any time and in any particular context. Thus, there may be and probably are, many avenues of change.

In the present model, we capture these distributed processes in a single integrative motor planning field. In particular, we simulate the appearance and disappearance of errors by changing h, the field's cooperativity. This is the heart of a dynamic model: a small change in one parameter value creates an instability that drives the field into a new regime, the ability to self-sustain the stimulus input. Manipulating the h parameter raises the resting level of the field and thus increases the weight of the field interactions relative to those of the inputs to the field. In this sense, we are changing the relative influence of the internal, mental processes over those dominated by the input alone.

The question then becomes: What might be driving such a developmental change? Behaviorally, infants look as though they gain an increase in working memory, and it is seductive to attribute this to autonomous "maturational" changes. But the process may be more complex and contingent. For instance, recall that the simulations and experiments showed that nine-month old infants will perserverate or not as a function of the visual task parameters. This cannot be attributed to maturational improvements in working memory. Rather, an especially salient target or cue acted to change the balance between the internal dynamics of the field and those of the input such that the input dominated. In this sense, the strength of the input determined the strength of the memory.

This shift from perseveration to no-perserveration with a real-time manipulation may hold one clue to what might be happening over developmental time. For example, we know that with development, perceptual abilities become more refined (Gibson, 1969). With perceptual learnng, infants may better distinguish the two hiding covers. Thus, what might initially be represented as two broad, diffuse bands of overlapping excitation might at a later time be characterized as more sharply focused, stronger peaks of activation. Because of the connectivity of the field, stronger input at these sites increases the interaction effects and can lift the whole field above the h threshold, with the resulting increase in cooperativity. A stronger attentional mechanism would have a similar effect if increased attention resulted in stronger perceptual activitation. Indeed, the process of perceptual learning may be reflected in changes in the connectivity itself: As perceptual discrimination sharpens, the local sites of excitation may be pruned back, leading to more sustained and localized peaks of activation, again with the behavioral appearance of enhanced working memory.

But there are other ways as well in which infants' experiences may lead to improvements in A-not-B performance. Between 8 and 12 months, infants undergo many relevant and important behavioral changes associated with reaching for objects and remembering spatial locations. During this period, infants gain a measure of control over their arms such that their reaching stabilizes to become more direct, smooth, and appropriately timed (Thelen et al. 1996). They learn to sit independently and to use their stable torso as a base of support for reaching in many areas of the space without tumbling over. They learn to shape their hands in anticipation of objects to be reached and then to differentiate the fingers to pick upsmall items. They begin to break out of the nearly obligatory object-in-the-mouth movement to hand-to-hand transfer, and fine manipulation. They start to incorporate manual actions with locomotion such as crawling and walking. And they begin to have highly differentiated manual activities with objects of different properties, such as squeezing soft toys and banging noisy one. It is a time of active exploration of the properties of objects by acting on them and of active exploration of space by moving through it. It is likely that these actions would profoundly change all areas of the brain, but especially those directly involved in attention, perception, planning, movement, and memory, including, but not limited to those areas mentioned in Table 2. Any or all of these multiple developmental advances can be envisioned as "priming" the field similar to the simulation of the resting level, h. Alternatively, such activities may work to change the dynamics of the motor memory such that there is more rapid decay between trials. Perhaps these changes occur primarily in the prefrontal cortex, but at this time, there is no direct evidence --except analogies to lesioned animals--to support this.

7. 3 Limitations

The model, therefore, while biologically plausible, captures an integrated behavioral outcome. It is not anatomically specific and is not directly couched in terms of central nervous system structures and circuits. Whether anatomically-driven models such as the powerful ones of Bullock and Grossberg (1988) can also simulate the infant behavioral dynamics is an important question and one that we would hope to see addressed.

Second, given our emphasis on the motor aspects of this task, the model is incomplete in not yet coupling with the trajectory dynamics of the actual reach that results from the motor plan. This is important because the memory that builds to capture subsequent reaches is not just a spatial location memory and a memory of the task space, but also some lingering trace of the trajectory of the arm going toward the target (Diedrich et al, in press). Moreover, as reported in Smith et al. (1999b), the motor memory embodies the infants' postural set as well such that postural manipulations perturb the location memory. It will be a major challenge to actually bridge the mental dynamics of the plan and the execution dynamics of the reach. We have taken the first step by casting the mental events in compatible dimensions.

Thirdly, the model is limited to a single parameter dimension, that of spatial location. It is a "where" stream model, in the language of the classic division of visual processing into a dorsal "where" stream and a ventral "what" stream. In this sense, we have turned the traditional interpretation of the A-not-B task as one of object recognition on its head. In the model, we have deliberately interpreted the hiding event as a stimulus that increases activation at a "where" site, making that position more or less salient and between cookies and toys as one of intensity not object identity. But, in everyday life, what and where components of actions are completely interwoven. People do not just reach out for locations, they reach for particular items, and the nature of the objects and people's intentions with them determine the characteristics of their manual actions (Castiello 1996). Somewhere in the final set of motor plans, these aspects of the visual world must be melded together (see Jeannerod 1997, for a contemporary interpretation of visuomotor processing). As we mentioned previously, A-not-B type experiments have important potential for understanding how this integration develops, and thereby helping to blur the distinction between pure "knowing" and mere "acting." This might be accomplished by experiments where targets similar in salience but differing in object properties are tested using various parameters of delay and repetition. Theoretically, we believe that dynamic field models can in principle be expanded and extended to encompass the semantics of action decisions as well as their spatial aspects. The driving assumption here is that such semantic content is represented in the same fluid, continuous, and time-based spaces as those integrating the multiple contributions to the spatial reaching decision of the A-not-B error.

Finally, the present model is restricted to static visual targets. Infants are more adaptive. They can actually intercept moving targets at remakably young ages (Hofsten 1983). Moreover, they can reach in the dark to auditory targets after they have seen them first specified visually (Clifton et al. 1993). We believe the model can accomodate these special conditions and we are currently pursuing this line of work.

7.4. Comparison to other models of the A-not-B error

Our model shares a basic assumption with two connectionist models of A-not-B performance, but there are fundamental differences as well. Both the earlier Dehaene and Changeux (1989) and the more recent Munakata (in press) models are similar to the present one in suggesting a competition between the history of the system and the current perceptual specification. In the first, Dehaene and Changeux (1989) simulated both the A-not-B error and the delayed-match-to-sample behavior using a simple two-layer connectionist model. The A-not-B error arises from the structure of the first layer, which establishes a direct stimulus-response linkage and works by Hebbian learning. During each A trial, the association between the sensory representation of the A location and the grasping response is strengthened. After a number of trials to A, the response is so strong that the B representation cannot override it. Behavior is determined by the history of success. The second layer contains an active memory component whereby the memory of the specific association can be maintained during a delay. When this layer is connected to the first, the perserverative response to A can be counteracted by the current association. Younger infants make the error because, their second layer, presumed to be located in the prefrontal cortex, is not yet connected.

There are several ways in which the Dehaene and Changeux model is unrealistic. First, the model learns on the basis of reinforcement for correct responses and punishment for incorrect ones. Reinforcement, i.e. being "correct," does not seem to matter in real infants, and indeed, researchers have not identified what aspects of the A-not-B task are actually motivating. Infants may not distinguish between grabbing the "wrong" cover or just reaching itself from obtaining the hidden toy. Our model does not need reinforcement or punishment to get the perserverative error. Second, their model situates the error in a different process (layer 1) than the one associated with correct responding (layer 2). Since the layers are either connected or not, the model cannot account for the many graded and contextual effects that are the hallmark of this task. Our model well captures the interactions between the stimulus situation, motor field characteristics and memory within a single system. Finally, and related, they cannot account for the delay effects . In their scheme, young infants will make the error no matter what the delay. In contrast, we can quite naturally explain delay effects and again, the important interactions between delay and other aspects of the task.

Munakata's (1998) connectionist model is more successful in simulating the real-life variations in the A-not-B task than the one offered by Dehaene and Changeux. Her current model is a departure from a previous connectionist model of Munakata et al's (1997) where the A-not-B error was conceptualized soley as a failure of object representation strength. In the more recent version, Munkata (1998), inspired by Smith et al.'s (1999) finding that hidden objects are unnecessary, rejects "infants knowledge of an object's location...as a reified entity, disembodied from underlying processing mechanisms" for an account based on two types of memory mechanisms, active and latent. The model consists of a 6-layer architecture. Three input layers encode information about the object location, cover type and toy type. The one hidden and two output layers contain units representing locations A, B, and C. The output is mapped onto both reaching and a gaze or expectancy layers which differ in their frequency of response since infants can presumably always look, but not always reach. The hidden and output layers have both inhibitory connections between units and self-recurrent excitatory connections back to each unit.

Starting with a small initial bias toward correct responding, the network develops an association between the input location and the motor response by Hebbian learning adjustments to the connection weights. As in other models, the repeated presentation at the A location strengthens the mapping to the A response. Because the B response stimulus shares input representation, when B is cued, the more potent A networks respond and the error occurs. The developmental change is modeled by changing the degree of recurrence in the network. With more recurrence, the activation of the units changes more gradually over time and the active memory can be maintained longer over the delay. This leads to a decrease of the influence of the A-latent memory and increase of relative strength of the B active memory. At low levels of recurrence, the B representation fades more rapidly than the latent A memory, accounting for the delay effect. Finally, Munakata (1998) can simulate the effects of multiple and distinctive covers by reducing the degree to which the sensory representations overlap in the network. Overlap is necessary for the B cue to activate latent A representations.

Thus, despite the different instantiations, there are strong similarities between Munakata's model and our own. Both have two forms of memory that interact, giving rise to the delay. Both simulate the age effects by comparable mechanisms allowing for self-sustained representations of the cue. And both can assign parameters to aspects of the input that result in shifts from error to no-error performance within the same system.

Nonetheless there are also some non-trivial differences (Smith & Scheier 1998). First is our adoption of a unified field in motor space that integrates the visual aspects of the task based on experimental results that looking and reaching are coupled.The unified field is the essential notion of embodiment. Munakata, in contrast, assumes two disassociated systems for looking and reaching, which do not interact. She does not represent target location in the same metric as body-location, which makes it impossible to incorporate the peculiarly egocentric and syncretic nature of infant reaching (Diedrich et al. 1998; Smith et al. 1999). Second, Munakata's model does not take into account the persistence of the task input, separate from the actual cue. Thus our model can, without further additions, simulate situations where both the specific input and the distinctiveness and/or location of the covers is parametrically varied. Finally, Munkata's model does not attempt to model the stochastic nature of real performance, i.e. that babies, when cued to A, sometimes reach to B, and that this chance B behavior influences the next reach. Our model behaves like real infants not only in the outcome of error or no error, but also in the real time, trial-by-trial execution of the behavior.

8. Conclusions

Readers who have made it to this point have perhaps asked themselves along the way, "Why all this fuss about such a simple little task in infants"? As the considerable army of A-not-B researchers have all discovered, the task and its infinite subtleties are endlessly intriguing. This was true when the A-not-B error defied a unified explanation, and is even more so now that we are moving closer to such an explanation. This is because, thanks to Piaget's genuis, we have an entry into much more general processes of perceiving and acting by observing them in human creatures who are not fully skilled.

There are two take-home messages. First is that both the robustness of the task and its exquisite context-sensitivity are emergent from the dynamic processes that produce decisions to act in the face of a rich visual scene and the recent and more distant memories of similar actions. But this snapshot of a developmental moment in infancy is revealing only as it illuminates issues inherent anytime we use an experimental task to probe the contents of mind. If the A-not-B task can be understood in terms of these coupled dynamic processes, then similar analyses can be applied to a large class of tasks used by infant researchers to determine what infants know.

For instance, we have situated these dynamics in motor plans for reaching. But the model is equally suited to the dynamics of eye and/or head movements, which are, of course, motor decisions as well (see Kopecz & Schöner 1995; Kopecz et al. 1995; Schöner, Kopecz & Erlhagen, 1997). Extending this model to gazing would be a particularly critical theoretical breakthrough, because much of what we know about infant cognition is garnered through looking experiments, using both static displays and events. There is a widespread assumption, for example, that experiments, using preferential looking or more commonly, habituation/dishabituation designs, are a more sensitive indication of infants' core "knowlege" than their behavior in the more demanding reaching experiments (see Bertenthal 1996; Munakata et al. 1997, for instance). Thus, the disassociation between what infants show that they "know" by looking and what they "know" by reaching is attributed to the performance demands of a manual action, while looking remains unfettered by motor constraints. But in a very real sense, looking at or away from a event display is a motor act integrating attention, vision, and memory processes just as much as reaching toward one place or another. The motor fields are different, but the activities are formally identical and should be captured by the same class of dynamic equations. Likewise, a formal model should generate precise predictions about how the parameters of the task change the outcome, and should reflect the delicate nonlinearity we have demonstrated here. By this reasoning, neither looking nor reaching provides a direct readout of the contents of mind. Both are constituted on line, within the moment, the bounds of the task, and the child's history in similar situations. Looking tasks, like the A-not-B, can tell us a lot about how these processes work together, but they cannot claim privileged access to the enduring contents of mind.

8.1. Embodiment: Are we there yet?

The second message of this analysis relates to the embodied nature of mental activities. How, the reader may ask, can we stake a claim to embodiment when our model does not incorporate the behavioral output in terms of the kinematics or kinetics of the actual movement?

We acknowledge again that a full model of infant reaching would integrate the decision level with the real-time control of the movement. Indeed we have reason to believe that this can be accomplished. First, in their original version of the dynamic field model, Kopecz and Schöner (1995) linked the levels of visual information and motor planning to the actual control of gaze in saccadic eye movements. They did this by having the motor control level align the gaze to the planned position by generating an eye trajectory with an appropriate endpoint. Of course, the motor control of three-dimensional, multi-joint reach trajectories is considerably more complex than that of one-dimensional eye saccades. Thus, the linkage between the decision level and the control of the movement is more difficult to model, but the difficulty, we believe, is in identifying the appropriate parameter space at the motor control level. When the movement parameters are well-defined, the field model can link decision to output (and output back again to decision).

The success of the field model in this regard has been quite dramatically demonstrated in robots. Schöner, Dose & Engels (1995) implemented this linkage from information to planning to movement using an automonous, mobile robot controlled by dynamic field architecture. The architecture allowed the robot to approach a target and avoid obstacles by means of field dynamics that integrated of visual input, memory, and planning, and generated movement. Finally, with Wolfram Erlhagen and Estela Bicho, we recently adapted this robot to simulate infants' A-not-B behavior, using the current field model architecture. The robot, equipped with microphones, faced two tonically active sound sources (A and B), analogous to the task input. One sound source (A) signalled the robot with a short tone, similar to the specific input in the visual simulations. These two inputs were integrated in the field. After the simulated delay period, a homogeneous input ("go" signal) drove the field into a decision-making mode in which a single, localized peak emerged. Once this peak reached a criterion threshold, the robot moved in the specified direction, to A or B. The probabilities of deciding to go to A or B were similar to those seen in real infatns. Then, the read-out of the decision field--the actual heading direction of the robot-- was used as input in subsequent trials as memory of the just-completed direction. After a number of successful A trials, the robot perseverated when subsequently cued to B, as did the infants. Importantly, the robot could be made to perserverate or not by the parametric settings of the input and the critical h parameter of the field.

It is gratifying that the model could be implemented in a real-world, moving device. There is, however, even a deeper sense in which the robot (and the simulations) are models for embodiment. This is because the dynamic field acts, in the words of Andy Clark, to "simultaneously describe aspects of the world and prescribe possible actions" (1997, p. 49). In other words, as the field is the site of integration of the external world and the memory of previous actions in it, no other stored maps or central controllers need intervene. The world and experiences in the world are the both controller and the controlled. The consequences are that the division between what is "conceptual" and what is "perceptual-motor" may be very hard to draw. Perception and mental planning contribute to a decision to act--the essence of human cognition--but the memories of actions are equally involved. Most importantly, both the history of acting and the current situation are expressed in the same action-centered dynamics. If the model captures any truth about how acts are generated, then body memory is encoded in these densely-connected fields from the start.

From this point of view, then, the critical developmental process may not be transgressing some line dividing the conceptual and the perceptual-motor--the traditional issue-- but the ability to use memory and to make decisions off-line when the situation demands. It is obvious that there are occasions when mental activity is well-served when less dominated by the immediate perceptual scene and less tied in an obligatory way to what was just done, or when immediate action is not warranted. Under such circumstances, the dynamic field might play out primarily under the influence of the experiential pre-shape, the memory of earlier responses in such situations, without the immediate input and without reaching the threshold for movement, On the other hand, people face many other occasions when a tight "clamping" to the environment, as Glenberg (1997) recently called it, is highly desired. There are circumstances when too much contemplations puts one at risk for life and limb! But everyday life is a dynamic mix between the more mental and contemplative and ongoing action. Indeed, skilled people shift rapidly and effortlessly between these modes: between acting immediately and tightly coupled to the input, and delaying action in favor of remembering and planning.

We suggest that people can shift between on-line and off-line so frequently, so rapidly, and so seamlessly only because acting and thinking are in commensurate dynamics. The alternative is that the two modes are incommensurate: symbolic and conceptual codes for the purely "mental" part and dynamics for the perceptual-motor. How do they seamlessly interface? We come up against both the symbol-grounding problem and its inverse: how do symbols (concepts) arise from perception and how does the symbolic (conceptual) code get transduced into the dynamics of movement?

Development surely involves acquiring the ability to hold aspects of the environment in memory and using those stored memories to plan new actions. But infants must also learn to act expediously when a task is better performed on-line (Ballard, et al 1997). Moreover, children must also gain the ability to switch between these modes of functioning. The lesson to be taken from the A-not-B model is that these are not wholly separate ways of functioning, but rather, emerge from a single set of dynamic interactions. It is because we express perception, memory, and action in the same graded and time-based dynamic currency that we can account for the myriad contextual effects in the A-not-B task. This same currency is the foundation for the abilities of adults to so effortlessly shift between off-line and on-line. Thus, while development brings forth new forms of behavior--for example, correct responding rather than perseveration-- the processes underlying these shifts are continuous. Indeed, the model illustrated how continuous parameter changes can generate nonlinear performance.

Finally, does this model have anything to say to Piaget's issue: when do infants acquire the object concept? We believe this question is ill-posed and cannot be answered because there is no such thing as an "object concept" in the sense of some causal structure that generates a thought or a behavior (Smith et al., 1999b). There is only "knowledge" of objects as embedded in the immediate circumstances and the history of perceiving and acting in similar circumstances. What motivates us to search for our missing keys is not some disembodied belief in the permanence of objects. Rather, it is a lifetime of discoveries that our keys have not vaporized after all and the recognition that the present situation is shaped like those in the past.

Indeed, the field model leads us back, in deference, to William James. There is no such "thing" as "an idea" said James (1890, p.236) in any pure or reified form. There are only dynamics:

"For there it is obvious and palpable that our state of mind is never precisely the same. Every thought we have of a given fact, is, strictly speaking, unique, and only bears a resemblance of kind with our other thoughts of the same fact....Often we are ourselves struck at the strange differences in our successive views of the same thing....But what here strikes us so forcibly on the flagrant scale exists on every scale, down the imperceptible transition from one hour's outlook to that of the next. Experience is remoulding us every moment, and our mental reaction on every given thing is really a resultant of our experience of the whole world up to that date." (1890, pp. 233-234).


ACKNOWLEDGMENTS

This research was supported by NIH RO1 HD22830, Indiana University, and by an NIMH Research Scientist Award to E.T. We thank Fred Diedrich, John Spencer, Melissa Wechsler-Clearfield, Jana Iverson, Dan Robin, Bob Titzer, and Dewey McLin for many contributions, both theoretical and empirical, to this work. We are especially indebted to Wolfram Erlhagen for sharing unpublished theoretical work and discussion. Fred Diedrich, Olaf Sporns, Arthur Glenberg, David Robertson, Charles Twerdy, Richard Carlson, Tim van Gelder, and several anonymous reviewers made many useful comments on the manuscript. We discussed these ideas and experiments with Jay McClelland, Liz Spelke, Marc Jeannerod, Mark Latash, and Giuseppe Pelizzer at the 1995 Workshop on Object Retrieval. A more lively and challenging group of colleagues could not be found. Finally, we are grateful to Gerald Edelman, Einar Gall, and the Neurosciences Institute for supporting the 1995 Workshop and for graciously hosting E.T. as a Visiting Fellow in Fall, 1997 where a portion of this article was written.



 

References


Acredolo, L. P. (1979) Laboratory versus home: the effect of environment on the 9-month-old infant's choice of spatial reference system. Developmental Psychology, 15:666-667.

Acredolo, L. P. (1985) Coordinating perspectives on infant spatial orientation. In: The development of spatial cognition, ed. R. Cohen. Erlbaum.

Ahmed, A. & Ruffman, T. (1998) Why do infants make A not B errors in a search task, yet show memory for the location of hidden objects in a nonsearch task. Developmental Psychology 34:441-453.

Almássy, N., Edelman, G. M., & Sporns, O. (1998) Behavioral constraints in the development of neuronal properties: A cortical model embedded in a real world device. Cerebral Cortex, 8, 346-361.

Allport, A. (1987) Visual attention. In: Foundations of cognitive science, ed. M. I. Posner, MIT Press.

Amari, S. (1977) Dynamics of pattern formation in lateral-inhibition type neural fields. Biological Cybernetics 27:77-87.

Amari, S. & Arbib, M. A. (1977). Competition and cooperation in neural nets. In J. Metzler (Ed.) Systems neuroscience. Academic Press.

Anderson, R. A., Snyder, L. H., Bradley, D. C., & Xing, J. (1997) Multimodal representation of space in the posterior parietal cortex and its use in planning movements. Annual Review of Neuroscience 20:303-330.

Baillargeon, R., & DeVos, J. (1991) Object permanence in young infants: Further evidence. Child Development 62:1227-1246.

Baillargeon, R., & Graber, M. (1988) Evidence of a location memory in 8 month-old-infants in a non-search AB task. Developmental Psychology 24:502-511.

Ballard, D. H., Hayhoe, M. M., Pook, P.K., & Rao, R. P. N. (1997) Deictic codes for the embodiment of cognition. Behavioral and Brain Sciences 20:723-767.

Bastian, A., Riehle, A., Elrhagen, W., & Schöner, G. (1998). Prior information preshapes the population representation of movement direction in motor cortex. Neuroreports, 9,315-319.

Bell, M., & Fox, N. (1992) The relations between frontal brain electrical activity and cognitive development during infancy. Child Development63:1142-1164.

Bertenthal, B. I. (1996) Origins and early development of perception, action, and representation. Annual Review of Psychology 47:431-459.

Bertenthal, B. I. & Campos, J. J. (1990) A systems approach to the organizing effects of self-produced locomotion during infancy. In: Advances in infancy research, Vol 6, eds. C. Rovee-Collier & L.P. Lipsitt. Ablex.

Boussaoud, D. (1995) Primate premotor cortex: modulation of preparatory neuronal activity by gaze angle. Journal of Neurophysiology 73:886-890.

Bremner, J. G. (1978a) Egocentric versus allocentric spatial coding in nine-month-old infants: Factors influencing the choice of code. Developmental Psychology 14:34-355.

Bremner, J. G. (1978b) Spatial errors made by infants: Inadequate spatial cues or evidence of egocentrism? British Journal of Psychology 69:77-84.

Bremner, J. G. (1985) Object tracking and search in infancy: A review of the data and a theoretical evaluation. Developmental Review 5:371-396.

Bremner, J. G. & Bryant, P. E. (1977) Place versus response as the basis of spatial errors made by young infants. Journal of Experimental Child Psychology 23:162-177.

Brooks, R. A. (1991) Intelligence without representation. Artificial Intelligence 47:139-159.

Bullock, D. & Grossberg, S. (1988) Neural dynamics of planned arm movements: Emergent invariants and speed-accuracy properties during trajectory formation. Psychological Review 95:49-90.

Butterworth, G. E. (1977) Object disappearance and error in Piaget's stage IV task. Journal of Experimental Child Psychology 23:391-401.

Butterworth, G., Jarrett, N. & Hicks, L. (1982) Spatiotemporal identity in infancy: perceptual competence or conceptual deficit? Developmental Psychology 18:435-449.

Caminiti, R., Johnson, P. B., Galli, C., Ferraina, S. & Burnod, Y. (1991) Making arm movements within different parts of space: The premotor and motor cortical representation of a coordinate system for reaching to visual targets. Journal of Neuroscience 11:1182-1197.

Castiello, U. (1996) Grasping a fruit: Selection for action. Journal of Experimental Psychology: Human Perception and Performance 22:582-603.

Chiel, H. J. & Beer, R. D. (1997) The brain has a body: Adaptive behavior emerges from interactions of nervous system, body, and environment. Trends in NeuroScience 20:553-557.

Clark, A. (1997) Being there: Putting brain, body, and world together again. MIT Press.

Clearfield, M.W., Thelen, E., & Smith, L. B. (1999). The role of specific input and delay on infant perseverative reaching errors. Presented at the Biennial Meeting, Society for Research in Child Development, Alburquerque.

Clifton, R. K., Muir, D. W., Ashmead, D. H. & Clarkson, M. G. (1993) Is visually guided reaching in infancy a myth? Child Development 64:1099-1110.

Damasio, A. R. (1994) Descartes' error: Emotion, reason, and the human brain. Putnam.

Dehaene, S. & Changeux, J-P. (1989) A simple model of prefrontal cortex function in delayed-response tasks. Journal of Cognitive Neuroscience 1:244-261.

Desmurget, M., Prablanc, C., Rossetti, Y., Arzi, M., Pauligan, Y. Urquizar, C., & Mignot, J. C. (1995) Postural and synergic control for three-dimensional movements for reaching and grasping. Journal of Neurophysiology 74:905-910.

Diamond, A. (1985) Development of the ability to use recall to guide action, as indicated by infants' performance on AB. Child Development 56:868-883.

Diamond, A. (1990a) Development and neural bases of AB and DR. In: The development and neural bases of higher cognitive functions, ed. A. Diamond. New York Academy of Sciences.

Diamond, A. (1990b) Developmental time course in human infants and infant monkeys and the neural bases of inhibitory control in reaching. In: The development and neural bases of higher cognitive functions, ed. A. Diamond. New York Academy of Sciences.

Diamond, A. (1997) The role of motivation in performance of the A-not-B task. Manuscript submitted for publication.

Diamond, A. & Goldman-Rakic, P. S. (1989) Comparison of human infants and rhesus monkeys on Piaget's AB task: Evidence for dependence on dorsolateral prefrontal cortex. Experimental Brain Research. 74:24-40.

Diamond, A., Prevor, M. B., Callendar, G. & Druin, D. P. (1997) Prefrontal cortex cognitive deficits in children treated early and continuously for PKU. Monographs of the Society for Research in Child Development 252:4.

Diedrich, F. J., Highlands, T., Thelen, E. & Smith, L. B. (1999) The role of target distinctiveness in infant perseverative reaching errors. Manuscript submitted for publication. Diedrich, F. J., Thelen, E., Smith, L. B. & Corbetta, D. (In press) Motor memory is a factor in infant perseverative errors. Developmental Science.

Edelman, G.M. (1987) Neural Darwinism: The theory of neuronal group selection, Basic Books.

Enright, J. T. (1995) The non-visual impact of eye orientation on eye-hand coordination. Vision Research 35:1611-1618.

Erlhagen, W. & Schöner, G. (1999) Dynamic field theory of motor programming. Manuscript submitted for publication.

Fisk, J. D. & Goodale, M. A. (1985) The organization of eye and limb movements during unrestricted reaching in targets in contralateral and ipsilateral visual space. Experimental Brain Research 60:159-178.

Fogel, A. (1993). Developing through relationships: Communication, self, and culture in early infancy. Cambridge, England: Harvester Press.

Fortier, P.A., Kalaska, J. F. & Smith, A. M. (1989) Cerebellar neuronal activity related to whole-arm reaching movements in the monkey. Journal of Neurophysiology 62:198-211.

Freeman, W. & Skarda, C.(1985) Spatial EEG patterns, nonlinear dynamics, and perception: The neo-Sherringtonian view. Brain Research Reviews 10:145-175.

Fu, Q.-G., Suarez, J. I., & Ebner, T. J. (1993). Neuronal specification of direction and distance during reaching movements in the superior precentral premotor area and primary motor cortex of monkeys. Journal of Neurophysiology, 70, 2097-2116.

Fuster, J. M. (1989) The prefrontal cortex: Anatomy, physiology, and neuropsychology of the frontal lobe, 2nd edition. Raven Press.

Gelman, R. (1979). Preschool thought. American Psychologist, 34, 900-905.

Georgopoulos, A. P.(1986) On reaching. Annual Review of Neurosciences. 9:147-70.

Georgopoulos, A. P.(1990) Neurophysiology of reaching. In: Attention and performance XIII, ed. M. Jeannerod. Erlbaum.

Georgopoulos, A. P. (1991). Higher order motor control. Annual Review of Neurosciences 14: 361-77.

Georgopoulos, A. P. (1995). Motor cortex and cognitive processing. In: The cognitive neurosciences. ed. M. S. Gazzaniga. MIT Press.

Georgopoulos, A. P., Ashe, J., Smyrnis, N. & Taira, M. (1992) Motor cortex and the coding of force. Science 256:1692-1695.

Georgopoulos, A. P., Crutcher, M. D. & Schwartz, A. B. (1989) Cognitive spatial-motor processes. 3. Motor cortical prediction of movement direction during an instructed delay period. Experiment Brain Research 75:183-194.

Georgopoulos, A. P., Kettner, R. E. & Schwartz, A. B. (1988) Primate motor cortex and free arm movements to visual targets in three-dimensional space. II. Coding of the direction of movement by a neuronal population. Journal of Neuroscience 8:2928-2937.

Georgopoulos, A. P., Lurito, J. T., Petrides, M., Schwartz, A. B. & Massey, J. T. (1989) Mental rotation of the neuronal population vector. Science 243:234-236.

Ghez, C., Gordon, J., Ghilardi, M. F., & Sainberg, R. (1995) Contributions of vision and proprioception to accuracy in limb movements. In: The cognitive neurosciences. ed. M. S. Gazzaniga. MIT Press.

Ghilardi, M. F., Gordon, J. & Ghez, C. (1995) Learning a visuomotor transformation in a local area of work space produces directional biases in other areas. Journal of Neurophysiology 73:2535-2539.

Gibson, E. J. (1969) Principles of perceptual learning and development. Prentice-Hall.

Glenberg, A. M. (1997) What memory is for. Behavioral and Brain Sciences 20:1-56.

Goodale, M. A., Pélisson, D. & Prablanc, C. (1986) Large adjustments in visually guided reaching do not depend on vision of the hand or perception of target displacement. Nature 320:748-750.

Gratch, G., Appel, K. J., Evans, W. F., LeCompte, G. K. & Wright, N. A. (1974) Piaget's stage IV object concept error: Evidence of forgetting or object conception. Child Development 45:71-77.

Graziano, M. S. A., Hu, X. T. & Gross, C. G. (1997) Coding the locations of objects in the dark. Science 277:239-241.

Grossberg, S. (1973). Countour enhancement, short term memory and constancies in reverberating neural networks. Studies in Applied Mathematics, 52, 217-257.

Grossberg, S. (1980). Biological competition: Decision rules, pattern formation, and oscillations. Proceedings of the National Academy of Sciences (USA), 77, 2338-2342.

Grossberg, S. (1988). Nonlinear neural netmworks: Principles, mechanisms, and architectures. Neural Networks, 1, 17-61.

Harnad, S. (1990) The symbol grounding problem. Physica D 42:335-346.

Harris, P. L. (1973) Perseverative errors in search by young infants. Child Development 44:28-33.

Harris, P.L. (1987) The development of search. In: Handbook of infant perception. Vol. 2, ed. P. Salapatek & L. B. Cohen. Academic Press.

Hening, W., Favilla, M. & Ghez, C. (1988a) Trajectory control in targeted force impulses. V. Gradual specification of response amplitude. Experimental Brain Research 71: 116-128.

Hening, W., Vicario, D. & Ghez, C. (1988b) Trajectory control in targeted force impulses. IV. Influence of choice, prior experience, and urgency. Experimental Brain Research 71:103-115.

Hofstadter, M. C. & Reznick, J. S. (1996) Response modality affects human infant delayed-response performance. Child Development 67:646-658.

Hofsten, C. von (1983) Catching skills in infancy. Journal of Experimental Psychology: Human Perception and Performance 9:75-85.

Hofsten, C. von (1991) Structuring of early reaching movements: A longitudinal study. Journal of Motor Behavior 23:280-292.

Hommel, B. (1996). The cognitive representation of action: Automatic integration of perceived action effects. Psychological-Research/Psychologische-Forshung 59:176-186.

Hornik, K., Stinchcombe, U., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2, 359-366.

Horobin, K. & Acredolo, L. (1986) The role of attentiveness, mobility history, and separation of hiding sites on Stage IV search behavior. Journal of Experimental Child Psychology

Houk, J. C., Keifer, J. & Barto, A. G. (1995). Distributed motor commands in the limb premotor network. Trends in Neurosciences, 16: 27-33.

Jackson, S. R., Jackson, G. M. & Rosicky, J. (1995) Are non-relevant objects represented in working memory? The effect of non-target objects on reach and grasp kinematics. Experiment Brain Research 102:519-530.

James, William (1897,1984) The essential writings. Edited by B. Wilshire. Albany: State University of New York Press.

Jeannerod, M. (1997) The Cognitive Neuroscience of Action. Oxford: Blackwell.

Johnson, M. (1987) The body in the mind: The bodily basis of meaning, imagination, and reason. University of Chicago Press.

Jordan, M. I. (1990). Motor learning and the degrees of freedom problem. In M. Jeannerod (Ed.), Attention and Performance XIII (pp. 796-836). Hillsdale, NJ: Erlbaum.

Kalaska, J. F., Caminiti, R. & Georgopoulos, A. P. (1983) Cortical mechanisms related to the direction of two-dimensional arm movements: relations in parietal area 5 and comparison with motor cortex. Experimental Brain Research 51:247-260.

Kalaska, J. F. & Crammond, D. J. (1995) Deciding not to GO: Neuronal correlates of response selection in a GO/NOGO task in primate premotor and parietal cortex. Cerebral Cortex 5:410-428.

Keele, S. W. (1981) Behavioural analysis of movement. In: Handbook of physiology. Sect. 1 The nervous system. Vol. II. Motor control, Part 2 , ed. V. B. Brooks. American Physiological Society.

Kelso, J. A. S. (1995) Dynamic patterns: The self-organization of brain and behavior. MIT Press.

Kettner, R. E., Marcario, J. K. & Port, N. L. (1996) Control of remembered reaching sequences in monkey. II. Storage and preparation before movement in motor and premotor cortex. Experimental Brain Research 112:347-358.

Kermoian, R. & Campos, J. J. (1988) Locomotor experience: A facilitator of spatial cognitive development. Child Development 59:908-917.

Koch, C. & Davis, J. L., eds. (1994) Large-scale neuronal theories of brain function. MIT Press.

Kopecz, K. & Schöner, G. (1995) Saccadic motor planning by integrating visual information and pre-information on neural, dynamic fields. Biological Cybernetics 73:49-60.

Kopecz, K., Schöner, G. & Erlhagen, W. (1995) Dynamic representations provide the gradual specifications of movement parameters. In: Neural networks: Artificial intelligence and industrial applications, ed. B. Kappen & S. Gielen.

Lackner, J. R. & DiZio, P. (1994) Rapid adaptations to Coriolis force perturbations of arm trajectory. Journal of Neurophysiology 72:299-313.

Lakoff, G. (1987a) Women, fire, and dangerous things: What categories reveal about the mind. University of Chicago Press.

Lakoff, G. & Johnson, M. (1980) Metaphors we live by. University of Chicago Press.

Lacquaniti, F., Guigon, E., Bianchi, L., Ferraina, S. & Caminiti, R. (1995) Representating spatial information for limb movements. Role of area 5 in the monkey. Cerebral cortex 5:391-409.

Lewin, K. (1946) Behavior and development as a function of total situation. In: Manual of child psychology, ed. L. Carmichael. Wiley.

Malone, S.M. (1994) The prefrontal cortex and its functions: Toward a more developmental neuroscience. Unpublished mss. University of Minnesota.

Markcovitch, S. & Zelazo, P. D. (1999). The A-not-B error: Results from a logistic meta-analysis. Child Development, 70, 1297-1313.

Merleau-Ponty, M. (1963) The structure of behavior. Trans. C. Smith. Routledge and Kegan Paul.

Miller, E. K., Erickson, C. A. & Desimone, R. (1996) Neural mechanisms of visual working memory in prefrontal cortex of the macaque. Journal of Neuroscience 16:5154-5167.

Mitz, A. R., Gottschalk, M. & Wise, S. P. (1991) Learning-dependent neuronal activity in the premotor cortex: Activity during the acquisition of conditional motor associations. Journal of Neuroscience 11:1855-1872.

Muesseler, J. & Hommel, B. (1997) Blindness to response-compatible stimuli. Journal of Experimental Psychology: Human Perception and Performance 23:861-872.

Munakata, Y. (1997) Perseverative reaching in infancy: The role of hidden toys and motor history in the AB task. Infant Behavior and Development 20:405-416.

Munakata, Y. (1998) Infant perseveration and implications for object permanence theories: A PDP model of the AB task. Developmental Science 1:161-184.

Munakata. Y., McClelland, J. L., Johnson, M. H. & Siegler, R. S. (1997) Rethinking infant knowledge: Toward an adaptive process account of successes and failures in object permanence tasks. Psychological Review 104:686-713.

Pelligrino, G. & Wise, S. P. (1993) Visuospatial versus visuomotor activity in the premotor and prefrontal cortex of a primate. Journal of Neuroscience 13:1227-1243.

Pfeiffer, R. & Scheier, C. (In press) Understanding intelligence. Cambridge, MA: MIT Press.

Piaget, J. (1954) The construction of reality in the child. Basic Books.

Piaget, J. (1952) The origins of intelligence in children. International Universities Press.

Port, R. & van Gelder, R., eds. (1995) Mind as motion: Explorations in the dynamics of cognition. MIT Press.

Poulton, E. C. (1981) Human manual control. In: Handbook of physiology. Sect. 1 The nervous system. Vol. II. Motor control, Part 2, ed. V. B. Brooks. American Physiological Society.

Prablanc, C., Echallier, J. F., Komilis, E. & Jeannerod, M. (1979) Optimal response of hand and eye motor systems in pointing at a visual target. I. Spatio-temporal characteristics of eye and hand movements and their relationships when varying the amount of visual information. Biological Cybernetics 35:113-124.

Prablanc, C. & Martin, O. (1992) Autonomous control during hand reaching at undetected two-dimensional target displacements. Journal of Neurophysiology 67:455-469.

Rizzolatti, G., Fadiga, L., Fogassi, L. & Gallese, V. (1997) The space around us. Science 277:190-191.

Rosenbaum, D. A., Loukopoulos, L. D., Meulenbroek, R. G. J., Vaughan, J. & Englebrecht, S. E. (1995) Planning reaches by evaluating stored postures. Psychological Review 102:28-67.

Rossetti, Y., Stelmach, G. E., Desmurget, M., Prablanc, C., & Jeannerod, M. (1994) Influence of viewing the static hand prior to movement onset on pointing kinematics and accuracy. Experimental Brain Research, 101: 323-330.

Schöner, G. (1995) Recent developments and problems in human movement science and their conceptual implications. Ecological Psychology 7:291-314.

Schöner, G., Dose, M., & Engels, C. (1995). Dynamics of behavior: theory and applications for autonomous robot architectures. Robotics and Autonomous Systems 16, 213-245.

Schöner, G., Kopecz, K. & Erlhagen, W. (1997) The dynamic neural field theory of motor programming: Arm and eye movements. In: Self-organization, computational maps, and motor control, eds. P. G. Morasso & V. Sanguineti. Elsevier-North Holland.

Schwartz, A. (1994) Distributed motor processing in cerebral cortex. Current Opinion in Neurobiology 4:840-846.

Scott, S. H. & Kalaska, J. F. (1997). Reaching movements with similar hand paths but different arm orientations. I. Activity of individual cells in motor cortex. Journal of Neurophysiology 77:826-852.

Scott, S. H., Sergio, L. E. & Kalaska, J. F. (1997) Reaching movements with similar hand paths but different arm orientations. II. Activity of individual cells in dorsal premotor cortex and parietal area 5. Journal of Neurophysiology 78:2413-2426.

Shadmehr, R. & Mussa-Ivaldi, F. A., (1994) Adaptive representation of dynamics during learning of a motor task. Journal of Neuroscience 14:3208-3224.

Sheets-Johnstone, M. (1990). The roots of thinking. Philadelphia: Temple University Press.

Shen, L. & Alexander, G. E. (1997) Preferential representations of instructed target location versus limb trajectory in dorsal premotor area. Journal of Neurophysiology 77:1195-1212.

Shen, L. & Alexander, G. E. (1997) Neural correlates of a spatial sensory-to-motor transformation in the primary motor cortex. Journal of Neurophysiology 77:1171-1194.

Singer, W. (1990) Search for coherence: A basic principle of cortical self-organization. Concepts in Neuroscience 1:1-26.

Smith, L. B. & Scheier, C. (1998) Babies have bodies: Why Munakata's net fails to meet its own goals. Developmental Science, 1, 202-204.

Smith, L. B., Spencer, J., & Thelen, E. (1999a). General processes of goal-directed reaching make the A-not-B error. Presented at the Biennial Meeting, Society for Research in Child Development, Alburquerque, April, 1999.

Smith, L. B., Thelen, E., Titzer, R. & McLin, D. (1999b) Knowing in the context of action: The task dynamics of the A-not-B error. Psychological Review, 106, 235-260.

Smyrnis, N.,Taira, M., Ashe, J. & Georgopoulos, A. P. (1992) Motor cortical activity in a memorized delay task. Experimental Brain Research 92:139-151.

Sophian, C. (1985) Perseveration and infants' search. A comparison of 2- and 3-location tasks. Developmental Psychology 21:187-194.

Spelke, E. S. (1990). Principles of object perception. Cognitive Science, 14, 29-56.

Spencer, J. P. (1999) The delay-dependent dynamics of location memory. Manuscript submitted for publication.

Talmy, L. (1988) Force dynamics in language and cognition. Cognitive Science 12:49-100.

Thelen, E. (1995) Time scale dynamics and the development of an embodied cognition. In: Mind as motion: Explorations in the dynamics of cognition, eds. R. Port & T. van Gelder. MIT Press.

Thelen, E., Corbetta, D. & Spencer, J. P. (1996) The development of reaching during the first year: The role of movement speed. Journal of Experimental Psychology: Human Perception and Performance 22:1059-1076.

Thelen, E. & Smith, L. B. (1994) A dynamic systems approach to the development of perception and action. MIT Press.

Tipper, S. P., Lortie, C. & Baylis, G. C. (1992) Selective reaching: evidence for action-centered attention. Journal of Experimental Psychology: Human Perception and Performance 18:891-905.

Turvey M. T. (1990) Coordination. American Psychologist 45:938-953.

van Gelder, T. (1998) The dynamical hypothesis in cognitive science. Behavioral and Brain Sciences, 21: 615-628.

Varela, F. J., Thompson, E. & Rosch, E.(1991) The embodied mind: Cognitive science and human experience. MIT Press.

Wellman, H. M., Cross, D. & Bartsch, K. (1986) Infant search and object permanence: A meta-analysis of the A-not-B error. Monographs of the Society for Research in Child Development 54: No. 214.

Wilson, H. R., & Cowan, J. D. (1973). A mathematical theory of the functional dynamics of cortical and thalamic nervous tissue. Kybernetik, 13, 55-80.



Notes

1. In informal experiments, we noticed that infants have a strong predilection to reach for whatever is closest to them. Gavin Bremner (personal communication to ET) also observed this. return to text