To be published in Behavioral and Brain Sciences (in press)
© Cambridge University Press 2002



Below is the unedited, uncorrected final draft of a BBS target article that has been accepted for publication. This preprint has been prepared for potential commentators who wish to nominate themselves for formal commentary invitation. Please DO NOT write a commentary until you receive a formal invitation. If you are invited to submit a commentary, a copyedited, corrected version of this paper will be posted.


Working Memory Retention Systems: A State of Activated Long-Term Memory

 

Daniel S. Ruchkin

University of Maryland

School of Medicine

Department of Physiology

Program in Neurosciences

Baltimore, MD USA

 

Jordan Grafman

National Institutes of Health

Cognitive Neuroscience Section

NINDS

Bethesda, MD USA

 

Katherine Cameron

Washington College

Department of Psychology

Chestertown, MD USA

 

Rita S. Berndt

University of Maryland

School of Medicine

Department of Neurology

Program in Neurosciences

Baltimore, MD USA

 

 

Short Abstract: Electrophysiological studies of short-term storage in working memory indicate that there is 1) sustained co-activation of pre-frontal cortex and posterior cortical systems involved in the initial encoding of the retained information, 2) an increase in neural synchrony between pre-frontal cortex and posterior cortex, and 3) enhanced activation of the long-term memory representations of the material maintained in working memory. A parsimonious interpretation of these findings is that activation of long-term memory associated with posterior cortical encoding systems provides the representational basis for working memory. In this view, there is no reason to posit specialized neural networks whose functions are limited to that of short-term storage buffers.

 

Abstract: High-temporal resolution event-related brain potential (ERP) and electroencephalographic (EEG) coherence studies of the neural substrate of short-term storage in working memory indicate that the sustained co-activation of both pre-frontal cortex and the posterior cortical systems that participate in the initial perception and comprehension of the retained information are involved in its storage. These studies further show that short-term storage mechanisms involve an increase in neural synchrony between pre-frontal cortex and posterior cortex, and enhanced activation of the long-term memory representations of the material held in short-term memory. This activation begins during the encoding/comprehension phase and evidently is prolonged into the retention phase by attentional drive from pre-frontal cortex control systems.

 

A parsimonious interpretation of these findings is that the long-term memory systems associated with the posterior cortical processors provide the necessary representational basis for working memory, with the property of short-term memory decay being due to, primarily, the posterior system. In this view, there is no reason to posit specialized neural systems whose functions are limited to that of short-term storage buffers. Pre-frontal cortex provides the attentional pointer system for maintaining activation in the appropriate posterior processing systems. Limitations on the number of pointers that can be sustained by the pre-frontal control systems determines short-term memory capacity and phenomena such as displacement of information in short-term memory.

 

Keywords: Coherence; Event-Related Potentials; Short-Term Storage; Verbal; Visuo-Spatial; Working Memory

 

 

1. Introduction

 

1.1. History of the working memory model

 

Working memory refers to the collection of cognitive systems thatmaintain task-relevant information in an active state during the performance of a task. It functions as a "work space" in which recently acquired sensory information and information from long-term memory are processed for further action (e.g., storage, computation, decision-making). The construct of working memory evolved from previously developed models of memory systems that postulated a distinct short-term store, such as Atkinson and Shiffrin’s (1968) modal model. In the moda1 model the short-term memory system received input from sensory stores and transferred information to and from long-term stores. While the modal model accounted for a number of empirical results, it did not provide an accurate account of how short-term and long-term memory interacted, did not correctly predict performance for certain dual task experiments, and did not provide an adequate explanation for the memory performance of amnesiacs. In their seminal and influential model of working memory, Baddeley and Hitch (1974) resolved many of these short-comings by postulating a multicomponent working memory system consisting of a central executive that controlled conscious processing, with access to a pair of subsystems that temporarily stored phonological and visuo-spatial information. Baddeley (2000) recently revised this model, postulating a third short-term storage subsystem, an episodic buffer that forms an interface between the short-term phonological store, the short-term visuo-spatial store, and long-term memory. The episodic store augments working memory storage capacity, holding integrated material such as scenes and events. The central executive is regarded as a controller of deployment of attention, with no storage capacity. A key aspect of Baddeley’s model is that the various subsystems draw upon different processing resources and can, to some extent, function independently of each other.

 

A number of other models of working memory have been proposed since the initial Baddeley and Hitch (1974) paper. Although the models differ, most, but not all, share Baddeley’s view of multiple subsystems and temporary stores based upon modality-specific codes (see Miyake and Shah (1999) for discussions and comparisons of various current models of working memory). An important distinction among the various conceptualizations of working memory is in how short-term storage is implemented and its relationship to long-term memory. Baddeley posited that the working memory short-term storage modules are separate from long-term memory storage modules (1986; 2001; 2002). This view is based upon neuropsychological dissociations between performance on tasks involving either primarily short-term memory or primarily long-term memory resources. Baddeley (1999) suggested that information not accommodated by the working memory short-term stores (e.g., lexical and semantic information in verbal working memory) contributes to working memory performance via activation of representations in long-term memory. Hulme et al. (1997) and Saint-Aubin and Poirer (1999) further articulated this idea, proposing that lexical and semantic contributions to serial recall in verbal short-term memory are via a redintegration process that reconstructs degraded phonological codes during retrieval.

 

1.2. Short-term memory as activated long-term memory

 

Investigators such as Crowder (1993) and Cowan (1995; 1999; 2001) have been proponents of a contrasting view of short-term memory operation, namely that “long-term memory” and “short-term memory” are different states of the same representations, with activated representations in long-term memory constituting all of short-term memory. Based on findings such as the occurrence of serial position and recency effects in both short-term and longterm memory tasks, Crowder (1993) argued that short-term and long-term memory follow similar rules, and hence there was no reason to postulate separate long- and short-term storage systems. In Crowder’s view, memory storage takes place in the same neural structures in which the information was initially processed. Fuster (1995; 1997) has taken the same proceduralist position, based upon observations of single neuron activity in primates during short-term memory tasks. Fuster (1995) commented that we are dealing with “the memory of systems, not ... systems of memory.”

 

Cowan’s (1988; 1995; 1999) views are similar to Crowder’s, namely that short-term memory stores are constituted by an activated subset of long-term memory. Cowan argues for the construct that short-term memory involves all information accessed by a task, including 1) activated memory in the focus of attention, 2) activated memory not in the focus of attention, and 3) inactive memory accessible by activated retrieval cues. Short-term auditory sensory memory processes in experiments involving presentation of multiple streams of stimuli are examples of the latter two types of activation (Cowan, 1984; Cowan, 1995). Deployment of attention is a crucial feature of Cowan’s model of short-term memory, with attention sustaining and limiting activation of long-term memory. In Cowan’s model the capacity limitation of short-term memory is due to the limited capacity of the focus of attention (1999).

 

With respect to the role of sensory information in short-term memory, Penny (1989) has hypothesized that verbal short-term memory involves, in addition to phonological codes, contributions from modality-specific auditory and visual codes. A number of lines of evidence support Penny’s view: 1) memory is improved when different items are presented in different modalities than when all items are presented in the same modality; 2) recall is enhanced when items are organized by modality than by time of presentation; 3) two concurrent verbal tasks can be more effectively performed when different input modalities are utilized in comparison with only one input modality.

 

Baddeley (2001) claimed that construing short-term memory as activated long-term memory is inconsistent with neuropsychological data, since there are individuals with long-term memory deficits but not short-term memory deficits, and individuals with short-term memory deficits but not long-term memory deficits. However, Cowan (1999) has argued that long-term memory deficits are not necessarily due to damaged stores. Rather, such deficits can be due to impaired binding processes involving hippocampal-neocortex connections responsible for eliciting simultaneous activations across long-term stores that lead to an episode being stored (Rickard & Grafman, 1998). Activation of individual stores that accompanies short-term retention of information can be preserved in amnesia.

 

With regard to patients with short-term but not long-term memory deficits, Vallar and Baddeley (1984) reported an individual whose span in verbal serial recall tests was below the normal range, but whose performance on word learning, paired-associates learning and short-story learning tests was within the normal range. The poor span in serial recall was attributed to an impaired phonological short-term store. While Baddeley (2001) interprets these results as evidence for distinct short-term and long-term phonological stores, the normal performance on the learning tests of long-term memory may have been due to lexical, semantic and syntactic processes invoked by the learning tasks that compensated for the impaired phonological processing. Nevertheless, if verbal short-term memory representations are activated verbal long-term memory representations, then deficits in verbal short-term memory for specific types of representations should be indicative of impairments in establishing long-term memories for those representations. Romani and Martin (1999) have reported that individuals with a semantic short-term memory deficit also have problems forming semantic but not phonological long-term memories, whereas individuals with a phonological short-term memory deficit show the reverse pattern of difficulty. Thus, when the nature of the representations are taken into account, the neuropsychological evidence for distinct short-term and long-term memory stores is not compelling.

 

1.3. Episodic memory

 

While activation of long-term memory representations of items contributes to their retention in short-term memory, it does not account for all aspects of the retention process. Serial recall of the order of events involves conjunctions of representations that are not likely to be retained and recalled by activation of the representations alone. Both Cowan (1995; 1999; 2001) and Baddeley (2000; 2001) proposed that retention of serial order information involves the formation of new episodic links between the activated representations of the items held in short-term memory. Baddeley (2001) further argued that there is a distinct short-term store for episodic links, citing a fMRI study by Prabhakaran et al. (2000) as providing neural evidence of such a store. Prabhakaran et al. (2000) found greater activation in right prefrontal cortex when integrated (i.e., requiring binding of individual items) rather than unintegrated visual stimuli were used in a short-term memory task. The authors’ claimed that this was evidence for a ‘buffer’ for the temporary retention of integrated information. We suggest that a more precise interpretation of the fMRI data is that the right prefrontal cortex participates in the process of maintaining binding information in an active state. The question of whether there is a separate store for binding information, or whether the binding ‘representations’ are based upon the same neural structures that, with consolidation, become part of the long-term memory representations for the bindings, was not resolved by Prabhakaran et al.’s (2000) findings. Our view is that the neural connections underlying the binding processes that produce episodic links are the basis for both shortterm and long-term episodic memory. Recall and maintenance of episodic information involves activation of the binding ciruitry, while retention of novel episodic information involves the operation of binding formation and the initial consolidation process. In either case, the same neural connections are involved.

 

1.4. Scope of reviewed research supporting activation models of short-term memory

 

In contrast with Baddeley’s claim that short-term and long-term memory memory stores are distinct, this article will argue for the view that short-term memory corresponds to activated long-term memory and that information is stored in the same systems that initially processed the information. On theoretical grounds, activation-proceduralist models have the advantage of parsimony with respect to models that postulate distinct short-term and longterm memory stores. On empirical grounds, there are electrophysiological and hemodynamic imaging data, reviewed in this article, from normal, intact humans that substantiate activation models.

 

Research into short-term memory has utilized cognitive experiments, studies of patients, and functional neuroimaging techniques to motivate an understanding of how information is retained over short periods of time and which brain areas are crucial for encoding, retaining, and retrieving information held in short-term memory. What has been lacking in these studies is accurate information regarding the timing and duration of the various processes enlisted in short-term memory operations. Hence, in this paper, we will concentrate upon studies that employed high temporal resolution event-related potentials (ERPs) to provide information about the timing of brain processes involved in short-term memory operations. These studies have provided unique and novel information on the mechanisms employed in active maintenance and how networks involved in short-term storage operations map onto networks involved in the perception, encoding and determination of meaning. We will also review hemodynamic imaging studies that have provided key anatomical information that complements the ERP findings. The results of these varies studies are most compatible with models of short-term storage operations that stress sustained activation of the perceptual and associated long-term memory systems involved in the initial bottom-up processing of information, and posit an important role for attentional systems in the maintenance process (e.g. Cowan (1999)), as opposed to models that stress buffers based upon neural systems that are specialized for short-term storage (e.g. Baddeley (1999)).

 

Lesion and hemodynamic imaging investigations have produced abundant converging data on the location of brain regions that contribute to short-term memory operations (Vallar, De Betta, & Silveri, 1997; Vallar & Papagno, 1995; Cabeza & Nyberg, 1997; Smith & Jonides, 1999; Cabeza & Nyberg, 2000). The data have consistently supported models of short-term memory that are based upon multiple subsystems and modality-specialized temporary stores. In this context, we note that findings of multiple, modality-specialized short-term stores are fully compatible with the position that short-term memory corresponds to activated long-term memory representations, given that longterm memory involves multiple, modality-specialized stores.

 

Tasks that entail manipulation of information and updating memory, functions of the postulated central executive, evidently involve multiple sites in frontal cortex (D'Esposito et al., 1995; Manoach et al., 1997; Postle, Berger, & D'Esposito, 1999; Owen, 1997). Encoding and storage of phonological information involves left parietal and left frontal regions that underlie language processing and speech production (Paulesu, Frith, & Frackowiak, 1993; Awh et al., 1996; Awh, Smith, & Jonides, 1995; Jonides et al., 1998; Henson, Burgess, & Frith, 2000), while encoding and storage of visuo-spatial information engages ventral (inferior temporal cortex) and dorsal (posterior parietal cortex) visual processing pathways involved in perceptual processing (Jonides et al., 1993; Courtney, Ungerleider, Keil, & Haxby, 1996; Courtney, Ungerleider, Keil, & Haxby, 1997; Haxby, Petit, Ungerleider, & Courtney, 2000; Awh & Jonides, 2001). Such studies have been very useful in mapping the cognitive architecture of human short-term memory to specific brain regions. However, the capability of hemodynamic imaging to provide direct, detailed information on the timing of neural processing underlying the operation of short-term memory is limited, since hemodynamic responses can be substantially delayed and prolonged in comparison with neural and behavioral responses.

 

The complementary approach of using high temporal resolution techniques such as electroencephalographic (EEG) and/or magnetoencephalographic (MEG) recordings from scalp can be useful when dealing with timing issues. Although the capability for determining the locations of brain activation from scalp recordings is limited, EEG and MEG recordings can provide real-time measures of brain activity with sub-millisecond accuracy. ERPs extend knowledge gained from hemodynamic imaging studies by providing a detailed tracking of the time-course of brain activation across encoding, retention, and retrieval phases of an eliciting event. Long duration, sustained processes and short, limited duration transient processes can be distinguished with ERPs. In addition, through the comparison of rapid fluctuations in EEG recordings from multiple sites, EEG coherence analysis provides information on the degree of neural synchronization between brain regions within a specific time interval and frequency band. EEG coherence measures provide an approach for investigating interactions among brain regions during short-term memory operations, thereby further extending knowledge gained from ERP studies. Issues such as whether cognitive processes that operate in parallel interact can be best addressed by coherence methods.

 

The ERP studies considered below usually employed either delayed matchto-sample or delayed serial recall paradigms, with delay intervals in the range of 3000 to 4000 ms. The strategy in these studies was to manipulate the type and/or amount of information held in short-term memory and test whether the manipulation produced differences in brain activity during the delay interval. Differences in the delay interval of ERP timing and/or amplitude as a function of the information held in short-term memory indicate that brain activity during retention is sensitive to such information. Finding that brain activity during retention is influenced by the type of information held in short-term memory is interpreted as evidence that the information is being held in an active state during retention. Differences between conditions in ERP scalp topography further indicate that the anatomical configuration of the generators of the brain activity differs between the conditions. (A brief discussion of topography and estimation of the brain sources of scalp recorded ERP activity is presented in the Appendix.) Consequently, variations in the delay interval of ERP topography with the manipulation of information maintained in short-term memory are interpreted as evidence that the configuration of brain systems active during retention varies with the nature of the information. Coherences between recording sites reflect the pattern and degree of connectivity between brain regions. Thus, differences between coherences as material maintained in short-term memory is manipulated are interpreted as evidence that connectivity between brain regions active during retention is sensitive to the properties of the maintained material.

 

The results of the ERP studies reviewed below indicated that short-term retention processes involve sustained activation of both frontal cortical control systems and posterior cortical systems involved in perception and comprehension of visuo-spatial and linguistic information, with enhanced neural synchrony between the frontal and posterior systems during retention. They further indicated a greater diversity and specialization of retention processes than originally proposed by Baddeley and Hitch. For visual stimuli, in addition to the demonstration of separate sustained storage systems for visual-object and visual-spatial material, there is also evidence for transient, intermediate duration storage systems. For language stimuli, in addition to phonological codes, there is evidence that lexical-semantic and modality-specific codes actively contribute to the retention process, as opposed to only during recall by redintegration. The temporal morphology of the ERPs indicated that brain regions active during initial processing (prior to the retention interval) remain active during the retention interval, supporting proceduralist-activation models of memory proposed by Crowder (1993), Fuster (1997) and Cowan (1995; 1999). The behavior of an ERP deflection sensitive to priming indicated that consciously maintaining items in memory raises the level of activation of the long-term representations of the items to above the level reached by priming due to processing the items but not consciously holding them in memory. This finding provides strong support for the notion that activation of long-term memory representations is the root of short-term memory performance. We also review complementary hemodynamic imaging studies that provide anatomical support for proceduralist, activation models and the idea that posterior cortex provides the representational basis for most short-term memory operations while pre-frontal cortex provides the attentional control.

 

2. Retention of visuo-spatial information in short-term memory

 

Numerous studies have shown that the visual system involves, beyond primary visual cortex, separate cortical pathways for the perception of object (ventral pathway) and spatial (dorsal pathway) information (Grady et al., 1992; Hanley, Young, & Pearson, 1991; Harter & Aine, 1984; Mangun, Hillyard, & Luck, 1993; Rosler, Heil, & Hennighausen, 1995; Ungerleider & Mishkin, 1982; Van Essen, Anderson, & Felleman, 1992). Both pathways include extra-striate cortex. The ventral pathway also includes inferior temporal cortex. The dorsal pathway also includes posterior parietal cortex. Results from hemodynamic (Smith et al., 1995) and ERP (Mecklinger & Pfeifer, 1996; Ruchkin, Johnson, Jr., Grafman, Canoune, & Ritter, 1997b) studies indicated that visual short-term memory divides along similar lines. However, the fMRI studies suggested that only perceptual and transient storage operations occur in posterior visual processing pathways (Haxby et al., 2000), while sustained storage occurs in pre-frontal cortex. In contrast, the ERP studies indicated that both transient and sustained storage operations occur in the posterior visual processing pathways. The ERPs showed that during retention there is sustained activation in brain regions underlying posterior and temporal scalp, with the amplitude of the sustained brain activity varying directly with memory load.

 

These results were obtained in delayed-match-to-sample tasks with linear arrays of geometric objects (object task) and two-dimensional patterns of randomly placed squares (spatial task) (Mecklinger & Pfeifer, 1996), or with schematic faces (object task) and the motion of an asterisk (spatial task) (Ruchkin et al., 1997b). The topographies of the scalp ERP activity in the retention interval of the Ruchkin et al. (1997b) study were sharply focused over parietal scalp in spatial tasks and more broadly distributed over parietal-to-frontal scalp in the object tasks (see Fig. 1), consistent with the sources of the scalp ERP activity during retention primarily involving the dorsal pathway for spatial information and the ventral pathway for object information. It has been further demonstrated that the patterns of ERP activity found in the retention interval of visual short-term memory tasks did not occur in control tasks that involved similar encoding and response processing, but had negligible memory demand (Ruchkin, Canoune, Johnson, Jr., & Ritter, 1995; Low et al., 1999).

 

 

Figure 1.  Estimated current source density (CSD) maps for the object and spatial tasks of the scalp topography of ERP activity at the end of the retention interval (3010 ms to 3550 ms after stimulus offset). The difference between contour lines corresponds to a current density increment of 1µV/cm2. The current source densities were derived from the across-subjects averaged ERP amplitudes. In this figure and subsequent figures with maps, The shaded areas of the maps indicate positive amplitudes and the unshaded areas indicate negative amplitudes. The maps are 90_ projections with the front of the head at the top. Electrode positions are indicated by the dots. The vertical line of three dots in the center of the map correspond (from top to bottom) to midline frontal, central and parietal scalp sites, respectively.

 

Note the differences along the mid-line between the CSD maps for the object and spatial tasks. For the spatial task, the CSD has a pronounced negative focus over parietal scalp. For the object task, the CSD negativity is broadly distributed from parietal to frontal scalp. This topographic difference indicates that the configuration of brain sources active during retention is different in the object and spatial tasks.

 

 

Estimates of the locations and time courses of the brain sources of the scalp recorded ERP activity (Scherg, 1990) were used to examine the timing of activation in specific brain regions during the encoding and retention of visual object and spatial information (Ruchkin et al., 1997b). The time courses of activation in primary visual cortex, posterior and anterior temporal lobes, posterior parietal cortex, and pre-frontal cortex are illustrated in Fig. 2. The sources that best represented activity in primary visual cortex displayed early phasic responses, with maximal activation during stimulus presentation and relatively little activation in the subsequent retention interval (Fig. 2, top row). Brain regions active during the retention interval were mainly located in the ventral and dorsal visual processing pathways, and pre-frontal cortex. For both object and spatial information, the source analysis revealed a mixture of early phasic activity during stimulus presentation followed by long duration transient activity in the posterior temporal lobes (Fig. 2, second row from top). The long duration transients began during stimulus presentation and continued into the retention interval for about 2500 to 3000 msec. The source analysis further indicated that there was sustained activation in both pre-frontal cortex and posterior visual processing pathways during retention (Fig. 2, bottom two rows). For the object task, the sustained activity was in the anterior temporal lobes (ventral pathway), while for the spatial task the sustained activity was near the junction of parietal and occipital cortex (dorsal pathway). In the object task, where all the information to be retained was available at stimulus onset, sustained activity in the anterior temporal lobe started during stimulus presentation. The spatial task required memorization of a sequence of movements, with the last movement beginning 1500 ms after the stimulus began. In this case, sustained activity near the junction of parietal and occipital cortex did not begin until presentation of the stimulus sequence was complete. In both tasks, the onset of the sustained activity observed in the dorsal and ventral visual processing pathways was 60 to 300 milliseconds before the onset of the sustained activity observed in pre-frontal cortex.

 

The analysis of brain sources indicated that maintenance of visuospatial information involves sustained activation in both pre-frontal cortex and posterior visual processing systems. The finding of sustained activity in posterior cortex supports Cowan’s (1995) and Fuster’s (1997) views that maintaining activation in cortical regions subserving perception is a component of the retention process. Pre-frontal cortex and posterior perceptual regions interact, with pre-frontal cortex apparently providing the top-down control that extends activation in posterior cortex that began during perception and encoding.

 

2.1. Transient visual short-term memory

 

The sources with transient time-courses (Fig. 2, second row from top), brief windows of activity in the posterior temporal lobe presumably involved in intermediate visual processing operations, support Cowan’s (1995) contention that visual working memory consists of at least three stages, with a transient stage intermediate to an initial, high capacity, very limited duration iconic store that encodes the physical features of stimuli (Sperling, 1960), and a limited capacity, post-categorical sustained short-term store. It is noteworthy that the transient stores appeared to be in a part of the ventral pathway involved in preliminary processing of visual material, while sustained storage operations appeared to be in ventral or dorsal pathway regions involved in higher level processing of visual material.

 

2.2. Attention-based maintenance mechanisms

 

The view that short-term storage of visuo-spatial information depends, at least in part, upon enhanced activation in visual cortex due to attentionbased maintenance mechanisms, is supported by ERP studies of short-term memory and selective attention (Awh, Anllo-Vento, & Hillyard, 2000; Awh & Jonides, 2001). In the memory task, subjects remembered the locations of three falsefont characters, all of which were in either the left or right visual field. A probe stimulus presented during the delay interval elicited a short-latency phasic response that was larger when the probe was in the same visual field as the memory-set stimuli than when the probe was in the opposite visual field. The timing and topography of the enhanced response to the probe in the memory task was very similar to the enhanced response found in a visual selective attention task when the eliciting stimulus was in an attended location in comparison with when the stimulus was in an unattended location.

 

A combined PET-ERP source localization study (Hillyard & Anllo-Vento, 1998) has shown that enhanced ERP responses to stimuli in attended locations arise in extra-striate visual cortex, contra-lateral to the field of the attended stimulus. Furthermore, an fMRI study (Awh et al., 1999) that employed short-term memory and selective attention tasks similar to those in the Awh et al. (2000) ERP study, also found that hemodynamic activation was greatest in visual cortex contra-lateral to the field of the memorized or attended stimuli, with a high degree of overlap of fMRI activation in the memory and attention tasks. This convergence of results implies that shortterm storage of visual location information entails enhanced, sustained activation in cortical regions involved in the perception/encoding of the visual material, and the enhanced activation depends upon attention-based maintenance mechanisms.

 

 

Figure 2.  Estimated time courses of activation in visual cortices and prefrontal cortex during encoding (pre-3000 ms) and retention1 (post-3000 ms) of visual-object or visual-spatial information. The time axis extends from 360 ms before to 5550 ms after stimulus onset. Stimulus duration is 2000 ms. The waveforms and their brain locations were estimated by source analyses (Scherg, 1990) of across-subjects (n=12) averaged ERPs recorded from scalp by a 24 channel montage. The vertical scale is in arbitrary units. In this and subsequent figures, stimulus presentation intervals are demarcated by vertical lines.

 

The source analyses indicate that primary visual cortex is most active during stimulus presentation, with relatively short latency phasic activity  synchronized to the presentation of each stimulus (top row). There is sustained activity during retention in both pre-frontal cortex (bottom row) and the dorsal and ventral visual processing pathways (third row from top). The waveforms in the second row from the top further indicate that there is long duration transient activity that begins during stimulus presentation and then extends for 2000-3000 ms into the post-stimulus interval.

 

 

2.3. Summary: Visuo-spatial working memory

 

Implications of the ERP studies of visual working memory are summarized schematically in Fig. 3. The timing of the initial (less than 1000 ms poststimulus) phasic deflections in primary visual cortex and the posterior temporal lobes suggests that these brain regions contribute to the operation of the iconic store (Fig. 3, top row). The subsequent longer duration (_ 3000 ms) deflections in the posterior temporal lobes indicate the existence of transient, intermediate stores whose role may be to support the translation of information from iconic to sustained storage formats (Fig. 3, middle row). The sustained activity in the dorsal and ventral pathways indicates that short-term maintenance of visual information depends upon activation of posterior sensory processing systems as well as the pre-frontal cortex (Fig. 3, bottom row). This contrasts with fMRI studies of visual working memory by Haxby, Ungerleider and co-workers (Courtney et al., 1997; Haxby et al., 2000). In their fMRI studies, it appeared that activation in posterior visual processing pathways had a pronounced transient character in comparison with clear sustained activation in frontal regions. Thus, Haxby et al. (2000) suggested that the role of these posterior regions was mainly in the domain of perceptual processing, while short-term storage depended primarily upon frontal regions. The ERP findings suggest that the fMRI measures in posterior cortex may have given too much weight to the activation of the transient stores, thus cloaking the sustained activity in posterior cortex and its contribution to short-term retention of visual information. The role of the frontal regions may be in the domain of sustained attentional drive directed at those posterior regions whose activation is to be maintained.

 

3. Retention of verbal information in short-term memory

 

A number of ERP studies of delayed serial recall indicate that verbal short-term memory depends upon more than phonology during retention and redintegration of degraded phonological representations at retrieval. Utilizing sustained ERP activity recorded during the retention interval, Lang et al.(1992) and Ruchkin et al. (1997a) found that retention of verbal material involves processes that are specific to the modality of presentation, while Ruchkin et al. (1999) found that sustained supramodal lexical and semantic processes also were active during retention. Finally, an item recall study (Cameron, Haarmann, Grafman, & Ruchkin, 2002) indicated that the contribution of semantic representations in long-term memory to short-term retention was not simply a result of their being primed during study of the stimuli to be memorized. Rather, the act of maintaining information in shortterm memory results in a concurrent heightened activation of long-term memory representations, beyond the level of activation caused by priming associated with the initial processing of the stimuli.

 

 

Figure 3.  Schematic of the timing of the activation of three hypothesized short-term storage systems that contribute to the operation of visual shortterm memory. The duration of the initial, iconic stage is about 500-1000 ms (Sperling, 1960). Based upon the results of ERP studies, the transient store operates over the 500-4500 ms latency range, while the onset of the sustained store is dependent upon the timing of stimulus delivery. In Ruchkin et al. (1997b), the onset of the sustained store was approximately 500-800 ms after all the information provided by the stimulus had been delivered.

 

 

3.1. Modality-specific processing streams in verbal short-term memory

 

Penney (1989) has argued that, along with phonological rehearsal, auditory or visual modality-specific verbal short-term memory processes support retention, depending upon whether the material is heard or read, with the auditory processing stream being more durable than the visual processing stream. The results of two ERP studies (Lang, Starr, Lang, Lindinger, & Deecke, 1992; Ruchkin et al., 1997a) support Penney’s contention. The pattern of ERP activity during the post-stimulus retention interval differed for verbal material (digits in Lang et al., a non-word in Ruchkin et al.) that was heard or read. Note the differences between the ERP waveforms (Fig. 4a) and scalp topographies (Fig. 4b) associated with the two modes of stimulation in the Ruchkin et al. (1997a) study.

 

 

Figure 4a.  Across-subjects (n=13) averaged scalp ERPs in a delayed serial recall task in which the material was presented either aurally (dashed lines) or visually (solid lines). The task was to remember a pronounceable five syllable non-word. The time axis extends from 270 ms before to 5640 ms after stimulus onset. Stimulus duration was 2000 ms. In this and subsequent figures of scalp ERP waveforms, the ERPs were originally recorded with ACcoupled amplifiers (which attenuated low frequency ERP activity). The waveforms were digitally rendered to the approximate wave shapes that would have been obtained with DC-coupled amplifiers (no attenuation of low frequency ERP activity). The ERPs are plotted with negative polarity up with respect to a digitally linked A1 and A2 reference.

 

Note that for auditory stimuli there is a sustained frontal negativity, lateralized to the left, with a relatively short onset latency (during the stimulus interval). For visual stimuli, the sustained frontal negativity is lower amplitude, with an relatively late onset (after the stimulus interval). The ERPs elicited by the visual stimuli also display a transient positivity over centro-parietal scalp, that begins during stimulus presentation and ends about 2500 ms after stimulus offset, and a transient negativity, over bilateral posterior temporal and parietal temporal scalp. No such positivity is elicited by the auditory stimuli.

 

 

 

Figure 4b.  Estimated current source density maps for the scalp topography of the ERP activity presented Fig. 4a. The maps are for the activity at selected time points in the retention interval: 500, 2000 and 3500 ms after offset of the 2000 ms duration stimulus. The difference between contour lines corresponds to a current density increment of 1µV/cm2.

 

Note that the maps for the auditory stimuli (upper row) indicate a relatively rapid build up of ERP negativity focused over left frontal scalp. In contrast, the maps for the visual stimuli (lower row) indicate that the left frontal negative focus builds up more slowly, while early in the retention interval there is a focus of positive activity over central-posterior scalp, and a bilateral focus of negativity over posterior temporal scalp that is not seen in the maps for auditory stimuli. These differences in timing and topography between the brain responses to auditory and visual stimuli are evidence for the contribution of modality specific processes to the operation of verbal short-term memory.

 

 

For visual but not auditory stimuli, Ruchkin et al. (1997a) found a long duration transient positivity at mid-line parietal and central sites that began during stimulus presentation and ended about 2500 ms after stimulus offset (Fig. 4a). The amplitude of the transient positivity increased directly with verbal memory load (Ruchkin, Johnson, Jr., Canoune, & Ritter, 1990; Ruchkin, Johnson, Jr., Grafman, Canoune, & Ritter, 1992; Ruchkin et al., 1994). The mid-line posterior transient positivity was not found in ERP scalp recordings obtained in visual-object and/or visual-spatial short-term memory tasks (155,523,685,812). In view of its timing, sensitivity to memory load and apparently exclusive elicitation by verbal material that is read, this aspect of the visual processing stream evidently indexes the operation of a visual, non-phonological verbal storage process. This process is possibly based on orthographic codes; it is active around the time of phonological recoding, and maintains representations of material that has undergone initial visual analysis (Shallice & Vallar, 1990; Vallar & Papagno, 1995).

 

There was sustained negative ERP activity in the post-stimulus retention interval for both auditory and visual stimuli. However, there were timing and topographic differences for the two modalities. For auditory stimuli, the sustained negativity appeared to have two constituents: 1) a left-lateralized negativity that was largest over frontal sites and negligible over posterior sites; 2) a lower amplitude right-lateralized negativity with roughly the same amplitude at frontal and posterior sites. Both of these auditory sustained negativities began during the stimulus interval. The negativity elicited by the visual stimuli also appeared to have two constitutents: 1) a frontal, left-lateralized negativity, with a lower amplitude and later onset (during the post-stimulus interval) than the left frontal auditory negativity; 2) a bilateral, transient negativity, most clearly discerned at posterior temporal sites (see Fig. 4a), that began during stimulus presentation and decreased over the post-stimulus, retention interval.

 

The left-frontal negativity that is common to both modalities (albeit with different amplitudes and onset latencies) probably reflects operations associated with maintaining phonological representations in verbal short-term memory. Its earlier onset for auditory stimuli is evidence for auditory material having a rapid, direct access to the phonological memory system while visual material first undergoes a more time-consuming re-coding to a phonological format before entering the phonological system through articulatory rehearsal (Shallice & Vallar, 1990).

 

The amplitude of the left frontal negativity varies directly with verbal memory load (Ruchkin et al., 1990; Ruchkin et al., 1992; Ruchkin et al., 1994), and there is a significant across-subject correlation between its amplitude and articulation rate (Ruchkin et al., 1994), suggesting that phonological rehearsal operations covary with the processing indexed by the left frontal negativity. A study that contrasted retention of familiar, verbalizable material with unfamiliar, non-nameable material indicated that the left frontal negativity incorporates a composite of executive control processes (Bosch, Mecklinger, & Friederici, 2001). Taken together, these various findings suggest that the left frontal negativity indexes a combination of attentional control and phonological rehearsal operations that are involved in the short-term retention of verbal material.

 

The ERP data map onto Penney’s view of verbal short-term memory. The left frontal negativity reflects sustained mnemonic operations, probably involving phonological representations, that are common to both auditory and visual stimuli. Modality specific operations are indexed by the sustained negativity over the right hemisphere elicited by auditory stimuli, and the posterior transient waveforms elicited by visual stimuli, namely the mid-line positivity and bilateral temporal negativity. The timing of the modalityspecific ERP patterns suggest that auditory verbal mnemonic processes are more durable than visual verbal mnemonic processes.

 

3.2. Contributions of lexical and semantic codes to retention

 

There is widespread agreement, based upon evidence from behavioral studies of intact and impaired subjects, that phonological codes are involved in the maintenance of verbal information in working memory (Baddeley, 1986). There is less agreement about whether lexical-semantic codes actively contribute to the maintenance process. One view is that lexical-semantic codes are not actively involved in retention. Rather, lexical-semantic information is thought to contribute to verbal working memory during retrieval, with lexical-semantic codes in long-term memory facilitating recognition of partially degraded information in the phonological store (Hulme, Maughan, & Brown, 1991; Hulme et al., 1997; Walker & Hulme, 1999). Alternative views stress that language processing activates a variety of codes (modality-specific, phonological, lexical, semantic, syntactic) that are maintained at different strengths, depending upon task demand, over time (Penney, 1989; Monsell, 1984; Saffran, 1990; Saffran & Martin, 1990; Martin & Romani, 1994; Martin & Saffran, 1997). While phonological rehearsal is a possible contributor to the maintenance of information in verbal short-term memory, it is viewed as neither necessary nor sufficient for all of the retention operations required of verbal short-term memory. Cowan and Kail (1988; 1996) postulated that the attention given to maintaining verbal material in working memory raises and prolongs activation of the words' longterm memory codes, and, through this enhanced activation process, lexicalsemantic codes contribute to retention of verbal information in working memory.

 

It is difficult to decide between these different conceptions of verbal working memory from behavioral data alone, since behavioral data reflect a combination of encoding, retention, retrieval and decision operations. Using the temporal resolution of ERPs, brain activity specific to the retention interval can be delineated and analyzed, so that the types of codes that influence the retention process can be determined. This approach has been applied across a series of studies that examined the contributions of lexical and semantic processes to brain activity during retention. These studies have shown that the patterns of brain activation during short-term maintenance of verbal material are influenced by the lexical status of the material (Ruchkin et al., 1999), 2) whether the referents of words are concrete or abstract (Ruchkin et al., in preparation).

 

3.3. Lexical status

 

Evidence for an active contribution of lexical codes to maintenance of verbal information in working memory was obtained from ERPs recorded during performance of a serial recall task involving retention of aurally presented words or pseudo-words designed to be maximally similar to the words in their sound structure (Ruchkin et al., 1999). The number of items in the word (five) and pseudo-word (three) lists were such that recall error rates were approximately matched for the two types of stimuli. The finding that five words could be retained at approximately the same level of accuracy as three pseudo-words was consistent with prior studies of verbal working memory (Hulme et al., 1991; Roodenrys & Hulme, 1993). It has been argued that the cause of the advantage of words over pseudo-words was that restoration during retrieval of partially degraded information in the phonological buffer was more effective for words (Roodenrys & Hulme, 1993). While such processing during retrieval may contribute to the word advantage, there is no compelling reason to believe that it is the only contributor.

 

 

Figure 5a.  Across-subjects (n=11) averaged scalp ERPs in verbal short-term memory (left panel) and non-memory control (right panel) tasks contrasting the processing of words (solid lines) and pseudo-words (dashed lines). Stimuli were presented aurally. In order to approximately balance the error rates in the word and pseudo-word memory tasks, stimuli consisted of either five words or three pseudo-words. The time axis extends from 360 ms before to 8595 after word onset. To align the offset times of words and pseudo-words (5000 ms after word onset), pseudo-word onset was 2000 ms after the time of word onset.

 

Note that the sustained negativity during the post-stimulus retention interval in the memory task was larger for words. This effect was most marked in the vicinity of central mid-line scalp. There was no such difference between word and pseudo-word ERP activity in the post-stimulus interval of the non-memory control task.

 

 

 

Figure 5b. Maps of the scalp topography of the across-subjects averaged voltage fields for the ERP activity in Fig. 5a. The maps depict the distribution of ERP activity over the scalp at the end of the delay interval (3010 to 3500 ms after stimulus offset). The difference between contour lines corresponds to a voltage increment of .5 µV.

 

Note that the word and pseudo-word topographies display a marked difference over central scalp (more negativity for words) in the memory task (left column). The topographies in the control task (right column) are similar for words and pseudo-words, and differ from the topographies in the memory task. These results support the view that, when a conscious effort is made to hold words in short-term memory, lexical codes contribute to the maintenance process.

 

 

Lexical status also influenced ERPs during the delay interval (3600 ms) of the serial recall memory task, well after termination of stimulus presentation and well before retrieval commenced, indicating that brain activity during retention is directly influenced by the availability of lexical-semantic information (Ruchkin et al., 1999). Words elicited more negativity during retention than pseudo-words, with the effect being most marked at the central mid-line site (Fig. 5, left panel). This difference was sustained throughout the delay interval, with no indication of a significant increase with the approach of the time of retrieval. Thus, the effect of lexicality upon the ERPs reflected a process that subserved retention, rather than a retrieval-oriented process that developed during the retention interval. The effect of lexical status in the retention interval of the memory task was specific to consciously controlled memory operations, since lexical status had a negligible influence upon ERP activity in the poststimulus delay interval of a "non-memory" control task with similar attentional demands and stimulus and response processing requirements (Fig. 5, right panel).

 

Lexical status and number of items to be recalled (memory load) had different effects upon ERP activity during retention. Load was relatively high in the memory task (five or three items to be maintained in the delay interval) in comparison with the control task (one item, either "yes" or "no", had to be maintained in the delay interval). In contrast with the effect of lexical status (largest at the mid-line central site), the effect of number of items to be recalled was largest at the left frontal site. Figure 5 indicates that, starting at about 1000 msec after stimulus offset, the negativity over the left frontal site was largest in the word condition of the memory task (five items), next largest in the pseudo-word condition of the memory task (three items), and smallest in the control task (one item).

 

It might be argued that greater familiarity with the phonological structures of the words in comparison with the pseudo-words was responsible for the ERP results (Hulme, Roodenrys, Brown, & Mercer, 1995). Because the words consisted of familiar combinations of familiar syllables, while the pseudo-words consisted of unfamiliar combinations of familiar syllables, it is possible that familiarity may have affected ease of rehearsal. However, the phonological short-term memory studies reviewed above suggest that ERP indices of phonological rehearsal effects would most likely be manifested in the amplitude of the left frontal negativity, and not at the central sites where the lexicality effect was most pronounced.

 

The timing of the word/pseudo-word topographic differences suggested that the influence of lexical status upon retention began during encoding, starting with the presentation of the second item, and continued through the delay interval. Since the topographic differences occurred only in the recall task, and only after presentation of the first item, they were not likely to have been indices of the automatic activation of lexical codes postulated to occur as words are initially processed. Nor were the word/pseudo-word topographic differences likely to be only the remnants of lexical processing that occurred during intentional encoding for memory, since there was no such word/pseudo-word difference for the first item, and the difference was most pronounced and systematic during retention. Rather, the pattern of ERP activity suggests that the ERPs indexed the intentional maintenance of lexical codes subsequent to their activation, and that the maintenance process operated in parallel with encoding of later items in the stimulus series and then continued throughout the retention interval. Hence, the timing and topography of the ERPs in the Ruchkin et al. (1999) study support the contention that lexical processes contribute to verbal working memory maintenance operations when words are consciously held in working memory.

 

3.4. Lexical and semantic activation

 

Orthogonal variations of lexical and semantic properties of words to be remembered also influenced brain activation during retention, with different activation patterns for lexical and semantic manipulations (Ruchkin et al., unpublished data). A series of four different words was presented visually at a rate of one word/sec, followed by a 3500 msec delay interval that terminated with a serial recall test. All four words in a series had the same ombination of levels of frequency (high or low -- lexical variation) and concreteness (concrete or abstract -- semantic variation).

 

To delineate ERP activity that was specific to memory operations, there was a non-memory control task in which the stimulus and post-delay interval response processing demands were similar to those in the memory task, but there was no memory requirement in the delay interval. Subjects searched the series of four words for occasional deviant trials with a repeated word (probability = .10). Subjects were instructed to respond to a deviant trial with a finger movement immediately after presentation of the last word in the series, and to withhold the movement if the trial was not a deviant. At the end of the delay interval, an alphabetic character was displayed, and the subjects’ task was to say the letter in the alphabet that was in the third position after the letter in the post-delay interval display. For example, if the displayed letter was "j", then the response should be the letter "m". Only non-deviant control trials were used in the ERP analyses. The same words were used in the memory and control tasks, but with different combinations of four word series.

 

Error rates in the serial recall task were: 2.25% for high frequency, concrete words; 3.25% for high frequency abstract words; 5.05% for low frequency, concrete words; 8.16% for low frequency, abstract words. There were significant effects upon error rate of word frequency (F1,11 = 11.65, p = .0058) and concreteness (F1,11 = 8.55, p = .014). The error rate for the alphabet search that followed the delay interval in the non-memory control task was 1.13%. Debriefing indicated that only 7 of the 12 subjects in the study employed meanings of the words in their memorization strategy.

 

The ERPs for all subjects (n=12), regardless of memorization strategy, displayed a sustained increased negativity for low frequency words during the delay interval, with the effect of word frequency being maximal over mid-line central-parietal scalp (Fig. 6, upper panel, dashed lines). No such sustained difference was found in the control task (Fig. 6, lower panel, dashed lines). The average ERP amplitude over the last 2500 ms of the delay interval was used as a measure of ERP activity in the delay interval. ANOVA of these ERP amplitude measures at the six scalp sites in the centro-parietal mid-line region (C3 Cz C4 P3 Pz P4) that displayed the largest variation in sustained negativity as word frequency was manipulated revealed a significant effect of word frequency in the memory task (F1,11 = 11.23, p = .0065) and no significant effect in the control task (F1,11 = 0.63). Estimates of the temporal activation of the brain sources underlying the sustained increased negativity for low frequency words indicated that the sustained increased negativity started during presentation of the third word in the series.

 

The ERPs for those subjects (n=7) who used the meanings of the words in their memorization strategy displayed a sustained increased negativity for abstract words in the delay interval, with the effect being largest over frontal mid-line and left frontal-temporal scalp (Fig. 6, upper panel, solid lines). No such sustained difference was found in the delay interval of the control task (Fig. 6, lower panel, solid lines). ANOVAs of the ERP amplitude measures at the five scalp sites in the left frontal and frontal mid-line region (F7 F3 Fp1 Fz F4) that displayed the largest variation in sustained negativity as concreteness was manipulated revealed a significant effect of concreteness in the memory task (F1,6 = 10.81, p = .017) and no significant effect in the control task (F1,6 = .11). Estimates of the temporal activation of the brain sources underlying the sustained increased negativity for abstract words indicated that the sustained increase started during presentation of the second word in the series.

 

Concreteness also affected phasic ERP responses synchronized to the presentation of each word, with there being increased positivity in the ERP responses to abstract words in both the memory and control tasks. The scalp topography of these phasic responses to the stimuli differed from the sustained effect in the delay interval. In addition to the polarity difference, the phasic effect during stimulus presentation had a more posterior scalp distribution and was not lateralized to the left. Thus it is not likely that the same brain (and therefore cognitive) processes underlie the effects of concreteness upon the phasic ERP activity in the stimulus interval and the sustained ERP in the delay interval.

 

The effect of word frequency upon sustained ERP activity in the delay interval is not likely to be due to pre-lexical processing such as phonological rehearsal. Other studies have reported increased sustained negativity over left frontal scalp as phonological load increasd (Ruchkin et al., 1990; Ruchkin et al., 1992; Ruchkin et al., 1994; Ruchkin et al., 1997a). In contrast, the shift from high to low frequency words, which caused a significant increase in error rate, resulted in an increased sustained negativity over bilateral centro-posterior scalp. The topography associated with the word frequency effect is congruent with the effect of the word versus pseudo-word manipulation (see above, (Ruchkin et al., 1999)). However, the variation in sustained negativity as concreteness was manipulated only may be an indirect ERP sign of semantic processing during retention of verbal material, since its topography is consistent with findings from short-term memory studies where phonological load or general attentional demands were manipulated (Ruchkin et al., 1990; Ruchkin et al., 1992; Ruchkin et al., 1994). The error rates suggest that abstract words demand greater cognitive resources in short-term memory than concrete words. In such a case, other processing systems may have become more active as a compensatory effect when abstract words had to be retained. Nevertheless, the finding that variation of concreteness affected brain activity during the delay interval does indicate that semantic processes actively support the maintenance of verbal information in short-term memory.

 

 

Figure 6.  Across-subjects averaged scalp difference ERPs in a verbal shortterm memory task (serial recall of a list of four words - upper panel) and a non-memory control task (detect a repeated item in the list of words – lower panel). There was no memory requirement in the post-stimulus delay interval of the control task. Difference ERPs show the effects of concreteness and word frequency upon brain activity in the stimulus and delay intervals. The word frequency effect was revealed by subtracting ERPs elicited by lists of high frequency words from ERPs elicited by low frequency words (Low - High), pooled over abstract and concrete words and averaged over all subjects (n=12). The concreteness effect was revealed by subtracting ERPs elicited by lists of concrete words from ERPs elicited by abstract words (Abstract - Concrete), pooled over high and low frequencies, and averaged across those subjects (n=7) who employed word meaning in their memory strategy. The time axis extends from 360 ms before to 7520 ms after stimulus onset. Stimulus duration was 4000 ms and the duration of the subsequent delay interval was 3520 ms.

 

Note that in the memory task, during the post-stimulus retention interval, the concreteness effect is largest over frontal scalp, with greater negativity for abstract words. In contrast, the frequency effect is largest over centro-posterior scalp during retention, with greater negativity for low frequency words. In the non-memory control task, the effects of concreteness and frequency are relatively small in the post-stimulus interval. These results indicate that semantic codes also contribute to the maintenance of verbal information in short-term memory, and the combination of brain sources associated with the semantic processes differs from the combination of sources associated with the lexical processes.

 

 

3.5. Summary: Verbal working memory in serial recall and match/mismatch tasks

 

Fig. 7 summarizes in schematic form the timing results obtained from our ERP studies of verbal working memory in delayed serial recall and match/mismatch paradigms. These data support the view that multiple, simultaneously active processes are involved in the short-term maintenance of verbal information (Cowan, 1988; Cowan, 1995; Cowan & Kail, 1996). The data are consistent with the notion that phonological and lexical-semantic codes may interact throughout the retention interval, as opposed to only during retrieval. The activated lexical-semantic codes continuously counteract degradation of the material in the phonological buffer, while phonological codes may have a role in counteracting degradation of the lexical semantic codes (Martin & Romani, 1994; Martin & Saffran, 1997). The timing of the ERPs indicates that the sustained lexical-semantic processes start during encoding, evidently as information concerning the properties of the material to be memorized builds up, and continues in the subsequent retention interval. Along with behavioral data (Penney, 1989), the ERP time-courses also suggest that the modality-specific auditory-verbal store is more durable than the modality-specific visual-verbal store.

 

 

Figure 7.  Schematic of the timing of hypothesized stores that contribute to the operations of verbal working memory.

 

 

3.6. Activation of long-term memory and retention

 

Cameron et al. (2002) used ERPs to test the premise that retention of words in short-term memory involves activation of the words’ semantic representations in long-term memory (i.e., activated long-term memory provides a representational basis for short-term maintenance of information). Cameron et al. conceptualized the instantiation of the activation process as being akin to temporally extended priming. As words are initially processed, their various representations in long-term memory are activated. The depth of processing of these words determines the levels of activation (e.g., phonological, semantic). If no conscious effort is exerted to maintain the words in working memory, then the activation of their representations decays. If there is a conscious effort to maintain the words in working memory, then the level of activation of their long-term representations remains relatively high during the retention interval.

 

Cameron et al.’s (2002) approach was to contrast the degree of activation in the semantic neighborhood of a series of three words used in two tasks; one that required retention of the meanings of the words and one that did not require retention of the words. The three words in the stimulus series were associated with each other. The degree of semantic activation was determined from the ERP response to an incidental probe word presented during a delay interval that followed the stimulus series. The probe word was either unrelated to the last word in the stimulus series or semantically and categorically related to the last word in the series. In effect, the three words in the stimulus series were primes and the probe word was the target for the primes. Both the series of priming words and the probe word were presented aurally. Participants were instructed that the probe word was a distractor stimulus to be ignored, and no response was to be made to it.

 

The degree of semantic activation was assessed by means of a ERP phenomenon elicited by the probe, referred to as N400, that is sensitive to the extent to which semantic representations of the eliciting word have been activated by the previous presentation of verbal material (Bentin, McCarthy, & Wood, 1985; Bentin, Kutas, & Hillyard, 1993; Holcomb, 1988; Nobre & McCarthy, 1994; Bentin, McCarthy, & Wood, 1984). N400 is a phasic negative deflection that is most prominent 200-800 ms after stimulus onset for auditory stimuli (Bentin et al., 1993; Holcomb & Neville, 1990). N400 negativity is reduced when the eliciting word has been primed by (i.e., is in the semantic neighborhood of) previously presented verbal material and is increased when the eliciting word has not been primed (Kutas & Hillyard, 1989; Bentin et al., 1993; Holcomb, 1988). This variation in N400 negativity can occur even when participants are unaware of the semantic relationships (Bentin et al., 1984), and when elicited by an unattended stimulus, provided that the previously presented material was attended (Kellenbach & Michie, 1996).

 

Kutas and Federmeier (2000) have argued that N400 amplitude reflects the degree of difficulty with which verbal material is accessed in long-term memory. The degree of access difficulty (or ease) depends upon the extent to which the verbal material is incompatible (or compatible) with the context established by previously presented verbal material. Thus, Cameron et al. (2002) reasoned that the access difficulty (or ease) would depend, to some extent, on the degree to which the semantic representations of previously presented material were activated. That is, increased activation of the material’s semantic representations would establish greater contextual constraints on accessing the long-term memory representations of subsequent verbal material. Hence, if maintaining the three priming words in working memory involved enhanced activation of their semantic codes, then the N400 elicited by a related probe would be smaller, and the N400 elicited by an unrelated probe would be larger, in the Memory task than in the Control task.

 

In the Memory task, a set of three priming words were presented aurally at a rate of one word per second, followed by a 4000 ms delay interval. At the end of the delay interval, a word was displayed until the participant verbally indicated whether the word matched or did not match the meaning of any of the three primes. Matches occurred on 50% of the memory task trials. An incidental probe word was presented aurally 2000 ms after onset of the delay interval. On 50% of the trials, the probe word was semantically and categorically related to the last word of the previously presented priming set. On the other trials, the probe was neither semantically nor categorically related to any of the primes.

 

The Control task was designed such that 1) there was no memory requirement during the 4000 ms delay interval, 2) the depth of processing of the three priming words preceding the delay interval was comparable to that in the Memory task, and 3) the operation at the end of the delay interval was comparable in difficulty to that in the Memory task. To eliminate the memory requirement, there was no contingency between the operations preceding and following the delay interval. Before the delay interval, participants decided whether the priming set contained one word that was unrelated (not associated) with the other two words. Participants were instructed to respond immediately with a vocal response if they detected an unrelated word in the priming set, and not to respond when the words in the priming set were all related. On 90% of the trials the three primes were associated (and hence there was no vocal response), and only these trials were used in the analysis of the ERP data. In Control trials, the task at the end of the delay interval involved adding a visually presented pair of two-digit numbers. The display of the pair of numbers terminated when the participant responded verbally with the sum. As in the memory task, an incidental probe word was aurally presented 2000 ms after onset of the delay interval, and was semantically and categorically related to the last word in the priming set on 50% of the trials and unrelated with any of the words in the priming set on the remaining trials.

 

 

Figure 8. Across-subjects (n=24) averaged ERPs at posterior scalp sites, where the effects of probe status and task were largest, for the four combinations of incidental probe status and task. The vertical level of the time axis for each recording corresponds to the average amplitude in the 100 ms interval preceding probe onset. The time base extends from 100 ms prior to the incidental probe onset to 1000 ms after probe onset.

 

Note that the N400 activity elicited by unrelated probes was largest in the memory task compared to the control task, and that the N400 activity elicited by related probes was smallest in the memory task compared to the control task. This combination of results indicates that there was greater activation of semantic representations in the delay interval of the memory task than in the non-memory control task.

 

The ERP responses to the probe for each of the four combinations of task (memory, control) and probe status (related or unrelated to the preceding three words) are presented in Fig. 8. ERPs elicited by unrelated probes in both memory and control tasks displayed enhanced N400 activity over posterior scalp compared with ERPs elicited by related probes. Fig. 9 displays the contrast between unrelated-minus-related probe difference waveforms for the memory and control tasks. Effects specific to the memory and control tasks, other than that of probe status, were approximately canceled in these difference waveforms. Fig. 9 makes clear that the effect of semantic relatedness upon N400 activity was greater in the memory task. Thus it can be inferred from Fig. 9 that retention of previously presented words results in a generally enhanced level of semantic activation during the retention interval, as indexed by N400 amplitude and duration.

 

 

Figure 9. Across-subjects average difference ERPs for the unrelated minus related incidental probe subtraction. This figure depicts the effect of the interaction between task (memory, non-memory control) and probe status (related, unrelated) upon N400 activity elicited by the incidental probe. Note that the effect of whether the incidental probe word is semantically related or unrelated to the three priming words preceding the delay interval is greater in the memory task.

 

 

The question of whether there is specifically greater activation in the semantic neighborhood of words when they are maintained in short-term memory is addressed by comparing the memory and control task ERP waveforms for related probes (red waveforms in Fig. 8). Note that, for the related probes, the N400 activity in the memory task has a shorter duration and somewhat lower amplitude than in the non-memory control task. This effect is most marked at posterior temporal and occipital sites. Thus, it can be inferred from the smaller N400s elicited by the incidental probe in the memory task that activation in the semantic neighborhood of previously presented words is greater when the words are consciously maintained in short-term memory.

 

These results show that retention of verbal material in short-term memory tasks that emphasize meaning is accompanied by extended, enhanced activation of the material’s semantic representations. The effect of semantic relatedness upon the incidental probe is not readily explained by verbal short-term memory being a separate buffer into which copies of long-term memory representations are transferred. A more parsimonious explanation is that verbal short-term memory is a process that involves continuous maintenance of long-term memory representations at enhanced levels of activation. Thus, the long-term memory representations of the related probe words may be easier to access in the memory task than in non-memory control task (Kutas & Federmeier, 2000). Conversely, the long-term memory representations of the unrelated probe words may be more difficult to access in the memory than non-memory control task due to enhanced activation of nonmatching features by the material held in short-term memory.

 

3.7. Sentence processing and semantic relatedness

 

The verbal working memory experiments reviewed above employed lists of unconnected items. Such paradigms have provided a useful but limited view of brain activity involved in typical verbal working memory operations. The study described below extended this view by using a sentence, rather than a series of unconnected words, as the stimulus, and requiring retention of the meaning of the sentence.

 

Comprehending a sentence involves a process of semantic and syntactic binding (Hagoort, 2000), whereby the meanings of the words in a sentence are related to one another and maintained in short-term memory as part of an integrated overall representation. There is evidence for a postinterpretative process that maintains thematic role relations following their syntactic computation (Caplan & Waters, 1999). This process is more error prone and/or engenders more brain activation when a sentence expresses more propositions (Caplan, Alpert, & Waters, 1998) suggesting that propositions and the thematic role relations they express are maintained by a capacity-limited semantic short-term memory process. Haarmann et al. (2002) sought neurophysiological evidence for such a process by manipulating semantic relationships within a sentence and analyzing the effects of the manipulation upon ERP and EEG activity in a post-sentence retention interval. The ERP results further provided support for Crowder’s (1993) proceduralist view of memory, namely that brain systems that process particular items of information also subserve storage of those items. Finally, the EEG results provided information on the interactions among brain systems involved in the initial rocessing and subsequent retention of the sentence.

 

The short-term memory process evidently depends in part upon interactions between frontal and posterior cortex implemented by the operation of frontal-posterior projection loops. The loops projecting from frontal cortex mediate the focusing of attention upon representations in posterior cortex to be retained, while the projections from posterior cortex provide information about the state of posterior cortical systems to frontal neural networks. Presumably the resulting influence of frontal and posterior cortical systems upon each other is actualized by an increase in synchrony between neural circuits in the two brain regions. Support for this view is provided by studies of verbal and visual-spatial short-term memory tasks in which the synchronization between EEG recordings from different scalp sites was analyzed with EEG coherence measures. The patterns of coherence between EEG recordings from frontal and posterior sites were found to differ markedly between the stimulus presentation and subsequent retention intervals (Sarnthein, Petsche, Rappelsberger, Shaw, & von Stein, 1998; von Stein & Sarnthein, 2000; Engel & Singer, 2001). These findings, when combined with evidence that the same brain regions are active during both the initial processing and subsequent post-stimulus retention, support the position that short-memory operations utilize specific patterns of connectivity among brain regions, and not buffers that are specialized to short-term memory storage.

 

This position was tested by Haarmann et al., (submitted) in a study of ERP activity during and following the reading of filler-gap sentences. Sentences, consisting of six phrases, were presented visually on a phrase-byphrase basis over a 4500 ms interval, followed by a 2000 ms delay inte