a comparison of forgetting in an implicit and explicit memory task

22
Jonmal of Experimental Psychology: General Copyright 1997 by the American Psychological Association, Inc. 1997, Vol. 126, No. 4, 371-392 0096-3445/97/$3.00 A Comparison of Forgetting in an Implicit and Explicit Memory Task Dawn M. McBride and Barbara Anne Dosher University of California, Irvine Among possible critieria for distinguishing separate memory systems for implicit and explicit memory is that of substantial differences in either the form or rate of forgetting. Prior literature has claimed both differential forgetting and equivalent forgetting for implicit and explicit tasks. Existing experimental data for word-stem completion and explicit control tasks were reviewed and shown to be inconclusive. Our experiments measure forgetting in comparable implicit and explicit memory tasks of stem completion and stem cued recall, The form and the rate of forgetting are essentially the same for these implicit and explicit tasks. Levels of processing and task conditions differ only in the level of initial learning or availability. Thus, either the implicit and explicit task reflect traces in the same memory system or they reflect traces in different systems that have identical forgetting dynamics. The goal of this article is to evaluate the form and rate of forgetting in implicit and explicit memory for words. Claims about differential forgetting in impficit and explicit memory are evaluated in a review of the relevant literature and in four new experiments. Implicit and Explicit Memory Implicit and Explicit Tasks Performance differences on implicit and explicit memory tasks have received much attention in recent literature (see Roediger, 1990; Schacter, 1987; and Riehardson-Klavelm & Bjork, 1988, for reviews). Typical explicit tasks, such as recognition and recall, require active recollection of a study episode. Implicit tasks differ from explicit tasks in that they measure recall of prior episodes without reference to the episode. For tasks that do not require explicit recall, memory or retrieval of the episode is inferred from a change relative to baseline performance. Some examples of implicit tasks include word-stem completion and word-fragment comple- tion, where participants are instructed to complete the stem or fragment with the first word that comes to mind. Memory Systems Underlying Implicit and Explicit Tasks Memory systems proponents argue that implicit and explicit performance dissociations are indicative of separate Dawn M. McBride, Department of Cognitive Sciences, Univer- sity of California, Irvine; Barbara Anne Dosher, Department of Cognitive Sciences and Institute of Mathematical Behavioral Science, University of California, Irvine. Portions of this work were supported by National Science Foundation Grant SBR-9396076, which originated as National Science Foundation Grant NSF89-19498 and Air Force Office of Scientific Research Grant AFOSR89-NL295. Correspondence concerning this article should be addressed to Barbara Anne Dosher, Department of Cognitive Sciences, 3151 Social Science Plaza, University of California, Irvine, California 92697-5100. Electronic mail may be sent via Internet to [email protected]. memory systems (Schacter & Tulving, 1994; Schacter, 1992, 1987; Squire, 1994, 1995; Squire & Knowlton, 1994; Squire, Knowlton, & Musen, 1993; Tulving & Sehacter, 1990; Tulving, Schacter, & Stark, 1982). In their recent book, Schacter and Tulving (1994) describe five major systems of human memory: procedural memory, the percep- tual representation system, semantic memory, episodic memory, and primary (working) memory, in this article, we are concerned orfly with claims about the distinct perceptual representation system and the episodic system. (Squire, 1995 [see also Squire, 1994; Squire & Knowlton, 1994; Squire et al., 1993], also claims distinct memory systems for these, but labels them nondeclarativepriming and declarativefact memory.) The perceptual representation system is thought to primarily underlie performance in implicit tasks such as word-stem or fragment completion. The episodic system is thought to primarily underlie performance in explicit tasks such as recognition and recall. Other researchers (e.g., Roediger, 1990) have argued that dissociations between performance on explicit and implicit tasks may reflect more subtle issues involving cross-accessibility of memory codes stored at input and accessed at retrieval. We first consider several sets of criteria for distinguishing between separate memory systems and then review some of the evidence upon which the claims of Schacter and Tulving (1994) are based. Criteria for Separate Memory Systems The criteria for concluding that separate memory systems underlie different peformances are still under debate. A variety of neuroanatomically distinct entities are known to be involved in memory function (see Fuster, 1995; Squire et al., 1993, for reviews). But when does a circuit of connec- tions involving these entities constitute a functional system, and how are these components related to behaviorally inferred systems? At least three sets of criteria for separate systems have been proposed. Nadel (1994) proposed two criteria for distinguishing between separate systems: (a) the peffor- rnance of different computations and (b) the maintenance of information over different time frames. Schacter and Tulv- 371

Upload: others

Post on 03-Feb-2022

16 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Comparison of Forgetting in an Implicit and Explicit Memory Task

Jonmal of Experimental Psychology: General Copyright 1997 by the American Psychological Association, Inc. 1997, Vol. 126, No. 4, 371-392 0096-3445/97/$3.00

A Comparison of Forgetting in an Implicit and Explicit Memory Task

Dawn M. McBride and Barbara Anne Dosher University of California, Irvine

Among possible critieria for distinguishing separate memory systems for implicit and explicit memory is that of substantial differences in either the form or rate of forgetting. Prior literature has claimed both differential forgetting and equivalent forgetting for implicit and explicit tasks. Existing experimental data for word-stem completion and explicit control tasks were reviewed and shown to be inconclusive. Our experiments measure forgetting in comparable implicit and explicit memory tasks of stem completion and stem cued recall, The form and the rate of forgetting are essentially the same for these implicit and explicit tasks. Levels of processing and task conditions differ only in the level of initial learning or availability. Thus, either the implicit and explicit task reflect traces in the same memory system or they reflect traces in different systems that have identical forgetting dynamics.

The goal of this article is to evaluate the form and rate of forgetting in implicit and explicit memory for words. Claims about differential forgetting in impficit and explicit memory are evaluated in a review of the relevant literature and in four new experiments.

Implicit and Explicit Memory

Implicit and Explicit Tasks

Performance differences on implicit and explicit memory tasks have received much attention in recent literature (see Roediger, 1990; Schacter, 1987; and Riehardson-Klavelm & Bjork, 1988, for reviews). Typical explicit tasks, such as recognition and recall, require active recollection of a study episode. Implicit tasks differ from explicit tasks in that they measure recall of prior episodes without reference to the episode. For tasks that do not require explicit recall, memory or retrieval of the episode is inferred from a change relative to baseline performance. Some examples of implicit tasks include word-stem completion and word-fragment comple- tion, where participants are instructed to complete the stem or fragment with the first word that comes to mind.

Memory Systems Underlying Implicit and Explicit Tasks

Memory systems proponents argue that implicit and explicit performance dissociations are indicative of separate

Dawn M. McBride, Department of Cognitive Sciences, Univer- sity of California, Irvine; Barbara Anne Dosher, Department of Cognitive Sciences and Institute of Mathematical Behavioral Science, University of California, Irvine.

Portions of this work were supported by National Science Foundation Grant SBR-9396076, which originated as National Science Foundation Grant NSF89-19498 and Air Force Office of Scientific Research Grant AFOSR89-NL295.

Correspondence concerning this article should be addressed to Barbara Anne Dosher, Department of Cognitive Sciences, 3151 Social Science Plaza, University of California, Irvine, California 92697-5100. Electronic mail may be sent via Internet to [email protected].

memory systems (Schacter & Tulving, 1994; Schacter, 1992, 1987; Squire, 1994, 1995; Squire & Knowlton, 1994; Squire, Knowlton, & Musen, 1993; Tulving & Sehacter, 1990; Tulving, Schacter, & Stark, 1982). In their recent book, Schacter and Tulving (1994) describe five major systems of human memory: procedural memory, the percep- tual representation system, semantic memory, episodic memory, and primary (working) memory, in this article, we are concerned orfly with claims about the distinct perceptual representation system and the episodic system. (Squire, 1995 [see also Squire, 1994; Squire & Knowlton, 1994; Squire et al., 1993], also claims distinct memory systems for these, but labels them nondeclarative priming and declarative fact memory.) The perceptual representation system is thought to primarily underlie performance in implicit tasks such as word-stem or fragment completion. The episodic system is thought to primarily underlie performance in explicit tasks such as recognition and recall. Other researchers (e.g., Roediger, 1990) have argued that dissociations between performance on explicit and implicit tasks may reflect more subtle issues involving cross-accessibility of memory codes stored at input and accessed at retrieval.

We first consider several sets of criteria for distinguishing between separate memory systems and then review some of the evidence upon which the claims of Schacter and Tulving (1994) are based.

Criteria for Separate Memory Systems

The criteria for concluding that separate memory systems underlie different peformances are still under debate. A variety of neuroanatomically distinct entities are known to be involved in memory function (see Fuster, 1995; Squire et al., 1993, for reviews). But when does a circuit of connec- tions involving these entities constitute a functional system, and how are these components related to behaviorally inferred systems?

At least three sets of criteria for separate systems have been proposed. Nadel (1994) proposed two criteria for distinguishing between separate systems: (a) the peffor- rnance of different computations and (b) the maintenance of information over different time frames. Schacter and Tulv-

371

Page 2: A Comparison of Forgetting in an Implicit and Explicit Memory Task

372 McBRIDE AND DOSHER

ing (1994) propose three criteria for distinct memory systems: (al) system operations can apply to any memory items of an acceptable class (class inclusion operations), CO1) a distinctive list of features must be defined (properties and relations), and (el) there are dissociations between task performances reflecting different systems (convergent disso- ciations). ~ Dosher and Rosedale (1991) proposed three criteria: (a2) different forms of representation, (b2) different properties of forgetting, and (c2) substantial differences in speed of retrieval (when the nature of the retrieval operation is apparently equivalent).

In each case, it is generally agreed that extensive converg- ing evidence may be necessary to define a system. The behavioral work has focused on the existence of perfor- mance dissociations between tasks reflecting putatively distinct memory systems. In this work, we study one possible dissociation between the explicit and implicit expressions of memory--evidence for differential rates of forgetting. This distinction is relevant to Nadel's criterion Co), Schacter and Tulving's criterion (el), and Dosher and Rosedale's criterion (b2).

Performance Dissociations

Priming in Amnesic Individuals

The claims for dissociations between explicit and implicit systems were first based on research with amnesic patients (Jacoby & Witherspoon, 1982; Warrington & Weiskrantz, 1970, 1974). These studies showed that amnesic individuals performed similarly to participants with normal memory on implicit tasks such as stem completion or perceptual identifi- cation, yet the amnesic individuals exhibited seriously deficient performance on explicit tasks as compared to participants with normal memory.

For some researchers, neuropsychological demonstrations of loss in one form of behavioral performance and preserved function in another constitute prima facie evidence of separate systems. However, often in amnesic individuals, neither the loss of explicit memory function nor the preserva- tion of implicit memory function is absolute. Under some circumstances, amnesic patients can exhibit above-chance recall and recognition (Hirst, 1989; Hirst, Johnson, Phelps, & Volpe, 1988; Hirst, Johnson, Phelps, Risse, & Volpe, 1986); and the severity of the amnesia in affected individuals can affect the amount of priming shown on the various tests (Schacter, 1987). (But see Hamann & Squire, 1996, for a report of one individual with significant priming yet no above-chance recognition.) Further, evidence of selective loss is not always interpreted in terms of separate systems in the sense of separable neural substrates for the memory trace or engram. Weber and Murdock (1989) argue that amnesic deficits in recall and recognition may reflect an inability to coordinate context codes necessary for retrieval during recognition or recall. The ambiguities in interpretation of the deficits in amnesic patients make it quite useful to develop converging evidence in participants with normal memory. This converging evidence has taken the form of document- ing performance dissociations between implicit and explicit tasks.

Dissociations in Levels of Processing

One such dissociation between explicit and implicit memory tasks is based on a levels-of-processing manipula- tion. Consider a typical example. In one study by Roediger, Weldon, Stadler, and Riegler (1992, Experiment 1), level of processing was manipulated with semantic and graphemic orienting tasks during study. Fragment and stem-completion tasks with explicit instructions showed large levels of processing differences, but these same tasks with implicit instructions showed similar performance for the semantic and graphemic study conditions.

The levels of processing dissociation between systems is based on the claim that only the explicit system is sensitive to conceptual processing at encoding, whereas the implicit system is unaffected by conceptual processing at encoding. In an ideal pattern of data, explicit recall or recognition is quite sensitive to the coding differences between semantic and graphemic orienting tasks, whereas implicit stem or fragment completion is completely insensitive to the orient- ing task. A pattern of results approximating the ideal is sometimes found in studies comparing explicit and implicit tasks. However, the levels of processing dissociation has been shown recently to be less robust than was originally believed. Semantic versus graphemic orienting tasks often differentially affect implicit tasks to some degree. And changes in experimental design (e.g., blocked or between- subjects study) have resulted in large levels of processing differences on implicit tasks in a number of studies (Challis & Brodbeck, 1992; Thapar & Greene, 1994). Current work is attempting to salvage the levels of processing dissociation by extracting "process-pure" measures of implicit perfor- mance (that should be unaffected by levels of processing; e.g., Jacoby, 1991). A process-pure measure of implicit performance estimates and excludes any possible impact of explicit processing. The assumptions underlying Jacoby's method are actively debated (Buchner, Erdfelder, & Vaterrodt- PlUnnecke, 1995; Cowan & Stadier, 1996; Curran & Hintz- man, 1995; Joordens & Merikle, 1993), and more work is required before the outcome will be certain.

Dissociation in Forgetting Rates

Another behavioral dissociation that has been proposed is one of forgetting rates. Of course, separate memory systems underlying an implicit and an explicit memory task might just happen to have the same functional form and the same rates of forgetting. However, a finding of a substantially different form or rate of forgetting for implicit and explicit tasks would lend converging support to the separate memory systems position. This article evaluates the evidence for differential forgetting in one typical implicit memory task and comparable explicit memory tasks. We begin by review-

The first two of these criteria are not obvious. Criteria al involves internal consistency arguments, and bl involves the development of full system descriptions. Neither are directly relevant here.

Page 3: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 373

ing what is known about forgetting in implicit and explicit tasks based on the existing literature.

Forgetting in Implicit and Explicit Tasks

Prior Research

A number of studies have compared forgetting in implicit and explicit tasks. The majority of existing studies use stem completion as the implicit task and recognition as the explicit task. A number of conflicting claims have been made.

Graf, Squire, and Mandler (1984, Experiment 2) found that for stem completion, performance over all participants was significantly above chance during an immediate test and a test 15 min after study but declined to chance during a test that followed study by 120 min. In comparison, recognition performance over all participants (vowel condition) had declined to chance after a 15-min delay. Other studies also report findings of dissimilar forgetting rates for stem comple- tion and recognition. In a study by Squire, Shimamura, and Graf (1987, Experiment 2), performance on immediate tests of stem completion and recognition was above chance averaged over all participants, but after a delay of 2 hr, stem-completion performance had declined to chance levels. On the other hand, recognition performance for normal control participants remained significantly above chance at test delays as long as 4 days. Chen and Squire (1990) reported differential forgetting for stem-completion and recognition performance with test delays of 0, 2, and 24 hr. Stem-completion performance had declined to chance within 2 hr, whereas recognition performance remained above chance longer than 24 hr. On the basis of these studies, Schacter (1987) has stated that "priming of word-stem completion. . , has proved to be a relatively transient phe-

nomenon, decaying across delays of minutes and hours over which explicit remembering persists" (p. 508).

Dorfman, Kihlstrom, Cork, and Misiaszek (1995), how- ever, have stated that "the rate of decay in recognition mirrors that of priming" and that "variables such a s . . . re- tention interval have been shown to affect priming and recognition in a similar fashion" (p. 245). One study they have cited as support for these statements compared stem- completion and recognition performance at test delays of 0, 20, and 90 min (Graf & Mandler, 1984, Experiment 2). Results indicated an effect of test delay on performance, but no interaction of test delay and test type. Roediger et al. (1992, Experiment 2) also report that stem-completion performance remained above chance for delays of 0, 2, and 48 hr. This experiment, however, was a comparison of various implicit tests and did not include an explicit test comparison.

Figure 1 shows a summary of the stem-completion results from these studies. Performance has been corrected for baseline performance given for each study and thus indicates priming rather than actual task performance. Retention interval estimates were made on the basis of information given in the studies. An average item-study-to-test delay was calculated from study time and study list lengths and from test times and lengths (see the Appendix for details of the calculations for each study). Because the comparison of forgetting rates was not the main focus of these studies, test delay was typically measured at just two or three different intervals, making it difficult to view the function of forget- ring across the entire range. (For example, Rubin and Wenzel, 1996, only evaluated forgetting functions with five or more points in their recent analysis of archival functions.) Overall, this summary exhibits a general decline in perfor- mance with increasing retention interval. However, we

O

O0

¢-

o~ t,- o.. E

(D ~L

04

Q

S T E M C O M P L E T I O N - Corrected for Baseline

o Chen & Squire (1990) v Graf & Mandler (1984) z~ Graf, Squire, & Mandler (1984) r7 Squire, Shimamura, & Graf (1987) o Roediger et al. (1992)

o

X ~"b~ v ~ o --..___.~..___ o A o

0¢ . . . . -~ ~

6 a'o 6'o 9b \ \ Retention Interval (min) days

Figure 1. Summary of percent correct data from experiments reporting forgetting for stem- completion performance. Data are corrected for baseline levels.

Page 4: A Comparison of Forgetting in an Implicit and Explicit Memory Task

374 McBRIDE AND DOSHER

cannot determine the functional form of forgetting in stem completion, and we certainly cannot infer the rate at which forgetting occurs, either within a single study or across the group of results.

All of these studies compared stem completion as an implicit task with recognition as an explicit task. Figure 2 displays a summary of recognition performance using the same retention interval estimates from these and one addi- tional study (Hashtroudi, Ferguson, Rappold, & Chrosnlak, 1988). Unfortunately, most of these studies report only hit rates, and false-alarm rates at the various delays ate unkown. Again a general decline in performance is indicated, but we cannot with any certainty know either the form or the rate of forgetting.

Measurement Issues

To complicate the issue further, there are both scaling differences and presumed process differences between the two tasks. Stem-completion performance ranges from a baseline percentage (usually about 5%-15%) to 100%. Recognition performance ranges from chance level (usually 25%-50%) to 100%. These range differences make direct comparison difficult. Recognition and stem completion also differ fundamentally in the kind of response required by the task. In recognition, a test item is given to the participant, who makes a response largely based on familiarity for the item. Stem completion requires a productive response: A cue is presented, and the participant generates a completion. To truly understand what would constitute comparable perfor- mance in these two tasks requires a strong model (similar to

models that relate recognition and recall performance; e.g., Gillund & Shiffrin, 1984; Raaijmakers & Shiffi'in, 1981). For these reasons, recognition is probably not the best explicit task for comparison to stem completion.

In the current study, forgetting rates for performance on a stem-completion (implicit) task were compared with perfor- mance on a stem-cued recall (explicit) task. The stem-cued recall task allowed the use of the same study and test stimuli for both task types. The presentation of the two tasks differed only in instruction. The two tasks require compa- rable production responses. Furthermore, this comparison equates the range of performance scores for the two tasks as much as possible.

In our studies, task performance was measured at a number of retention intevals in order to estimate the shape of the forgetting function and the rate of forgetting. A number of prior studies used a rather different strategy of observing long delays at which performance was still just above chance. Focusing on just above-chance performance is not as strong an approach as the measurement and estimation of forgetting over a substantial portion of the range. Estimating just above-chance performance is sensitive to the power of the experiment, and, importantly, to differences in the heterogeneity of the stimulus sets (e.g., is controlled by the few easiest to remember items).

Experiment 1 examined short delays, and Experiment 2 measured performance at longer delays. Experiment 3 measured long and short delays within one experiment. Retention intervals of approximately 1-90 min were chosen on the basis of previous results reporting stem-completion

0

0

0 o o

(D

(D f t .

0 U~ o Chen & Squire (1990)

n - - 0

V --~_~ V

A

A

RECOGNITION ~ O

0 0

v Graf & Mandler (1984)

n Squire, Shimamure, & Graf (1987) • Hashtroudi et al. (1988)

Graf, Squire, & Mandler (1984) • Sloman et al. (1988) Three Forced Choice \ \

6 s'o 6'0 9'o i 2 Retention Interval (min) days

Figure 2. Summary of percent correct data from experiments reporting forgetting for recognition performance. Data reflect hit rates only.

Page 5: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 375

performance that had declined to chance levels within the first 2 hr after study (Chen & Squire, 1990; Squire et al., 1987). Performance on the two task types was compared across intervals.

Some researchers have been concerned with whether participants fail to follow instructions and choose to use explicit recall in implici t tasks. Reaction t imes for test items were recorded as a measure o f processing t ime for each task. Substantial differences in response times for stem comple- tion and cued-recall tasks support the conclusion that participants are following instructions and using different cognit ive processes in the two tasks.

E x p e r i m e n t 1

On the basis of statements of fast declining performance for stem complet ion (e.g., Schacter, 1987), Experiment 1 was designed to examine stem-completion and cued-recall performance at delays of approximately 1-15 min. In the "s tudy" portions of the task, participants judged words for pleasantness (a semantic orienting task) or counted the number of letters with ascenders (a graphemic orienting task). This constituted a levels-of-processing manipulation: The pleasantness judgment requires semantic encoding, whereas the graphemic judgment requires only structural encoding. In the test portions of the task, participants either generated the first word that came to mind in response to a three-letter stem (the implicit task) or recalled the previously seen word beginning with the three-letter stem (the explicit task). The retention intervals were arranged by interleaving study and test portions appropriately (see next).

M e t h o d

Participants. Participants in Experiment 1 were 28 volunteers obtained through advertisement. They received $15 for a single 2.5-hr session. All participants were native speakers of English. Initially, participants were prescreened with a brief typing test to ensure typing speeds above 25 words per minute.

Materials. Three hundred eighty words were used in the experiment. The majority of the words (N = 210) were taken from a previous study by Paller (1990) in which similar tasks were used. The remaining 170 words were selected to be similar in length and familiarity to the words from the Paller study. All words began with a unique three-letter stem, and each stem had five or more possible completions. This set of stems was normed for baseline levels by testing a separate set of 38 participants on stem completion without prior study. The probability of completing the stems with the target words on our list was 11.5%.

Design. The words were randomly assigned to eight study lists: two long lists with 132 words each and six short lists with 16 words each. Each participant received a new random assignment of words to conditions, and each word was equally likely to appear in each condition over participants. Long study lists contained 66 words in the rate pleasantness study condition and 66 words in the count ascenders condition. Short lists contained 8 items of each of the two study conditions. The experiment was designed in two blocks, in which a block consisted of an initial long study list and three short study lists, each followed with a test list that tested items from the prior short list and from the long list. The shortest retention interval resulted from testing items from the short study list immediately preceeding the test list. The longer three retention

intervals resulted from testing items from the long study list in each succeeding test list.

Three-letter word stems corresponding to the first three letters of the study words were presented in both the stem-completion and cued-recall tests. Each test was composed of 44 (22 of the rate pleasantness condition and 22 of the count ascenders condition) stems of the words from the long study list in that block and 16 (8 pleasant and 8 count) stems of all words in the immediately preceding short study list. Test type was alternated with three test lists given in each block. Therefore, one block would contain two stem-completion tests and one cued-recall test, and the other block held two cued-recall tests and one stem-completion test. The beginning test was counterbalanced across participants such that half the participants began with stem completion and half began with cued recall.

Procedure. Participants were tested in individual rooms, each containing an IBM-compatible computer. Before beginning the experiment, participants received five practice items for each study condition. All practice items had stems unique to the experimental word list.

Study and test lists were presented on the computer monitor. Study words were presented on the center of the screen with a symbol indicating which of the two tasks the participant was to complete for that word. If the words were embraced by ! symbols, participants were to rate the pleasantness of the word on a 1-7 scale, with 7 being most pleasant and 1 being least pleasant. If a # symbol was seen, the participant was to count the number of ascending lettes in the word presented. The indicative symbol was presented first for 700 ms. The study word then appeared between the symbols and remained on the screen for 1,400 ms. Participants were given an unlimited amount of time to make a response. All responses during study were made on the number pad of the keyboard.

Before each test list, participants saw instructions on the screen to indicate to them which test they were to complete for that test list. If participants were to do stem completion, they were asked to complete the following stems with the first word that came to mind. For cued recall, participants were instructed to complete the stems with a word from any list they had studied. For both tasks, participants were given the option of responding with xxx if they could not think of an appropriate response. They were encouraged to respond xxr in the cued-recall task if they could not remember a word from a study list that completed the stem; the xxx response was rare in stem completion. Stimulus presentation was the same in both tests. Only instructions differed between test types.

During each test list, three-letter word stems were presented in the center of the screen. Each stem was immediately preceded by a fixation square that cued the participant for the next trial. Stems remained on the screen for 6,300 ms or until participants began typing in the word (which usually occurred first). Participants typed in the full word, not just the completion. Response time (RT) was recorded as the time from stem presentation to time of the first keypress. Participants received a 5-rain break between the two blocks.

Time estimates and scoring. The average times corresponding to these retention conditions were estimated from the number of trials intervening and the duration of those trials. See the Appendix for details.

Typed cued-recall or stem-completion responses that exactly corresponded to the target (previously seen) response were com- puter scored as matches. Typed responses that differed widely from the target response were computer scored as mismatches. Typed responses that were possibly close to the target (shared several letters) were presented to a judge by the scoring program. Minor

Page 6: A Comparison of Forgetting in an Implicit and Explicit Memory Task

376 McBRIDE AND DOSI-IER

misspellings were coded as matches. This scoring procedure was used throughout all experiments.

Results and Discussion

Orienting-task RTs. Response times to judge items were similar across orienting condition. Average response time was 2.574 s to judge pleasantness (semantic condition) and 2.505 s to count ascenders (gmpbemic condition).

Target recall. Figure 3 displays the proportion of the target-production data for stem completion and cued recall broken down by semantic and graphemic study conditions. Data for both Experiment 1 (the first four data points) and Experiment 2 appear on this graph, which shows the full range of retention intervals from 1 to 90 min. The Experi- ment 1 data show that there is a rapid decline in production of the studied (target) word for both test types during the first 15 rain after study. This result is supported by a significant main effect of delay, F(3, 81) = 117.94, MSE = 0.011, p < .001, found in a three-way analysis of variance (A_NOVA) of delay, study condition, and test type for correct responses. Furthermore, the interaction of test type by delay was not significant, F < 1, indicating that the decline in target production was similar for stem-completion and cued recall tests. The ANOVA did reveal a significant main effect of study condition, F(1, 27) = 42.37, MSE = 0.019, p < .001, indicating that test performance was higher overall for the semantic study condition (M = .418) than the graphemic condition (M = .332). No other effects were significant, all Fs < 1.42. (Quantitative fits of forgetting functions are postponed until Experiment 3, in which the full range of retention delays was tested within one experiment.)

Task differences in RT. Table 1 provides mean reaction times for test items for all experiments. Response times are shown for each test type by study type condition and for correct, incorrect, and no-answer response types. A Study Condition X Delay X Test Type ANOVA for response times of correct responses indicated a large effect of test type, F(I , 27) = 16.80, MSE = 0.463, p < .001, such that cued recall

q

a .

d

d Q,.

q C ,

EXPERIMENTS 1 & 2 o Semantic/SC o Graphemic/SC

Semantic/CR

i~: ~ : Graphemic/CR

• ° ~ o _ _ •

i i i i i i

0 2o 40 60 80 1 oo

Retention Interval (min)

Figure 3. Proportion correct data from Experiments 1 and 2 for stem completion (SC) and cued recall (CR) as a function of test delay. The different lines represent study and test conditions.

resulted in much longer response times (M = 1.443) than stem completion (M = 1.180). The difference in average response times between the implicit and explicit tasks reflect substantial differences in processing in the two tasks. 2 Significant effects of correct response times for study condition, F(1, 27) = 8.25, MSE = 0.082, p < .01, and delay, F(3, 81) = 3.08, MSE = 0.111, p < .05 were also evident. Items studied with the semantic task (M = 1.273) werecompleted faster than items studied with the graphemic task (M =- 1.350).

Summary. Experiment 1 results indicated an initial rapid decline in production of the studied word that did not differ between test types, as indicated by the nonsignificant interaction of delay and test type. A typical levels-of- processing effect occurred: Semantic study produced higher test performance than graphemic study. However, this difference was not qualified by a significant Study Condition X Test Type interaction, which is a dissociation frequently found in comparisons of implicit and explicit tasks. Differ- ences in response times for the two tests indicate that to a large degree, participants were engaging in different eogni- five processes to complete stem-completion and cued-recall tests. The conscious processing invoked in cued recall required substantially more time than the processing in- voked in stem completion.

Experiment 2

Experiment 2 was designed to examine longer test delays (approximately 15-90 rain) for stem completion and cued recall. The design included a segmented study list with test lists derived of items from each portion of the study list in order to measure performance at different retention inter- vals. The main purpose of Experiment 2 was to compare performance for the two test types to near baseline levels at later delays on the basis of reports that stem-completion performance falls to baseline levels after a 2-hr delay (C-rift et al., 1984; Squire et al., 1987; Chen & Squire, 1990).

Method

Participants. Twenty-four participants volunteered for Experi- ment 2. Participants were undergraduate students obtained through the psychology subject pool at the University of California at Irvine and received course credit as compensation. All participants were native speakers of English. Twelve participants were randomly assigned to the stem-completion test condition, and 12 were assigned to the cued-recall test conditon. As in Experiment 1, participants were prescreened for typing speed. Thirty-three partici- pants were given a typing test before beginning the experiment. Data from participants with typing speeds lower than 25 words per minute (N = 9) were not included in further analysis.

Materials. Three hundred eighty-four words were used in Experiment 2. Three hundred eighty words were the same words used in Experiment 1. Four more words were then chosen to be similar to those used in the previous experiment. All words began with a unique three-letter stem. As in Experiment 1, each stem had five or more possible completions. A second set of 300 words was

2 Response time distributions are considered in Experiment 3, in which precise quantitative estimates of forgetting are derived.

Page 7: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETHNG

Table 1 Test Item Reaction Times for AU Experiments

Semantic Graphemic

No No Test type Match Incorrect answer Match Incorrect answer

Experiment 1 Stem completion 1.165 1.349 - - 1.194 1.351 Cued recall 1.380 1.666 1.636 1.636 1.507 1.908

Experiment 2 Stem completion 1.113 1.168 - - 1.094 1.177 Cued recall 1.665 1.931 0.768 1.851 1.882 1,047

Experiment 3 Stem completion 1.175 1.234 - - 1.173 1.240 Cued recall 1.585 1.446 1.582 1.512 1.571 1.609

Experiment 4 Recognition 2.101 2.120 ~ 2.154 2.271

Note. Recognition response times include added time for task discrimination. Dashes indicate that answer response times for stem completion are not reported because of few data points per cell.

377

developed for a filler task: vowel counting. Filler words were chosen to be similar in length and familiarity to words used in main tests. All words chosen began with a stem unique with respect to the vowel word list and with respect to the main list. No two words shared the same stem.

Design. All 384 words of the primary set were included in the single long study list. One hundred ninety-two words were studied in the rate pleasantness orienting condition, and 192 words were studied in the count ascenders orienting condition. Testing order was arranged in four test lists, each composed of stems of an equivalent number of words from each of four quarters of the study list. Each participant received a different random order of assign- ment of words to study and test positions, and each word was equally likely to appear in each interval over participants. In order to examine the range of retention intervals of 15-90 rain, a vowel-counting filler task was interposed between test lists. We chose to fill intervals between tests with a word-based activity similar to that carried out during learning and test so as to measure retention intervals under near steady-state conditions. Words were randomly assigned to three lists, 100 items in length, to be interposed between tests. Each participant received a different random assignment of items to list orders.

Procedure. Participants were tested individually on IBM- compatible computers. They were first given 10 practice items for each of the three main tasks: orienting judgments, recall or completions, and vowel counting. Of the 10 orienting judgments, 5 used pleasantness ratings and 5 ascender counting. No practice word duplicated a stem from any other tested word in the experiment.

Study and test lists were presented on the monitor of the computer. Study words were presented for an orienting task judgment in a similar manner to Experiment 1. Words appeared on the center of the screen with a symbol indicating which of the two tasks the participant was to complete. The .t symbol indicated the rate pleasantness task, and the # symbol preceded the count ascending letters task. The time sequence of the study word presentation was the same as in Experiment 1. In Experiment 2, however, all study words were presented in one list before any test lists were shown.

After the presentation of the study list, the first test list was shown. Participants were instructed prior to each test according to the test condition they were assigned. Stem-completion and cued-recall instructions were exactly as they were in Experiment 1, and the test lists were presented as they were in Experiment 1.

Three-letter word stems followed a fixation square shown in the center of the screen.

Following each test list, participants received a lO0-item vowel counting list. For each item, they were asked to count the number of vowels in the word presented and to type in their response on the number pad. The vowel-counting task ensured that participants continued in a similar task during intervals between tests and allowed intervals between study and test of approximately 15-90 min.

~me estimates. Participants were asked to enter the time read off a digital clock at the beginning of and the end of each quartile of the study list. Before and after each test list, participants were prompted to enter in the time read off the digital clock to provide estimates of the study-to-test retention intervals. Estimates of retention delays were made by calculating the average time elapsed between a particular study quartile and particular test list (see the Appendix for details).

Results and Discussion

Orienting-task RZ As in Experiment 1, response times to the study orienting tasks were similar. Participants responded to items in the semantic condition within an average of 2.594 s and to items in the graphemic condition within an average of 2.566 s.

Target recall. The last four points in Figure 3 display the proportion of targets produced for stem completion and cued recall for the semantic and graphemic study conditions for Experiment 2. These data represent performance for delays o f 15-90 rain. These data exhibit a much slower decline in the probability of producing the studied item than the decline seen in Experiment 1 for delays from 1 to 15 min. The decline at long delays was, however, significant, F(3, 66) = 12.30, MSE = 0.005, p < .001, in the three-way ANOVA of delay, study condition, and test type for correct responses. Once again, the effects of test type and the interaction of test type and delay were not significant, Fs < 1.37, indicating that performance declined similarly for stem-completion and cued-recall tasks. Performance did differ, however, for the two study conditions, F(1, 22) =

Page 8: A Comparison of Forgetting in an Implicit and Explicit Memory Task

378 McBRIDE AND DOSHER

17.62, MSE = 0.013, p < .001, such that test performance was again higher for the semantic study condition (M = .273) than the graphemic condition (M = .203). Unlike Experi- ment 1, there was a significant interaction of test type and study condition, F(1, 22) = 8.90, MSE = 0.013, p < .01, such that performance for the two study conditions differed much more for cued recall than for stem completion. No other effects were significant for Experiment 2, all Fs < 1.65.

Our arguments do not depend on specific comparisons to baseline, but rather on the functional form of forgetting over the range of retention intervals. Nonetheless, because they are often cited in prior literature, several t tests were performed to determine if performance for all conditions remained above baseline level (11.5%) at the longest delay. Target production performance was significantly above baseline for all conditions, ts(48) > 6.55, p < .001), except the cued-recall-graphemic condition, t(48) = 1.70, p = .095.

Task effects on RT. Mean reaction times for test items for Experiment 2 can be seen in Table 1. A Study Condi- tion × Delay x Test Type ANOVA for response times of correct responses indicated a significant effect of test type, F(1, 22) = 9.76, MSE = 2.106, p < .01, such that cued recall resulted in longer response times (M = 1.758) than stem completion (M = 1.104). On average, active recall was associated with response times 654 ms longer than stem completion. Significant effects of study condition, F (1, 22) = 5.11, MSE = 0.065, p < .05, and delay, F(3, 66) = 3.85, MSE = 0.090, p < .05 were also found. Items studied with the semantic task (M = 1.389) were completed faster than items studied with the graphemic task (M = 1.472). As in Experiment 1, response times varied with delay, but not consistently. Differential effects of study condition for the two test types were also evident in reaction time data. The interaction of test type and study condition was significant, F(1, 22) = 7.69, MSE = 0.065, p < .05, and indicated that response times differed between study conditions more for cued recall than for stem completion.

Summary. Results from Experiment 2 indicate that during test delays of 15-90 rain, both stem-completion and cued-recall performance decline at a very slow rate. Perfor- mance remained above baseline levels at delays up to 90 rain for all conditions except for cued recall following graphemic orienting at study. Experiment 2 again showed semantic study to produce higher target production performance than graphemic study, but unlike Experiment 1, the levels of processing difference found for cued recall was greater than the difference in stem completion. This result shows the typical dissociation found for implicit and explicit tasks. Test item response times again confirmed that stem comple- tion and cued recall are associated with substantial process- ing differences.

E x p e r i m e n t 3

Experiment 3 evaluates forgetting over a large range within a single experiment, which makes quantitative evalu- ation of the form and rate of forgetting possible. We

expected to generally replicate the results of Experiments 1 and 2, with rapid early decline and slower later decline in target production. Experiment 3 employs a continuous memory design: Retention intervals were based on the number of intervening items between the study and test positions in a long sequence of experimental trials.

M e ~ o d

Participants. Thirty-six participants volunteered for Experi- ment 3. Participants were undergraduate students obtained through the psychology subject pool at the University of California at Irvine and received course credit as compensation. All participants were native speakers of English. Eighteen participants were randomly assigned to the stem-completion test condition, and 18 were assigned to the cued-recall test condition. As in the previous experiments, 42 participants were given a typing test before beginning the experiment. Data from participants with typing speeds lower than 25 words per minute (N = 6) were not included in further analysis.

Materials. The critical word set was the same as the 380 word set used in Experiment 1. The set of 300 filler words was the same set as used in Experiment 2. Each word in the experiment had a unique stem. In this experiment, however, words in the filler vowel-counting task were repeated up to three times.

Design. Half the participants were randomly assigned to cued recall and half to stem completion. Critical study trials were assigned to either the semantic or graphemic orienting instructions. An unstudied condition consisting of 38 words for each participant provided a measure of baseline within the conditions of the experiment. Retention intervals between study and test were set by item lag. The lags were 20, 60, 120, 190, 280, 360, 450, 600, and 800 items in a long, 1,030-trial sequence. Each participant received a new random assignment of words to all conditions, including the baseline condition. An available study position was first randomly chosen among the 1,030 positions, and then a test position was set based on the item's predetermined lag. Occasionally the test position was already filled, and the test was assigned to the trial just before or just after. Unstudied items were given test positions throughout the trial ordering. All critical words were positioned and then filler items were placed to fill all remaining positions in the ordering. The majority of filler items were given the vowel- counting task, but some were assigned as either one of the two study tasks (in later positions) or as the test task (in earlier positions) in order to approximately balance the number of study trials and test trials over the course of the experiment. A slight imbalance resulted from the necessity to place the study trials for long retention-lag conditions in early positions and the test trials in later positions.

Procedure. Before beginning the experimental trials, partici- pants were given 40 practice items, 10 on each task (i.e., rating pleasantness, counting ascenders, counting vowels, and the as- signed test: stem completion or cued recall). Semantic and graphe- mic study tasks were indicated as in previous experiments. For vowel-counting filler items, V symbols indicated the task. Test items were again preceded by a square fixation, and then the three-letter stem appeared at the fixation. For the two test types, the instructions were the same as in previous experiments. As in the previous experiments, participants were given the option of responding with xxx to test items if they could not think of a proper response.

Study symbols appeared for 700 ms, and then the word appeared for 2,200 ms. An additional 780 ms was given in which the participant could make a response, and if no response was made

Page 9: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 379

within this time, the next trial began. For test items, the cue appeared on the screen for 280 ms, and then the stem was shown for 2,570 ms. When the stem disappeared from the screen, participants were then given 100 m s to beg in a response.

Tuning estimates. Participants were prompted to enter the time from a digital clock at the beginning, every 150 trials, and at the end of the experiment. These times were used to determine the average retention delay in minutes for each of the item lags (see the Appendix).

Results and Discussion

Orienting-task RZ As in the previous experiments, response times for the two orienting tasks were similar. Participants responded to items in the semantic task within an average of 2.392 s and to items in the graphemic task within an average of 2.382 s.

Target recall. The main data from Experiment 3 are shown in Figure 4, which displays the proportion of target production for stem completion and cued recall for both semantic and graphemic study conditions. Test delays ranged from approximately 1--60 min. Production of the target for unstudied items (baseline) was 13%. In accordance with Experiments 1 and 2, an initial rapid decline in target production is evident, followed by a much slower decline through later delays.

The effects of delay, F(8, 272) = 72.49, MSE = 0.010, p < .001, and study condition, F(1, 34) = 72.62, MSE = 0.023, p < .001, were again significant. Test performance was higher overall for the semantic study condition (M = .355) than the graphemic condition (M = .254). Analy- sis of variance revealed no effect of test type, F < . 1. The significant interaction of test type and study condition, F(1, 34) = 18.42, MSE = 0.023, p < .001, indicates that the two study conditions differed much more for cued recall than for stem completion. There was a significant interaction be- tween test type and delay, F(8, 272) = 2.81, MSE = 0.010, p < .01, which was mainly due to the high initial perfor- mauce level for the cued-recall-semantic condition. The

• EXPERIMENT 3 o Semantic/SC o Graphemic/SC

• • Semantic/CR ! =~ • Graphemic/CR

i i v t i i i

0 10 20 30 40 50 60 Retention Interval (min)

Figure 4. Proportion correct data from Experiment 3 for stem completion (SC) and cued recall (CR) as a function of test delay. The different lines represent study and test conditions.

interaction between delay and study condition and the three-way interaction were not significant, Fs < 1.80. The meaning of the interaction between test type and delay is clarified below in quantitative analysis of the forgetting functions.

As in Experiment 2, t tests were performed to determine whether performance at the longest delay was above base- line performance. At the longest delay, performance for both stem-completion study conditions was significantly above baseline; semantic: t(17) = 10.68, p < .001; graphemic: t(17) = 4.08, p < .001. For cued recall, the semantic study condition performance was above baseline, t(34) = 6.45, p < .001, but was statistically equivalent to baseline performance for the graphemic study condition, t(34) = 1.94, p = .061.

Task differences in RZ Mean reaction times for test items for Experiment 3 can be seen in Table 1. The ANOVA for response times of correct responses indicated a highly significant effect of test type, F(1, 34) = 16.75, MSE = I •3570, p < .001, such that cued recall responses (M = 1.548) were longer than those for stem completion (M = 1.174). No other effects were found to be significant, all Fs < 1.45.

Response time distributions for each type were deter- mined by calculating seven quantiles of the function for each participant and then averaging the quantiles across partici- pants. This method was first suggested by Ratcliff (1979) as a method for averaging reaction time data across participants with relatively few observations per participant cell. The distribution analyses indicated that approximately 75% of the distributions for the two test types was nonoverlapping (approximately 75% of the stem-completion response times were shorter than 1•35 s, and approximately 75% of the cued-recall response times were longer). The low degree of overlap in the distributions indicates that the processes carded out in the stem-completion and cued-recall tasks were substantially different. The longer stem-completion times might simply reflect the long end of normal stem- completion processes. Similarly, the shorter cued-recall times might simply reflect the short end of normal cued- recall processes. The existence of overlap in the distributions does not by itself suggest that the overlapping portions reflect cross-task processing, but it does place an upper bound on such processing.

Models of forgetting. In order to characterize the func- tional form and rate of forgetting in the two tasks, data from Experiment 3 were fit to a series of descriptive forgetting functions (including simple and complex exponential, log, and power-function forms). The power function yielded a better account for our data, so we restrict our report to this function. (The log function also yielded a reasonable set of fits, which supported the same set of conclusions; see Rubin & Wenzel, 1996.) The power function has been reported as the preferred function in a number of recent treatments of forgetting (e.g., Wixted & Ebbesen, 1991).

Simple power function forgetting takes the form:

P(t) = k t-P,

where P(t) is the probability of recalling the target at time t,

Page 10: A Comparison of Forgetting in an Implicit and Explicit Memory Task

380 McBRIDE AND DOSHER

k is the initial level of encoding, and 13 controls the effect of time, hence reflecting the rate of decay. The simple power function was first fit to the data allowing a separate k and 13 for each of the four conditions: cued-recall semantic and graphemic and stem-completion semantic and graphemic (eight parameters), yielding an R 2 value of .87. 3 A fit with only five parameters that assumed constant loss over time (single 13) fit about as well, yield R 2 = .85. Constraining 13 to be the same for all conditions did not significantly reduce the goodness of the fit, F(3, 31) = 1.34, MSE = 0.004, p > .05. (Test for dropping least squares model parameters; Wanna- cott & Wannacott, 1981.)

Neither the unconstrained nor the constrained variants of the simple power function fit the data particularly well, however. The function had difficulty simultaneously captur- ing the rapid early decline and the later slow decline, assuming the same power function of time over the entire range: The fits tended to overestimate performance at short retention delays and underestimate it at long retention delays. Attempting to compensate for this problem intro- duced a variety of trade-offs in parameter estimates.

The pattern is revealed especially clearly when the data are displayed on log-log ( I n - In) plots. Power-function forgetting should appear as a single straight line with (negative) slope reflecting the 13, or rate of forgetting. Figure 5 shows the data of Experiment 3 on In P(t) versus In t axes. It is evident that the same slope does not characterize the data throughout the range of retention intervals, but rather there was a period of fast decline followed by a period of very slow decline.

A two-process, or composite, power function was de- signed to capture both the fast and slow forgetting regions:

P(t) = max [k t -~, ~/].

The power function h t-~ describes the early rapid period of forgetting; this holds up to about 15 min. The decline over longer retention delays is so shallow as to be essentially flat over the long retention intervals, from about 15 to 60 min, and is characterized as a constant value, "y. We believe that further decline would be visible when measured in hours or days and that forgetting over longer retention intervals might follow a power function, but we could not estimate its characteristics with these data. This rapid period of forget- ting followed by a slow period holds essentially equivalently for the implicit and explicit tasks.

We performed two fits of this composite power function to the data: one that allowed the rate of forgetting in the early region to differ between conditions and one that constrained the rate of forgetting to be the same. The unconstrained fit had 12 parameters: 4 ks, 4 13s, and 4 ~/s, one for each condition. This composite power-function fit was a signifi- cant improvement (R e = .98) over the comparable single process power fit, F(4, 28) = 32.93, MSE = 0.001, p < .01.

More importantly, the constrained composite fit that assumes equal forgetting over retention interval for different study and retrieval tasks (single 13, nine parameters) did not significantly reduce the quality of the fit (R 2 = .97), F(3, 28) = 1.53, MSE = 0.001, ns. All Study × Task conditions

are well described by a single 13 value. In other words, stem completion and cued recall did not differ significantly in either the form or rate of forgetting. Parameter estimates for this constrained nine-parameter fit are listed in Table 2. The estimated 13s in the unconstrained fit ranged from .352 to .479, and were accompanied by compensatory changes in k. These differences are small over the relevant range of retention delays up to 15 min. (It is useful to put these differences in 13 in the unconstrained fit in perspective. Although these values would yield substantial differences in prediction over long retention delays of hours, over the range of retention delays up to 15 or so minutes, the different 13s correspond to differences in predicted values between the constrained and unconstrained model of about 1%. In other words, the constrained fit is not ignoring substantial differ- ences in the conditions.)

The predictions of this constrained compound power function fit are shown as solid lines in Figure 5; the common slopes in the first line segment reflect the common parameter 13. Figure 6 shows the predicted values and the quality of fit on untransformed axes. It should be evident from these figures that the quality of the model fit for all four Study × Task conditions is very high. All model lines closely approximate the data. The assumption of equivalent forget- ting is not producing noticeable misfits in any of the functions.

In summary, the implicit and explicit tasks exhibited a closely comparable form and rate of forgetting. The primary fact is the similarity in overall form of forgetting, including a fast-forgetting portion at short retention intervals up to about 15-20 min and a slower forgetting portion at longer retention intervals up to about 60-90 rain. The secondary fact is the quantitative similarity in rate of forgetting in the fast-forgetting portion of the function. The estimated differ- ences in forgetting rate are small to negligible. We consider both qualifying and supplementary analyses next.

Qualifying analyses. Our experiment may underesti- mate very slightly the forgetting during the later retention periods. The design required that the longest retention intervals correspond to trials that were studied early in the session, whereas the study positions for the shortest reten- tion intervals appeared throughout the session. A collateral analysis evaluated the possible effect of study position. From these analyses, it appears that items learned early in the session had as much as a 2%-4% advantage over those learned later in the session. This covariation in position of study with long retention intervals might have offset true declines of 2%-4% by the longest retention interval. This emphasizes that our characterization of the later retention intervals by a constant level of producing the target, % is a simplification of what is almost surely a very slow loss of availability. However, this possible small underestimate of forgetting does not compromise either the selection of the compound power-function model, or the equivalence of forgetting across tasks.

3 R 2 is corrected for the number of estimated parameters, like a multiple regression adjusted r e .

Page 11: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 381

l l .

t - O

R 2

a . v

e- . J

o

'7, A

V <) <> O ^ P, 0 v

Semantic Stem Completion

~~'~V - V V V v v V . . . . . . . . . . . . . . . . . . . . . . . . . . . - . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

G r ~ h e m i c Stem Completion

I I I

0 2 4

u P ' u u 0

Semantic Cued Recall

~ ~ OOuO 0

Graphemic Cued Recall

I I I

0 2 4

Ln (Time) Figure 5. Experiment 3 data plotted on In P (t) versus In t axes that linearize power-function decay. Each study and task condition is shown in a separate panel. The lines represent the fitted values of the equal 13 composite power-function model.

Supporting analyses. We examined the data of Experi- ments 1 and 2 on the In-In axes of the power-function analysis. The segments from the two experiments do not line up because they represent different participants as well as substantial design differences between the experiments. Nonetheless, these data were consistent in showing a demarkation in the rate of forgetting (slope) for shorter and longer retention intervals.

Table 2 Parameters of the Composite Function for Experiment 3

Condition k 13 7

Stem completion Semantic .675 0.404 .285 Graphemic .571 0.404 .242

Cued recall Semantic .889 0.404 .317 Graphemic .537 0.404 .195

Note. Parameter 13 constant for all conditions, k = initial level of availability; 13 = time constant of decay; 7 = level of availability at longer retention delays.

The data from Experiment 1 showed faster forgetting. We performed two sets of model fits that fully support the contention that rate of forgetting does not differ for the implicit and explicit memory task. Power-function forget- ring models were fit to the data of Experiment 1. One data set corresponded to the data in Figure 3. We were also able to evaluate a six-point (t = 2, 5, 7.5, 11, 15, and 19 rain) retention function with lower sample size for Experiment 1 by sorting trials into six groups by retention delay. For both the four-point and six-point retention functions of Experi- ment 1, a model that allowed four ks and one 13 fit as well as a model allowing 13 to vary with condition, F(3, 11) = 2.11, MSE = 0.002, ns, and F(3, 19) = 0.77, MSE = 0.002, ns, for the four- and six-point analyses. The estimated 13 was 0.362 for the four-point analysis. This is quite similar to [3 of 0.404 in Table 2, given that the participants and design are different. The data and the fitted power-function model of forgetting with identical 13s for the four-point and six-point data are shown on In - In axes in Figures 7 and 8, respectively.

The data from Experiment 2 showed very slow forgetting. These data are shown on In - In axes in Figure 9.

Page 12: A Comparison of Forgetting in an Implicit and Explicit Memory Task

382 McBRIDE AND DOSHER

o

=3

c o. O

R =. 2

D.

O

Semantic Stem Completion

0 o C ^ 0

Semantic Cued Recall

Graphemic Stem Completion

~V ~ ~7 X'7 V ~ V v

Graphemic Cued Recall

C C~

I I I I I I

0 30 60 0 30 60

T ime

Figure 6. Experiment 3 data plotted with the fitted curves of the equal ~ composite power function. Each study and task condition is shown in a separate panel.

Power-function decay models were also fit to the 4-point retention data shown in Figure 3 andto a set of 16-point (t = 6.8, 11.2, 16.2, 21.3, 34.4, 39, 43.9, 49, 58.8, 63.4, 68.3, 73.4, 83, 87.5, 92.5, and 97.7 rain) retention functions, with lower sample size based on sorting the trials by retention intervals. In neither case were the data better fit by a model that allowed 13 to vary, F(3, l l ) = 0.07, MSE -- 0.010, ns, and F(3, 59) = 0.50, MSE = 0.008, ns, for the 4- and 16-point analyses). The estimate of 13 was about 0.1--0.13 for these longer retention delays. (The simplifying assumption of no loss corresponds to a 13 of 0.) The 0.1 13 estimate predicts declines of 3%-4% over retention delays from I5 to 60 rain. This estimate corresponds closely with our estimate of the possible true decline over these retention intervals in Experiment 3 that may have been masked because of study-position contributions.

Summary. Function fits to Experiment 3 data confirmed the qualitative results from Experiments 1 and 2 that stem-completion and cued-recall performance declines rap- idly for test delays up to 15 rain or 20 rain and then declines very slowly for delays from 15 rain or 20 min to 60 min. The primary fact is the overall similarity of this form of forgetting for stem completion and cued recall. Additionally,

data from stem completion and cued recall were very well fit by a single parameter, 13, for rate of forgetting during the rapid-forgetting portion of the forgetting curves. This find- ing was supported by collateral analyses that fit single 13 models to the data of Experiments 1 and 2. Each of these analyses found separately that the data from stem comple- tion and cued recall were very well fit by a single 13 for both the rapid-forgetting portion (Experiment 1) and the slow- forgetting portion (Experiment 2) of the retention function. There is no evidence that stem completion and cued recall reflect information that is being forgotten at different rates. For both tasks, the power-function exponents for the initial rapid-forgetting period were 0.40 and 0.36 (Experiments 3 and 1, respectively). For both tasks, the power-function exponent for the later slow-forgetting period was about 0.10 (Experiment 2). Again, the primary fact is the similarity of forgetting over both the fast- and slow-forgetting portions; secondarily, the pattern of 13s controlling the rate of forget- ting is quite consistent over experiments. Our findings do not extend past retention delays of 90 min, although collateral analyses suggest that our estimates are roughly consistent with estimates taken from the literature for stem completion at 24 and 48 hr. Response time differences between the

Page 13: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 383

O

I° °°

Graphemic Stem Completion

I I I

0 2 4

~ °

Graphemic Cued Recall

I I I

0 2 4

Ln (Time)

Figure 7. Experiment 1 data plotted on In-in axes, with the fitted curves of the equal [3 power function. Each study and task condition is shown in a separate panel.

explicit cued-recall task and the implicit stem-completion task confirm that the participants are processing the stem cues quite differently in the two sots o f data.

Experiment 4

Experiment 4 tested recognition performance using the same design and stimuli as Experiment 3. This design measures retention intervals from about 1 to 60 rain. For reasons articulated earlier, we do not expect recognition performance to b e directly quantitatively comparable to stem-completion performance. We examine recognition be- cause most previous studies in the literature have compared stem completion to recognition. We hope to observe whether forgetting in recognition is generally similar to that in stem completion.

Method

Participants. Eighteen participants volunteered for Experi- ment 4. Participants were undergraduate students obtained through the psychology subject pool at the University of California at Irviue

and received course credit as compensation. All participants were native speakers of English.

Materials and procedure. The study words, test items, and filler items were the same as those used in Experiments 2 and 3.

Design. Experiment 4 was designed in the same way as Experiment 3. Words were again studied under semantic or graphemic conditions, and vowel counting was used as the task for filler items. Test items for all participants in this experiment, however, were recognition items, instead of stem completion or cued recall. As in Experiment 3, retention intervals between study and test were set by item lag: 20, 60, 120, 190, 280, 360, 450, 600, and 800 item lags. Each participant received a different assignment of words to conditions.

Procedure. For test items, participants were shown a word surrounded by question marks. Participants were to respond "Old" (press the / key) if they remembered seeing the test word in a previous task or "New" (press the z key) if the word had not been previously presented.

Forty practice items were again given to participants prior to the experimental trials. Ten items of each task (i.e., rating pleasantness, counting ascenders, counting vowels, and recognition) were pre- sented. During experimental recognition trials, 342 old words and 359 new (unstudied) words were tested. All words (new and old) had unique three-letter stems.

Page 14: A Comparison of Forgetting in an Implicit and Explicit Memory Task

384 McBRIDE AND DOSHER

a .

I- ¢; ._o

R 2

Q.

i-- ._1

o

"7,

"7,

Semantic Stem Completion

Graphemic Stem Completion

I I I

Semantic Cued Recall

Graphemic Cued Recall

I I I

0 2 4 0 2 4

Ln (Time)

Figure 8. Experiment 1 data sorted into six retention delays plotted on In-In axes, with the fitted curves of the equal 13 power function. Each study and task condition is shown in a separate panel.

Time estimates. Again, participants entered the time from a digital clock at the beginning of the experiment, every 150 trials, and at the end of the experiment. Time estimates for retention lags were estimated from these times (see the Appendix for details).

Results and Discussion

Orienting-task RTs. Consistent with previous experi- ments, response times to studied items in the two orienting tasks were similar. Participants responded to items in the semantic study task within an average of 2.416 s and to items in the graphemic study task within an average of 2.443 s.

Target recognition. Proportion-correct data for Experi- ment 4 are shown in Figure 10. The average false-alarm rate for unstudied items was 15%. The proportion corrects are calculated by averaging the hit rates with one minus the common false-alarm rate. The data are broken down by semantic and graphemic study conditions for all test delays (approximately 1-60 rain). Like stem completion and cued recall, recognition performance showed an initial rapid decline in performance, although this decline was not as

pronounced as in the previous tasks for the semantic- orienting condition. The slower decline for recognition in the semantic condition is caused at least in part, and perhaps entirely, by ceiling artifacts. The points marked with crosses in Figure 10 reflect values in which 5-7 participants showed hit proportions of 1. (It may not be obvious why values in the 0.85-0.9 range are contaminated by ceiling artifacts, but all hit rates are combined with a percent correct on unstudied items that averages 0.85, and some participants were not at ceiling.) For delays from 10 min to 60 rain, the recognition task showed little or no decline. Analysis of variance on hit-rate data revealed effects of study condition, F(1, 17) = 108.38, MSE = 0.096, p < .001, and delay, F(8, 136) = 7.75, MSE = 0.014, p < .001. The study condition effect was due to higher test performance for the semantic study condition (M =.789) than the graphemic condition (M-- .432). The interaction of delay and study condition was not significant, F(8, 136) = 1.26, p > .05. This pattern of results is consistent with stem-competion and cued-recall results reported in the previous experiments.

Task differences in RT. Mean reaction times for recogni- tion-test items can be seen in Table 1. No significant effects

Page 15: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 385

A "10

8 ::3

n

._0

& 2 ¢L r -

. . J

0

0

Semantic Stem Completion

Graphemic Stem Completion

I I I

0 2 4

rn

Semantic Cued Recall

Graphemic Cued R e c a l l

I I I

0 2 4

Ln (Time)

Figure 9. Experiment 2 data plotted on ln-ln axes, with the fitted curves of the equal 13 power function. Each study and task condition is shown in a separate panel.

were found in an ANOVA for response times of hits, all Fs < 1.41. As seen in Table 1, recognition response times were generally longer than those for the other two test types. One might generally expect that recognition could be accom- plished more quickly than recall. This peculiarity reflects the fact that in this experimental design, response times also reflect task discrimination. Unlike stem completion and cued recall, where a highly distinctive three-letter word stem was presented, recognition trials were not as distinguishable from other trial types. Recognition response times reflect not just the time to recognize the item, but also include the time the participant required to determine the type of task they were to complete on each trial.

Models of forgetting. The data from Experiment 4 were also fit to a variety of forgetting functions. Only the power-function forgetting models are discussed here. As in stem completion and cued recall, the faster and then slower decline in recognition data is best fit by a composite power function. Figures 11 and 12 display the data and model fits on In - In and untransformed plots, respectively. A simple power-function fit resulted in an R 2 of .98. The composite power function yielded a better fit than a simple power- function model, with an R 2 of .99, F(2, 14) = 8.40, MSE =

0.001, p < .01. Table 3 shows parameter estimates for the full model (six parameters) composite fit. 4 The estimated 13 for semantic orienting condition is certainly contaminated by ceiling artifacts, and should be ignored. The estimated 13 for the graphemic orienting condition is also slower than for the stem completion and cued recall data of Experiment 3, but in the absence of a model relating recall and recognition, we have no basis to predict any specific relationship.

Summary. Although differences in both scale and re- sponse structure prevent the direct, quantitative comparison of recognition performance to stem completion, the recogni- tion data are in several ways similar to the production data. Recognition performance showed an initial, more rapid decline, followed by a slow decline for later retention intervals, as indicated by the superior fit of the composite power function. Recognition also showed the strong effect of orienting task exhibited by cued recall. Although recogni-

4 Because of the ceiling artifacts at early lags, it was necessary to constrain the fits so that the point of segmentation was between the third and fourth retention points of the semantic study data in order to produce stable fits.

Page 16: A Comparison of Forgetting in an Implicit and Explicit Memory Task

386 McBRIDE AND DOSHER

o. ,p.

0

i" 6

¢,D

¢5

m 0

0

o Semantic o Graphemic

D N ~ m o

o \

°~o__o__o~O__o~O--

EXPERIMENT 4

J i i ) i i i

0 10 20 30 40 50 60

Retention Interval (min)

Figure 10. Proportion correct data from Experiment 4 for recog- nition as a function of test delay• The different lines represent study and test conditions.

tion performance is not expected to mirror that of stem completion exactly because of process and scaling differ- ences, forgetting in recognition does not seem to differ strongly in functional form from stem completion.

13 2 t- O 0 t-

O "E

2 0.

¢q,

0

0

Semantic Recognition

v O C C. 0

Graphemic Recognition

V ~7

I I

0 30 60

Time

0

" ¢Y,

8

5 v- i

v ~ - v v

Semantic Recognition

Graphemic Recognition

I I I

0 2 4

Ln (Time)

Figure 11. Data from Experiment 4 plotted on In P(t) versus In t axes that linearize power-function decay. The two study conditions are shown in separate panels. Differences in rate of forgetting 13 between the two conditions are contaminated by ceiling artifacts.

Figure 12. Data from Experiment 4 with the fitted curves of the composite power function. The two study conditions are shown in separate panels.

General Discussion

Levels o f Processing

The manipulation of levels of processing at study by the semantic or graphemic orienting task resulted in a pattern of data frequently found in comparisons of stem completion and cued recall. Cued-recall performance showed a large benefit for semantically studied items over graphemically studied items, but stem-completion performance showed a much smaller difference between study conditions. Nonethe- less, orienting condition did impact stem-completion perfor- mance. The level of processing and test type interaction in Experiments 2-4 (but not Experiment 1) is compatible with recent findings (e.g., Jelicic & Bonke, 1991; Roediger et al., 1992). Challis and Brodbeck (1992) discussed a large number of studies where a levels-of-processing effect was evident in various implicit memory tasks. They argued that experimental design differences (e.g., within- vs. between- subjects study conditions) are the main cause of these effects. In addition, Brown and Mitchell (1994) reviewed 38 studies with levels-of-processing study conditions for im- plicit and explicit tasks. They found that 79% of studies showed greater performance for semantic than graphemic study on implicit tasks. Our results are compatible with these observations.

Page 17: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 387

Table 3 Parameter Estimates of Composite Function for Experiment 4

Condition k 13 ~/

Semantic .928 a 0.086" .772 G-raphemic .655 0.206 .399

Note. k = initial level of availability; 13 = time constant of decay; ",/= level of availability at longer retention delays. aThese values reflect a ceiling artifact at the first retention interval for a subset of participants.

Extrapolating from the measured levels and estimated forgetting rates over the longer retention delays in Experi- ment 2, we estimate that target production in some condi- tions would be approaching baseline levels at around 24--48 hr. This is consistent with observations of near baseline levels in previous literature.

Form and rate of forgetting are apparently equivalent for these closely comparable measures of implicit memory (stem completion) and explicit memory (stem-cued recall). Our claims hold for retention delays up to about 90 rain.

Forgetting in Implicit and Explicit Tasks

Our work considers claims regarding forgetting for the (implicit) perceptual representation system often tested by word-stem completion and the explicit memory system often tested by word recognition or cued recall. We recog- nize that the resulting forgetting claims can be applied to these and highly related task performances and cannot be generalized to other implicit performances, such as classical conditioning, habituation, fear learning, and so forth.

Several previous researchers claimed the existence of differential forgetting in implicit and explicit memory, as measured in stem completion and recognition (Chen & Squire, 1990; Graf et al., 1984; Schacter, 1987; Squire et al., 1987). Others claimed general similarity in rates of forget- ting in stem completion and recognition (Dorfman et al., 1995; Graf & Mandler, 1984). Our review of this literature showed that the existing comparisons of forgetting in these implicit and explicit memory tasks were inconclusive.

Our experiments evaluated forgetting in implicit and explicit memory by comparing stem completion and stem- cued recall, where tasks and performance scales are the same and the values are essentially directly comparable. Over a wide range of retention intervals, from about 1 to about 90 rain, similar rates of forgetting were found in stem comple- tion and cued recall for items studied in a semantic and in a graphemic orienting task. In both stem completion and in cued recall, production of the target (studied) word declined rapidly over the first 15 rain and then declined much more slowly over longer retention delays. The data were well fit with a model that assumed an equivalent forgetting rate [3 for all conditions. (The form of the function is discussed next.) This was true not just in Experiment 3, with retention intervals of 1--60 min, but in collateral analysesof Experi- ment 1, with retention intervals less than 15 rain, and Experiment 2, with retention intervals of about 15-90 min. This was also true in additional collateral analyses of Experiments 1 and 2, using 6-point and 16-point retention functions, respectively.

Although our measurements of forgetting are not perfect, they are substantial improvements over the measurements in prior studies. Our retention intervals encompassed produc- tion probabilities from their maximum levels (estimated to be 60%-90% across conditions) to low levels (about 20%- 30% across conditions), compared to baseline levels be- tween 11%-13%. This corresponds to approximately 75%- 80% of the full range from initial to baseline levels.

Task Differentiation

Our observations on form and rate of forgetting, unlike tests of several previously proposed dissociations, do not require that the implicit and explicit memory tasks be process pure (Jacoby, 1991). If implicit processes dominate performance in stem completion and explicit processes dominate performance in cued recall and if there is a substantial difference in forgetting rate between the two memory types (and only substantial differences would constitute evidence for different memory systems), then differential forgetting in the two task performances should have been evident. We presented evidence that task differen- tiation in these experiments is good. Differences in the time to begin typing the word response were large, averaging 430 ms, and the RT distributions (analyzed for Experiment 3) were substantially nonovedapping.

Next, we consider three possible design issues related to task differentiation. First, although participants in Experi- ment 1 performed both tasks, participants in Experiments 2 and 3 performed either stem completion or stem-cued recall, but not both. The between-subjects design is preferred by some researchers. Our results on forgetting, which were in fact completely consistent over all three experiments, sug- gest that this was not a crucial issue in this case.

Second, Roediger and McDermott (1993) suggest that 50% of tested items be nonstudied in order to minimize use of explicit memory in stem completion. Although this guideline is not met in Experiment 1 or 2, Experiment 3 arguably exceeds these standards. Our 342 critical tests of old items appear in a protocol of some 1,030 trials. This protocol includes not only 38 unstudied target items, but also includes a number of unstudied filler items tested as either stem completion or cued recall, and many filler items being judged in other ways. Furthermore, many of the tests of old target items appear at delays that support levels of target production that are only weakly above baseline, much lower than the levels of production considered by Roediger and McDermott (1993). Again, our results on forgetting were in fact consistent over all three experiments.

Third, in some variants of the stem completion task (e.g., Graf & Schacter, 1987, 1989), cued recall exceeds stem completion for all conditions. In our case, cued recall exceeded stem completion for semantically studied items and was below stem completion for graphemically studied items. These differences flow from differential instructions in our tasks and the variants which axe sometimes used by others. Those variants generally instruct participants to

Page 18: A Comparison of Forgetting in an Implicit and Explicit Memory Task

388 McBRIDE AND DOSHER

recall the target item if possible, but either instruct partici- pants to always generate a word for the stem (Graf & Schacter, 1987, 1989) or do not emphasize the use of a no-response alternative (Bowers & Schacter, 1990). Those instructions result in explicit cued recall measures that are always above the stem-completion measures (because stem completion is the fall-back mode), but they also guarantee that explicit cued-recall measures are contaminated by a substantial number of implicit stem-completion responses. Instead, we encouraged stem-cued recall participants to use the xxx response if they felt that recall was unsuccessful. Consider Experiment 3. That target production in cued recall was consistently above target production in stem completion for semantically studied items is evidence that cued recall and stem completion are being distinguished; cued recall contributes up to 20% additional target productions. That target production in cued recall is consistently below stem completion for items studied graphemically is evidence that cued recall and stem completion are being distinguished. Our instructions succeeded in reducing the contamination of cued recall by stem completion. These results argue for reasonable task differentiation by the participants between stem completion and cued recall.

Finally, in a subsequent project (McBride & Dosher, 1996), we replicate these results using process pure mea- sures derived from the process dissociation procedure of Jacoby (1991). All of these facts argue for adequate task differentiation in our measures of forgetting.

Implications for Memory Systems

Memory systems proponents argue that implicit memory tasks such as stem completion and explicit memory tasks such as cued recall are largely supported by a perceptual representation system and an episodic memory system, respectively. Two possible conclusions follow from our observation that performance in the implicit and explicit memory tasks was indistinguishable in functional form and rate of forgetting.

Functional interpretations. One possibility is that the implicit task and the explicit task reflect slightly different availabilities of information from the same memory trace or traces for words. The same trace or traces may include contextual, conceptual, lexical, and perceptual features of the study exposure that are forgotten at approximately the same rate. Stem completion may focus on the lexical and perceptual features, whereas cued recall may focus on the lexical and conceptual features. The pattern of levels of processing effects would follow from the increased encod- ing of conceptual features during a semantic rather than a graphemic orienting task. This would primarily affect the conceptual features that are differentially accessed during cued recall. (Note that some modest differences in rates of forgetting would still be compatible with this same-trace view, because perceptual and conceptual features in the same trace might nonetheless be differentially exposed to interference from other items.)

Another possibility is that performances in the implicit task and the explicit task reflect availabilities of different

traces in different memory systems, such that those systems have essentially identical functional forms of forgetting and essentially identical rates of forgetting.

Schacter (1987) and others claimed substantially different forgetting dynamics for the systems reflected in implicit and explicit tasks. A finding of strongly differential forgetting would have provided converging evidence for system disso- ciation. Our results do not lend support to this separate- systems view of performance in the implicit and explicit task. Preference for the common-system or for the separate- systems viewpoint must reflect the weight and interpretation given to other, converging evidence. Different aspects of performance concerning system dissociations must be used if one wishes to argue for separate memory systems. In that case, our results strongly constrain our understanding of one key property of the separate systems, if they exist. The forgetting process must be similar.

Model examples. A number of quantitative memory models may be consistent in a general way with our findings. In this section, we outline one example of a model that assumes that the same trace underlies implicit and explicit performance and one example of a model that assumes that different traces underlie implicit and explicit performance.

One example of a model that might instantiate a same trace or same engram view is the application of the matrix model by Humphreys, Bain, and Pike (1989). This model is a distributed composite memory model: Experiences are represented by features (matrix entries), and multiple experi- ences are stored by addition into a composite memory. This multifeature code includes certain features that represent the context in which the event is experienced. This model (Humphreys et al., 1989) strongly distinguishes between tasks that require explicit recollection of a particular event and those that do not. The distinction emphasizes the use of context codes (feature values) at retrieval. In explicit tasks, a context code that represents the intended list or experiment is assembled and becomes part of the query of memory. The use of the context code in retrieval serves to give high weight to the particular experimental or study exposure relative to other experiences of the same item. In implicit tasks, a default or neutral context code is assembled as part of the query to memory. In this case, the particular study exposure receives less weight compared to all the other prior exposures or exposures of related items. Nonetheless, experi- mental exposure may be important, especially if it is recent. This model could predict approximately equivalent forget- ting because the same trace is being accessed, albeit with different weight, in the two tasks.

This model would need to be elaborated and developed in order to account for larger levels of processing effects on explicit than on implicit tasks. The weight given to the experimental exposure (in which levels of processing was manipulated) would be high in explicit retrieval, whereas the weight would be comparatively low in implicit recovery.

One example of a model that might instantiate a different trace or different engram view is the recent retrieving effectively from memory (REM) model of Shiffrin and Steyvers (1997). In this model, memory traces are also collections of features. There are separate episodic and

Page 19: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 389

lexical-semantic traces of experiences such as words. The episodic traces largely underlie explicit performance, and the lexical-semantic traces largely underlie implicit perfor- mance. However, when a word is experienced during preexposure or study, not only is an episodic trace (including context features) created, but those same context features are written into the stable lexical trace. In this model, forgetting occurs becanse of drift over time and changes in situation of the context features. At retrieval, the match of the current context codes to the stored context codes determines the availability of information, and this process of context drift is assumed to be exactly the same in the episodic (explicit) and the lexieal (implicit) traces.

We do not mean to imply here that either of these models would necessarily survive rigorous quantitative evaluation in relation to our data and other data. Such model develop- ments and tests would constitute articles in themselves. We use these models simply to provide examples of existing quantitative models that serve as representatives of the same trace and different trace classes.

Neurological substrates. Neuroanatomical and func- tional analyses of memory (Fuster, 1995; McGaugh, 1995; Squire et al., 1993) have already identified areas and anatomical entities relevant for various memory functions. There is no reason to believe that the processing architecture for memory is not at least as complex as the architecture of the visual system (Van Essen & Anderson, 1995). Our analysis focuses on the assertion of behavioral expression of separate memory systems in the implicit and explicit tasks. In this section, we consider several brain-system hypotheses. These should be considered as mere speculations.

The now-classic dissociation of implicit performances such as priming in stem completion and explicit recollection in amnesic patients suggests the existence of independent systems. Explicit recollection deficits are usually measured as deficits in recognition and recall. (This is often operation- alized in animals in delayed nonmatch to sample tasks.) Most deficits of amnesia are known to be related to damage of the hippocampal system, including the hippocampus, parahippocampal gyms, posterior entorhinal cortex, and anterior entorhinal and perirhinal cortex (Squire & Zola- Morgan, 1988, 1991; Squire, 1992). In contrast, priming is sometimes associated with the neocortex (Squire, 1995). More specifically, word priming has been associated in positron-emission tomography (PET) studies (e.g., Squire et al., 1992) with neocortical areas that may be involved in processing the stimuli themselves.

Explicit memory, then, is associated with a medial temporal--diencephalic system, whereas implicit memory, as expressed in priming, is associated with neocortical sites. If the explicit and implicit memory traces reside in these disparate structures, then the common dynamics of forget- ting may be more coincidental then revealing of system commonalities, s

However, central importance of a brain structure for certain forms of memory does not necessarily imply that the memory traces are stored in those structures. For example, the amygdala is usually associated with emotionally modu- lated memory (Squire & Knowlton, 1994; Squire, 1995).

(This is often operationalized in animals by inhibitory avoidance or escape training.) It is now understood that the function of the amygdala in emotional memory is to modulate the formation or consolidation of memory traces in other brain regions (see McGaugh, 1995, for a review). Lesions of the amygdala several days following inhibitory avoidance training do not affect retention (Liang et al., 1982), whereas lesions applied shortly after training, during a consolidation period, do affect retention.

Likewise, for explicit recognition and recall, it is known from an amnesic individual's ability to retrieve older memo- ries that the older long-term memory traces reside outside the damaged hippocampal system. In this case, the switch away from dependence on the hippocampal system occurs over a longer time period, probably days or weeks. In one view, the medial temporal lobe (hippocampal system) might be the site of storage until representations elsewhere are developed and stabilized. Alternatively, it is possible that the memory trace is distributed over various regions relevant for the processing of the word stimuli of the classic word-based priming tests and that the function of the hippocampal system is to bind or coordinate these distributed sites, which may represent a variety of information including context information that allows a memory to be tied to a particular time and place. To quote Squire (1995), "It remains possible that the posterior cortical locus identified in PET [see Squire et al., 1992], which is important for form-specific word-stem- completion priming, is one of the critical sites that are bound together by the hippocarnpal system" (p. 219). In this view, a largely common set of distributed memory sites may consti- tute the memory trace; these representations control forget- ring. The function of the entorhinal cortical and hippocampal system is to potentiate the further storage or binding of disparate aspects of an experience, especially context infor- mation that is especially essential for explicit tasks. The distributed memory trace is processed or activated some- what differently at the time of retrieval depending on whether the mode of access is explicit or implicit. Measures of brain activity during retrieval, such as PET (e.g., Squire et al., 1992), partially reflect activity in the loci of the memory sites and partially reflect activity in the loci of the brain centers (such as the hippocampus), which are differentially involved during explicit or implicit retrieval activities. Although all these views are speculative, this is the view we favor.

Form of Forgetting

The functional form of the forgetting in stem-completion and cued-recall tasks seen in our experiments was well described by a composite power function, which is character- ized by a faster decay rate during the first 15 min or so

5 Or, perhaps the commonality of forgetting dynamics rather reflects the similarity in representation along with a common neurophysiological mechanism such as the similar molecular and dendritic processes thought to underlie memory consolidation across many systems (Abel, Alberini, Ghirardi, Huang, Nguyen, & Kandel, 1995).

Page 20: A Comparison of Forgetting in an Implicit and Explicit Memory Task

390 McBRIDE AND DOSHER

following exposure and a slower decay rate over longer periods up to 90 mill. Perhaps the closest analog to our finding is a graph of proportion-correct fragment completion (Sloman, Hayman, Ohta, Law, & Tulving, 1988) over delays from 1 to 7 trials in immediate test to tests following 20-min retention intervals. Although their data are noisy, they also observed a fast-forgetting region and then a region of little change. It is possible that these two regions reflect contribu- tions to performance from short-term memory and long-term memory components. However, neither our data nor Sloman et al.'s data force this interpretation.

Most prior studies found simpler functional forms of forgetting, but they also differed in significant ways from the current paradigm, either evaluating recognition or recall in quite different paradigms or over different retention delays. Wixted and Ebbesen (1991) and Anderson (Anderson & Schooler, 1991) use simple power-function decay on several different dependent measures. Rubin and Wenzel (1996) compiled 210 published retention functions and concluded that the logarithmic and power functions were the best of the two-parameter forgetting functions. The logarithmic func- tion fits to our data were quite similar to the power function fits and also required a two-phase form, Wickelgren (1972, 1974) proposed somewhat more complex retention func- tions that combined a simple power function and an exponential component, with d' as the measure of memory strength. Wickelgren rejected the simple power function for his data. Interestingly, he noted a tendency for the rate of forgetting to be slower over hours than minutes and slower over days than hours. Indow (1995) has evaluated retention curves over hours and years and supports a more complex form, the Weibull. At this point, we do not wish to rule out any of these slightly more complex functions as possible descriptions under the correct transformation of our memory measure, probability of target production.

Remaining Questions

We conclude that both the functional form and the rate of forgetting are substantially the same for the implicit memory task, stem completion, and the explicit memory task, cued recall. Subsequent work might (a) measure the rate of decay over longer time periods up to several days or weeks; (b) evaluate the generalizability of these findings to other implicit memory tasks, such as fragment completion or perceptual identification, and their corresponding explicit tasks; (c) evaluate forgetting for conscious and unconscious measures of memory (Jacoby, 1991; that is the topic of a subsequent article); and (d) evaluate the generality of the two-phase (composite power function) forgetting functions in recall.

References

Abel, T., Alberini, C., Ghirardi, M., Huang, Y-Y., Nguyen, E, & Kandel, E. R. (1995). Steps toward a molecular definition of memory consolidation. In D. L. Schacter (Ed.), Memory distor- tions: How minds, brains, and societies reconstruct the past (pp. 298-325). Cambridge, MA: Harvard University Press.

Anderson, J. R., & Schooler, L. J. (1991). Reflections of the environment in memory. Psychological Science, 2, 396-408.

Bowers, J. S., & Schacter, D. L. (1990). Implicit memory and test awareness. Journal of Experimental Psychology: Learning, Memory, and Cognition, 16, 404--416.

Brown, A.S., & Mitchell, D. B. (1994). A reevaluation of semantic versus monsemantic processing in implicit memory. Memory and Cognition, 22, 533-541.

Buchner, A., Erdfelder, E., & Vaterrodt-Pltinnecke, B. (1995). Toward unbiased measurement of conscious and unconscious memory processes within the process dissociation framework. Journal of Experimental Psychology: General, 124, 137-160.

Challis, B. H., & Brodbeck, D. R. (1992). Level of processing affects priming in word fragment completion. Journal of Experi- mental Psychology: Learning, Memory, and Cognition, 18, 595--607.

Cben, K. S., & Squire, L. R. (1990). Strength and duration of word-completion priming as a function of word repetition and spacing. Bulletin of the Psychonomic Society, 28, 97-100.

Cowan, N., & Stadler, M. A. (1996). Estimating unconscious processes: Implications of a general class of models. Journal of Experimental Psychology: General, 125, 195-200.

Curran, T., & Hintzman, D. L. (1995). Violations of the indepen- dence assumption in process dissociation. Journal of Experimen- tal Psychology: Learning, Memory, and Cognition, 21, 531-547.

Doffman, J., Kihlstrom, J. E, Cork, R. C., & Misiaszek, J. (1995). Priming and recognition in ECT-induced amnesia. Psychonomic Bulletin and Review, 2, 244-248.

Dosber, B. A., & Rosedale, G. (1991). Judgments of semantic and episodic relatedness: Common time-course and failure of segre- gation. Journal of Memory and Language, 30, 125-160.

Fuster, J. M. (1995). Memory in the cerebral cortex: An empirical approach to neural networks in the human and nonhuman primate. Cambridge, MA: MIT Press.

Gillund, G., & Shiffrin, R. M. (1984). A retrieval model for both recognition and recall. PsychologicalReview, 91, 1-67.

Graf, E, & Mandler, G. (1984). Activation makes words more accessible, but not necessarily more retrievable. Journal of Verbal Learning and Verbal Behavior, 23, 553-568.

Graf, E, & Schacter, D. L. (1987). Selective effects of interference on implicit and explicit memory for new associations. Journal of Experimental Psychology: Learning, Memory, and Cognition, 13, 45-53.

Graf, E, & Schacter, D. L. (1989). Modality specificity of implicit memory for new associations. Journal of Experimental Psychol- ogy: Learning, Memory, and Cognition, 15, 3-12.

Graf, P., Squire, L. R., & Mandier, G. (1984). The information that amnesic patients do not forget. Journal of Experimental Psychol- ogy: Learning, Memory, and Cognition, 10, 164-178.

Hamann, S., & Squire, L. R. (1996). Priming in the absence of declarative memory. Abstracts of the Psychonomic Society, 1, 67.

Hashtroudi, S., Ferguson, S. A., Rappold, V. A., & Chrosniak, L. D. (1988). Data-driven and conceptually driven processes in partial- word identification and recognition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 14, 749-757.

Hirst, W. (1989). On consciousness, recall, recognition, and the architecture of memory. In S. Lewandowsky, J. C. Duma, & K. Kirsner (Eds.), Implicit memory: Theorectical issues (pp. 33- 46). Hillsdale, NJ: Erlbaum.

Hirst, W., Johnson, M. K., Phelps, E. A., & Volpe, B. T. (1988). More on recognition and recall in anmesics. Journal of Experi- mental Psychology: Learning, Memory, and Cognition, 14, 758-762.

Hirst, W., Johnson, M. K., Phelps, E. A., Risse, G., & Volpe, B. T.

Page 21: A Comparison of Forgetting in an Implicit and Explicit Memory Task

IMPLICIT AND EXPLICIT FORGETTING 391

(1986). Recognition and recall in amnesics. Journal of Experi- mental Psychology: Learning, Memory, and Cognition, 12, 445-451.

Humphreys, M. S., Bain, J. D., & Pike, R. (1989). Different ways to cue a coherent memory system: A theory for episodic, semantic and procedural tasks. Psychological Review, 96, 208-233.

Indow, T. (1995). Retrieval sequences and retention curves: An analysis based on extreme statistics (Tech. Memo. No. MBS-95- 31). Irvine: University of California, Institute for Mathematical Behavioral Sciences.

Jacoby, L. L. (1991). A process dissociation framework: Separating automatic from intentional uses of memory. Journal of Memory and Language, 30, 513-541.

Jacoby, L. L., & Witherspoon, D. (1982). Remembering without awareness. Canadian Journal of Psychology, 36, 300-324.

Jelicic, M., & Bonke, B. (1991). Level of processing affects performance on explicit and implicit memory tasks. Perceptual and Motor Skills, 72, 1263-1266.

Joordens, S., & Merikle, P. M. (1993). Independence or redun- dancy? Two models of conscious and unconscious influences. Journal of Experimental Psychology: General, 122, 462-467.

Liang, K. C., McGaugh, J. L., Martinez, J. L., Jensen, R. A., Vasquez, B. J., & Messing, R. B. (1982). Post training amygda- loid lesions impair retention of an inhibitory avoidance response. Behavioral Brain Research, 4, 237-249.

McBride, D. M., & Dosher, B. A. (1996). Forgetting in process dissociation: Familiarity and recollection. Abstracts of the Psychonomic Society, 1, 15.

McGaugh, J. L. (1995). Emotional activation, neuromodulatory systems, and memory. In D. L. Schacter (Ed.), Memory distor- tions: How minds, brains, and societies reconstruct the past (pp. 255-273). Cambridge, MA: Harvard University Press.

Nadel, L. (1994). Multiple memory systems: What and why, an update. In D. Schacter & E. Tulving (Eds.), Memory systems 1994 (pp. 39-63). Cambridge, MA: MIT Press.

Paller, K. A. (1990). Recall and stem-completion priming have different electrophysiological correlates and are modified differ- entially by directed forgetting. Journal of Experimental Psychol- ogy: Learning, Memory, and Cognition, 16, 1021-1032.

Raaijmakers, J. G., & Shiffrin, R. M. (1981). Search of associative memory. Psychological Review, 88, 93-134.

Ratcliff, R. (1979). Group reaction time distributions and an analysis of distribution statistics. Psychological Bulletin, 86, 446-461.

Richardson-Klavehn, A., & Bjork, R. A. (1988). Measures of memory. Annual Review of Psychology, 39, 475-543.

Roediger, H. L. (1990). Implicit memory: Retention without remembering. American Psychologist, 45, 1043-1056.

Roediger, H. L., & McDermott, K. B. (1993). Implicit memory in normal human subjects. In E Boiler & J. Grafman (Eds.), Handbook ofneuropsychology (Vol. 8, pp. 63-131). Amsterdam: Elsevier.

Roediger, H. L., Weldon, M. S., Stadler, M. L., & Riegler, G. L. (1992). Direct comparison of two implicit memory tests: Word fragment and word stem completion. Journal of Experimental Psychology: Learning, Memory, and Cognition, 18, 1251-1269.

Rubin, D. C., & Wenzel, A. E. (1996). One hundred years of forgetting: A quantitative description of retention. Psychological Review, 103, 734-760.

Schacter, D. L. (1987). Implicit memory: History and current status. Journal of Experimental Psychology: Learning, Memory, and Cognition, 13, 501-518.

Schacter, D. L. (1992). Priming and multiple memory systems:

Perceptual mechanisms of implicit memory. Journal of Cogni- tive Neuroscience, 4, 244-256.

Schacter, D. L., & Tulving, E. (1994). What are the memory systems of 1994? In D. Schacter & E. Tulving (F_,ds.), Memory systems 1994 (pp. 1-38). Cambridge, MA: MIT Press.

Shiffrin, R. M., & Steyvers, M. (1997). Model for recognition memory: REM--Retfieving effectively from memory. Psycho- nomic Bulletin and Review, 4, 145-166.

Sloman, S. A., Hayman, C. G., Ohta, N., Law, J., & Tulving, E. (1988). Forgetting in primed fragment completion. Journal of Experimental Psychology: Learning, Memory, and Cognition, 14, 223-239.

Squire, L. R. (1992). Memory and the hippocampus: A synthesis from findings with rats, monkeys, and humans. Psychological Review, 99, 195-231.

Squire, L. R. (1994). Declarative and nondeclarative memory: Multiple brain systems supporting learning and memory. In D. Schacter & E. Tulving (Eds.), Memory systems 1994 (pp. 203-231). Cambridge, MA: MIT Press.

Squire, L. R. (1995). Biological foundations of accuracy and inaccuracy in memory. In D. Schacter (Eds.), Memory distor- tions: How minds, brains, and societies reconstruct the past (pp. 197-225). Cambridge, MA: Harvard University Press.

Squire, L. R., & Knowlton, B. J. (1994). Memory, hippocampus, and brain systems. In M. S. Gazzaniga (Ed.), The cognitive neurosciences (pp. 825-837). Cambridge, MA: MIT Press.

Squire, L. R., Knowlton, B., & Musen, G. (1993). The structure and organization of memory. Annual Review of Psychology, 44, 453-495.

Squire, L. R., Ojemarm, J. G., Miezin, E M., Petersen, S. E., Videen, T. O., & Raichle, M. E. (1992). Activation of the hippocampus in normal humans: A functional anatomical study of memory. Proceedings of the National Academy of Sciences, USA, 89, 1837-1841.

Squire, L. R., Shimamura, A. P., & Graf, P. (1987). Strength and duration of priming effects in normal subjects and amnesic patients. Neuropsychologia, 25, 195-210.

Squire, L. R., & Zola-Morgan, S. (1988). Memory: Brain systems and behavior. Trends in Neurosciences, 11, 170-175.

Squire, L. R., & Zola-Morgan, S. (1991). The medial temporal lobe memory system. Science, 253, 1380-1386.

Thapar, A., & Greene, R. L. (1994). Effects of level of processing on implicit and explicit tasks. Journal of Experimental Psychol- ogy: Learning, Memory, and Cognition, 20, 671-679.

Tulving, E., & Schacter, D. L. (1990). Priming and human memory systems. Science, 247, 301-306.

Tulving, E., Schacter, D. L., & Stark, H. A. (1982). Priming effects in word-fragment completion are independent of recognition memory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 8, 336-342.

Van Essen, D. C., & Anderson, C. H. (1995). Information processing strategies and pathways in the primate visual system. In S. E Zornetzer, J. L. Davis, C. Lau, & T. McKenna (Eds.), An introduction to neural and electronic networks (2nd ed.)--- Neural networks: Foundations to applications (pp. 45-76). San Diego, CA: Academic Press.

Wannacott, T. H., & Wannacott, R. J. (1981). Regression: A second course in statistics. NY: Wiley.

Warrington, E. K., & Weiskrantz, L. (1970). Amnesic syndrome: Consolidation or retrieval? Nature, 228, 628-630.

Warrington, E. K., & Weiskrantz, L. (1974). The effect of prior learning on subsequent retention in amnesic patients. Neuropsy- chologia, 12, 419--428.

Page 22: A Comparison of Forgetting in an Implicit and Explicit Memory Task

392 McBRIDE AND DOSHER

Weber, E. U., & Murdock, B. B. (1989). Priming in a distributed memory system: Implications for models of implicit memory. In S. Lewandowsky, J. C. Dunn, & K. Kirsner (Eds.), Im- plicit memory: Theoretical issues (pp. 87-98). Hillsdale, NJ: Erlbaum.

Wickelgren, W. A. (1972). Trace resistance and the decay of

long-term memory. Journal of Mathematical Psychology, 9, 418-455.

Wickelgren, W. A. (1974). Single trace fragility theory of memory dynamics. Memory and Cognition, 2, 775-780.

Wixted, J. J., & Ebbesen, E. B. (1991 ). On the form of forgetting. Psychological Science, 6, 409-415.

Appendix

Estimates of Retention Delays

Details of test-delay estimates for Figures 1 and 2 and the current experiments. Unless otherwise noted, time estimates were made by adding half the study list time to half of the test list time and any other tasks (e.g., distracters, instructions, etc.) to get average study-test delay times.

Roediger, Weldon, Stadler, & Riegler (1992): Experiment 2 Study list of 68 items at 7 s per item Test list of 68 items at 12 s per item Distractor task for 6 min Reported delays of 0, 2, and 48 hr Average study-test delay = 17 min Estimated study-test delays = 17 min, 137 min, and 2 days.

Squire, Shimamura, & Graf (1987): Experiment 1 (normal controls only)

Study list of 18 items at 5 s per item Test list of 18 items at 3 s per item Instruction time of 1 min Reported delays of 0, 2, and 96 hr Average study-test delay = 3 min Estimated study-test delays = 3 min, 123 min, and 4 days.

Chen & Squire (1990) (data read off Figure 1, 4 repetition condition)

Study list of 520 items at 2.5 s per item Test list 100 items at 2 s per item (time per item estimated; no time

given) Reported delays of O, 2, and 24 hr Average study-test delay = 13 min Estimated study-test delays = 13 min, 133 min, and 24 hr.

Graf& Mandler (1984): Experiment 2 Study list of 60 items at 3 s per item Test list of 10 items at 4 s per item Reported delays of 0, 20, and 90 min Average study-test delay = 2 min Estimated study-test delays = 2, 22, and 92 min

Graf, Squire, & Mandler (1984): Experiment 2 (data read from Figure 2 and averaged over encoding condition

for inpatient subjects only). Study list of 20 items at 2.5 s per item Test list of 40 items at 2 s per item Instructions for 1 min Reported delays of 0, 15, and 120 min

Average study-test delay = 3 min Estimated study-test delays = 3, 18, and 123 min.

Hashtroudi, Ferguson, Rappold, & Chrosniak (1988): Experiment 3

Study list of 31 items at 4 s per item Test list of 54 items at 7 s per item Reported delays of 0, 1, and 24 hr Average study-test delay = 4 min Estimated study-test delays = 4 min, 64 min, and 24 hr.

Experiment 1 Long study of 132 items at 3.2 s per item Short study of 16 items at 3.2 s per item Test lists of 60 items at 2.77 s per item (includes typing time) Instruction and rest before each test 1.2 min Estimated study-test delays = 3 min, 7 min, 12.1 min, and 17.2

min.

Experiment 2 Participants recorded times at the beginning of the study list and at the beginning and end of each test block. Times for the study and test items per participant were calculated and then averaged across participants. Average study time was 3.3 s per item Average test time ws 3.3 s per item Intervening vowel blocks of 23 min, 19 min, and 19 min Estimated study-test delays = 13.2, 41.5, 65.8, and 90.0 min

Experiment 3 Times were recorded as in Experiment 2 after every 150 trials. Average time per item across participants = 4.8 s Estimated study-test delays = 1.6, 4.7, 9.4, 14.9, 21.9, 28.2, 35.3,

47.0, and 62.7 rain.

Experiment 4 Times Were recorded as in Experiment 3 every 150 trials. Average time per item across participants = 4.6 s Estimated study-test delays = 1.5, 4.6, 9.2, 14.7, 21.6, 27.7, 34.7,

46.2, and 61.6 min.

Received July 23, 1996 Revision received December 15, 1996

Accepted February 7, 1997 •