Published in Behavioral and Brain Sciences
Volume 25, Number 6: 657-674 (December 2002)

© 2002 Cambridge University Press


Below is the unedited, uncorrected, unquotable final draft preprint of a BBS target article that was accepted for publication. To order the final published version of this target article, commentaries and author's response, please visit the BBS Homepage at Cambridge Journals Online.


The cognitive functions of language

 

Peter Carruthers
Department of Philosophy
University of Maryland
College Park, MD 20742
pc154@umd.edu
www.shef.ac.uk/~phil/department/staff/Carruthers.html

http://http://www.philosophy.umd.edu/people/faculty/pcarruthers/

 

Abstract: This paper explores a variety of different versions of the thesis that natural language is involved in human thinking. It distinguishes amongst strong and weak forms of this thesis, dismissing some as implausibly strong and others as uninterestingly weak. Strong forms dismissed include the view that language is conceptually necessary for thought (endorsed by many philosophers) and the view that language is de facto the medium of all human conceptual thinking (endorsed by many philosophers and social scientists). Weak forms include the view that language is necessary for the acquisition of many human concepts, and the view that language can serve to scaffold human thought processes. The paper also discusses the thesis that language may be the medium of conscious propositional thinking, but argues that this cannot be its most fundamental cognitive role. The idea is then proposed that natural language is the medium for non-domain-specific thinking, serving to integrate the outputs of a variety of domain-specific conceptual faculties (or central-cognitive ‘quasi-modules’). Recent experimental evidence in support of this idea is reviewed, and the implications of the idea are discussed, especially for our conception of the architecture of human cognition. Finally, some further kinds of evidence which might serve to corroborate or refute the hypothesis are mentioned. The overall goal of the paper is to review a wide variety of accounts of the cognitive function of natural language, integrating a number of different kinds of evidence and theoretical consideration in order to propose and elaborate the most plausible candidate.

 

Keywords: cognitive evolution, conceptual module, consciousness, domain-general, inner speech, logical form (LF), language, thought.

 

1            Introduction

Natural language looms large in the cognitive lives of ordinary folk. Although proportions vary, many people seem to spend a good deal of their waking activity engaged in ‘inner speech’, with imaged natural language sentences occupying a significant proportion of the stream of their conscious mentality.

This bit of folk-wisdom has been corroborated by Hurlburt (1990, 1993), who devised a method for sampling people’s inner experience. Subjects wore headphones during the course of the day, through which they heard, at various intervals, a randomly generated series of bleeps. When they heard a bleep, they were instructed to immediately ‘freeze’ what was passing through their consciousness at that exact moment and then make a note of it, before elaborating on it later in a follow-up interview. Although frequency varied widely, all normal (as opposed to schizophrenic) subjects reported experiencing inner speech on some occasions – with the minimum being 7% of occasions sampled, and the maximum being 80%. Most subjects reported inner speech on more than half of the occasions sampled. (The majority of subjects also reported the occurrence of visual images and emotional feelings – on between 0% and 50% of occasions sampled in each case). Think about it: more than half of the total set of moments which go to make up someone’s conscious waking life occupied with inner speech - that’s well nigh continuous!

            Admittedly, the sample-sizes in Hurlburt’s studies were small; and other interpretations of the data are possible. (Perhaps the reports of linguistically-clothed thoughts occurring at the time of the beep were a product of confabulation, for example, reflecting people’s naďve theory that thought must be in natural language. If so, this should be testable.) But let us suppose that inner verbalization is as ubiquitous as common-sense belief and Hurlburt’s data would suggest. Just what would all this inner verbalization be doing? What would be its function, or cognitive role? The naďve common-sense answer is that inner verbalization is constitutive of our thinking – it is that we think by talking to ourselves in inner speech (as well as by manipulating visual images etc.). Anyone who holds such a view endorses a version of what I shall call ‘the cognitive conception of language’, which maintains that, besides its obvious communicative functions, language also has a direct role to play in normal human cognition (in thinking and reasoning).

            Quite a different answer would be returned by most members of the cognitive science community, however. For they endorse what I shall call ‘the (purely) communicative conception of language’, according to which language is but an input–output system for central cognition. Believing that language is only a channel, or conduit, for transferring thoughts into and out of the mind, they are then obliged to claim that the stream of inner verbalization is more-or-less epiphenomenal in character. (Some possible minor cognitive roles for inner speech, which should nevertheless be acceptable to those adopting this perspective, will be canvassed later.) The real thinking will be going on elsewhere, in some other medium of representation.

            One reason for the popularity of the communicative conception amongst cognitive scientists is that almost all now believe that language is a distinct input–output module of the mind (at least in some sense of ‘module’, if not quite in Fodor’s classic - 1983 – sense). And they find it difficult to see how the language faculty could both have this status and be importantly implicated in central cognition. But this reasoning is fallacious. For compare the case of visual imagination. Almost everyone now thinks that the visual system is a distinct input-module of the mind, containing a good deal of innate structure. But equally, most cognitive scientists now accept that visual imagination re-deploys the resources of the visual module for purposes of reasoning – for example, many of the same areas of the visual cortex are active when imagining as when seeing. (For a review of the evidence, see Kosslyn, 1994.)

What is apparent is that central cognition can co-opt the resources of peripheral modules, activating some of their representations to subserve central cognitive functions of thinking and reasoning. The same is then possible in connection with language. It is quite consistent with language being an innately structured input and output module, that central cognition should access and deploy the resources of that module when engaging in certain kinds of reasoning and problem solving.

Note, too, that hardly anyone is likely to maintain that visual imagery is a mere epiphenomenon of central cognitive reasoning processes, playing no real role in those processes in its own right. On the contrary, it seems likely that there are many tasks which we cannot easily solve without deploying a visual (or other) image. For example, suppose you are asked (orally) to describe the shape which is enclosed within the capital letter ‘A’. It seems entirely plausible that success in this task should require the generation of a visual image of that letter, from which the answer (‘a triangle’) can then be read off. So it appears that central cognition operates, in part, by co-opting the resources of the visual system to generate visual representations, which can be of use in solving a variety of spatial-reasoning tasks. And this then opens up the very real possibility that central cognition may also deploy the resources of the language system to generate representations of natural language sentences (in ‘inner speech’), which can similarly be of use in a variety of conceptual reasoning tasks.

There is at least one further reason why the cognitive conception of language has had a bad press within the cognitive science community in recent decades. (It continues to be popular in some areas of the social sciences and humanities, including philosophy.) This is that many of the forms of the thesis which have been defended by philosophers and by social scientists are implausibly strong, as we shall see in section 3 below. The unacceptability of these strong views has then resulted in all forms of the cognitive conception being tarred with the same brush.

A crucial liberalizing move, therefore, is to realize that the cognitive conception of language can come in many different strengths, each one of which needs to be considered separately on its own merits. In this paper I shall distinguish between some of the many different versions of the cognitive conception. I shall begin (in section 2) by discussing some weak claims concerning the cognitive functions of language which are largely uncontroversial. This will help to clarify just what (any interesting form of) the cognitive conception is, by way of contrast. I shall then (in section 3) consider some claims which are so strong that cognitive scientists are clearly right in rejecting them, before zeroing in on those which are both interesting and plausible (in sections 4 and 5). I shall come to focus, in particular, on the thesis that natural language is the medium of inter-modular integration. This is a theoretical idea which has now begun to gather independent empirical support. Finally (in sections 6 and 7) some additional implications, elaborations, and possible further empirical tests of this idea are discussed.

I should explain at the outset, however, that the thesis I shall be working towards is that it is natural language syntax which is crucially necessary for inter-modular integration. The hypothesis is that non-domain-specific thinking operates by accessing and manipulating the representations of the language faculty. More specifically, the claim is that non-domain-specific thoughts implicate representations in what Chomsky (1995) calls ‘Logical Form’ (LF). Where these representations are only in LF, the thoughts in question will be non-conscious ones. But where the LF representation is used to generate a full-blown phonological representation (an imaged sentence), the thought will generally be conscious.[1]

I should emphasize that I shall not be claiming that syntax is logically required for inter-modular integration, of course. Nor shall I be claiming that only natural language syntax – with its associated recursive and hierarchical structures, compositionality, and generativity – could possibly play such a role in any form of cognition, human or not. (In fact it is the phrase-structure element of syntax which does the work in my account; see section 6.1 below.) Rather, my claim will be that syntax does play this role in human beings. It is a factual claim about the way in which our cognition happens to be structured, not an unrestricted modal claim arrived at by some sort of task-analysis.

I should also declare at the outset how I shall be using the word ‘thought’ in this paper. Unless I signal otherwise, I intend all references to thought and thinking to be construed realistically. Thoughts are discrete, semantically-evaluable, causally-effective states, possessing component structure, and where those structures bear systematic relations to the structures of other, related, thoughts. So distinct thoughts have distinct physical realizations, which may be true or false, and which cause other such thoughts and behavior. And thoughts are built up out of component parts, where those parts belong to types which can be shared with other thoughts. It is not presupposed, however, that thoughts are borne by sentence-like structures. Although I shall be arguing that some thoughts are carried by sentences (viz. non-domain-specific thoughts which are carried by sentences of natural language), others might be carried by mental models or mental images of various kinds.

It is hugely controversial that there are such things as thoughts, thus construed, of course. And while I shall say a little in defense of this assumption below (in section 3.3), for the most part it is just that – an assumption – for present purposes. I can only plead that one can’t do everything in one paper, and that one has to start somewhere. Those who don’t want to share this assumption should read what follows conditionally: if we were to accept that there are such things as realistically-construed thoughts, then how, if at all, should they be seen as related to natural language sentences?

Finally, a word about the nature of the exercise before we proceed further. This paper ranges over a great many specialist topics and literatures in a number of distinct disciplines. Of necessity, therefore, our discussion of any given subject must be relatively superficial, with most of the detail, together with many of the required qualifications and caveats, being omitted. Similarly, my arguments against some of the competitor theories are going to have to be extremely brisk, and some quite large assumptions will have to get taken on board without proper examination. My goal, here, is just to map out an hypothesis space, using quite broad strokes, and then to motivate and discuss what I take to be the most plausible proposal within it.

 

2          Weak claims

Everyone will allow that language makes some cognitive difference. For example, everyone accepts that a human being with language and a human being without language would be very different, cognitively speaking. In this section I shall outline some of the reasons why.

 

2.1            Language as the conduit of belief

Everyone should agree that natural language is a necessary condition for human beings to be capable of entertaining at least some kinds of thought. For language is the conduit through which we acquire many of our beliefs and concepts, and in many of these cases we could hardly have acquired the component concepts in any other way. So concepts which have emerged out of many years of collective labor by scientists, for example – such as electron, neutrino, and DNA – would de facto be inaccessible to someone deprived of language. This much, at any rate, should be obvious. But all it really shows is that language is required for certain kinds of thought; not that language is actually involved in or is the representational vehicle of those thoughts.

            It is often remarked, too, that the linguistic and cognitive abilities of young children will normally develop together. If children’s language is advanced, then so will be their abilities across a range of tasks; and if children’s language is delayed, then so will be their cognitive capacities. To cite just one item from a wealth of empirical evidence: Astington (1996) and Peterson and Siegal (1998) report finding a high correlation between language-ability and children’s capacity to pass false-belief tasks, whose solution requires them to attribute, and reason from, the false belief of another person. Does this and similar data show that language is actually involved in children’s thinking?

In the same spirit, we might be tempted to cite the immense cognitive deficits which can be observed in those rare cases where children grow up without exposure to natural language. Consider, for example, the cases of so-called ‘wolf children’, who have survived in the wild in the company of animals, or of children kept by their parents locked away from all human contact (Malson, 1972; Curtiss, 1977). Consider, also, the cognitive limitations of profoundly deaf children born of hearing parents, who have not yet learned to sign (Sachs, 1989; Schaller, 1991). These examples might be thought to show that human cognition is constructed in such a way as to require the presence of natural language if it is to function properly.

            But all that such data really show is, again, that language is a necessary condition for certain kinds of thought and types of cognitive process; not that it is actually implicated in those forms of thinking. And this is easily explicable from the standpoint of someone who endorses the standard cognitive science conception of language, as being but an input–output system for central cognition, or a mere communicative device. For language, in human beings, is a necessary condition of normal enculturation. Without language, there are many things which children cannot learn; and with delayed language, there are many things which children will only learn later. It is only to be expected, then, that cognitive and linguistic development should proceed in parallel. It does not follow that language is itself actually used in children’s central cognition.

            Another way of putting the point is that this proposed cognitive function of language is purely developmental - or diachronic - rather than synchronic. Nothing is said about the role of language in the cognition of adults, once a normal set of beliefs and concepts has been acquired. And the evidence from aphasia suggests that at least many aspects of cognition can continue to operate normally once language has been removed.

            Aphasias come in many forms, of course, and in many different degrees of severity. And it is generally hard to know the extent of any collateral damage – that is, to know which other cognitive systems besides the language faculty may have been disabled as a result of the aphasia-causing brain-damage. But many patients with severe aphasia continue to be adept at visuo-spatial thinking, at least (Kertesz, 1988), and many continue to manage quite well for themselves in their daily lives.

            Consider, for example, the a-grammatic aphasic man studied in detail by Varley (1998, 2002). He is incapable of either producing or comprehending sentences, and he also has considerable difficulty with vocabulary, particularly verbs. He has lost all mentalistic vocabulary (‘belief’, ‘wants’, etc.), and his language system is essentially limited to nouns. Note that there is not a lot of explicit thinking that you can do using just nouns! (It should also be stressed that he has matching deficits of input and output, suggesting that it is the underlying system of linguistic knowledge which has been damaged). Yet he continues to drive, and to have responsibility for the family finances. He is adept at communicating, using a mixture of single-word utterances and pantomime. And he has passed a range of tests of theory of mind (the standard battery of false-belief and deception tasks, explained using nouns and pantomime), as well as various tests of causal thinking and reasoning. It appears that, once language has done its developmental work of loading the mind with information, a good deal of adult cognition can thereafter survive its loss.

            Since natural language is the conduit for many of our beliefs and for much of our enculturation, everyone should accept that language is immensely important for normal cognitive development. That language has this sort of cognitive function should be no news to anyone.[2]

 

2.2            Language as sculpting cognition

A stronger and more controversial thesis has been proposed and defended by some researchers over recent decades. This is that the process of language acquisition and enculturation does not merely serve to load the mind with beliefs and concepts, but actually sculpts our cognitive processes to some degree (Lucy, 1992a, 1992b; Nelson, 1996; Bowerman and Levinson, 2001).[3]

For example, acquisition of Yucatec (as opposed to English) – in which plurals are rarely marked and many more nouns are treated grammatically as substance-terms like ‘mud’ and ‘water’ – leads subjects to see similarities amongst objects on the basis of material composition rather than shape (Lucy, 1992b; Lucy and Gaskins, 2001). And children brought up speaking Korean (as opposed to English) – in which verbs are highly inflected and massive noun ellipsis is permissible in informal speech – leads children to be much weaker at categorization tasks, but much better at means–ends tasks such as using a rake to pull a distant object towards them (Choi and Gopnik, 1995; Gopnik et al., 1996; Gopnik, 2001).

            Fascinating as these data are, they do not, in themselves, support any version of the cognitive conception of language. This is because the reported effects of language on cognition are still entirely diachronic and developmental, rather than synchronic. The fact that acquiring one language as opposed to another causes subjects to attend to different things and to reason somewhat differently doesn’t show that language itself is actually involved in people’s thinking. Indeed, on the hypothesis proposed by Gopnik (2001), language-acquisition has these effects by providing evidence for a pre-linguistic theorizing capacity, which operates throughout development to construct children’s systems of belief and inference.

 

2.3            Language as a cognitive scaffold

Other claims can be extracted from the work of Vygotsky (1934/1986), who argues that language and speech serve to scaffold the development of cognitive capacities in the growing child. Researchers working in this tradition have studied the self-directed verbalizations of young children – for example, observing the effects of their soliloquies on their behavior (Diaz and Berk, 1992). They have found that children tend to verbalize more when task demands are greater, and that those who verbalize most tend to be more successful in problem-solving.

This claim of linguistic scaffolding of cognition admits of a spectrum of readings, however. At its weakest, it says no more than has already been conceded above, that language may be a necessary condition for the acquisition of certain cognitive skills. At its strongest, on the other hand, the idea could be that language forms part of the functioning of the highest-level executive system – which would then make it a variant of the ideas to be discussed in sections 4 and 5 below.

            Clark (1998) argues for a sort of intermediate-strength version of the Vygotskian idea, defending a conception of language as a cognitive tool. (Chomsky, too, has argued for an account of this sort. See his 1976, ch.2.) According to this view – which Clark labels ‘the supra-communicative conception of language’ – certain extended processes of thinking and reasoning constitutively involve natural language. The idea is that language gets used, not just for communication, but also to augment human cognitive powers.

Thus by writing an idea down, for example, I can off-load the demands on memory, presenting myself with an object of further leisured reflection; and by performing arithmetic calculations on a piece of paper, I may be able to handle computational tasks which would otherwise be too much for me (and my short-term memory). In similar fashion, it may be that inner speech serves to enhance memory, since it is now well-established that the powers of human memory systems can be greatly extended by association (Baddeley, 1988). Inner speech may thus facilitate complex trains of reasoning (Varley, 1998).

Notice that on this supra-communicative account, the involvement of language in thought only arises when we focus on a process of thinking or reasoning extended over time. So far as any given individual (token) thought goes, the account can (and does) buy into the standard input–output conception of language. It maintains that there is a neural episode which carries the content of the thought in question, where an episode of that type can exist in the absence of any natural language sentence and can have a causal role distinctive of the thought, but which in the case in question causes the production of a natural language representation. This representation can then have further benefits for the system of the sort which Clark explores (off-loading or enhancing memory).

            According to stronger forms of the cognitive conception to be explored in later sections, in contrast, a particular tokening of an inner sentence is (sometimes) an inseparable part of the mental episode which carries the content of the thought-token in question. So there is no neural or mental event at the time which can exist distinct from that sentence, which can occupy a causal role distinctive of that sort of thought, and which carries the content in question; and so language is actually involved in (certain types of) cognition, even when our focus is on individual (token) thinkings.

In this section I have discussed two weak claims about the role of language (that language is necessary for the acquisition of many beliefs and concepts; and that language may serve as a cognitive tool, enhancing the range and complexity of our reasoning processes). These claims should be readily acceptable to most cognitive scientists. In addition, I have briefly introduced a more controversial thesis, namely that the acquisition of one or another natural language can sculpt our cognitive processes, to some degree. But this thesis relates only to the developmental, or diachronic, role of language. It says nothing about the role of language in adult cognition. We will in future focus on more challenging versions of the cognitive conception of language.

 

3          Strong claims

As is starting to emerge, the thesis that language has a cognitive function admits of a spectrum of readings. In this section I shall jump to the other end of that spectrum, considering forms of the cognitive conception of language which are too strong to be acceptable.

 

3.1            Language as necessarily required for thought

When the question of the place of natural language in cognition has been debated by philosophers the discussion has, almost always, been conducted a priori in universalist terms. Various arguments have been proposed for the claim that it is a conceptually necessary truth that all thought requires language, for example (Wittgenstein, 1921, 1953; Davidson, 1975, 1982; Dummett, 1981, 1989; McDowell, 1994). But these arguments all depend, in one way or another, upon an anti-realist conception of the mind – claiming, for instance, that since we cannot interpret anyone as entertaining any given fine-grained thought in the absence of linguistic behavior, such thoughts cannot even exist in the absence of such behavior (Davidson, 1975). Since the view adopted in this paper – and shared by most cognitive psychologists – is quite strongly realist about thought, I do not propose to devote any time to such arguments.

            Notice, too, that Davidson et al. are committed to denying that any non-human animals can entertain genuine thoughts, given that it is very doubtful whether any such animals are capable of understanding and using a natural language (in the relevant sense of ‘language’, that is; see Premack, 1986; Pinker, 1994). This conclusion conflicts, not just with common-sense belief, but also with what can be discovered about animal cognition, both experimentally and by observation of their behavior in the wild (de Waal, 1982, 1996; Walker, 1983; Gallistel, 1990; Savage-Rumbaugh and Lewin, 1994; Byrne, 1995; Dickinson and Shanks, 1995; Allen and Bekoff, 1997; Hauser, 2000; Povinelli, 2000). So not only are the arguments of Davidson et al. unsound, but we have independent reasons to think that their conclusion is false.

            Dummett (1994) makes some attempt to accommodate this sort of point by distinguishing between concept-involving thoughts (which are held to be necessarily dependent upon language) and what he calls ‘proto-thoughts’, which are what animals are allowed to possess. Proto-thoughts are said to consist of ‘visual images superimposed on the visually perceived scene’, and are said to be possible only when tied to current circumstances and behavior. But such an account vastly under-estimates the cognitive capacities of non-human animals, I believe. If an animal can decide whom to form an alliance with, or can calculate rates of return from different sources of food, or can notice and exploit the ignorance of another, then these things cannot be accounted for in Dummett’s terms. And given that conceptual thinking of this sort is possible for animals, then he will be left without any principled distinction between animal thought and human thought.

            I do not expect that these brief considerations will convince any of my philosophical opponents, of course; and they aren’t meant to. Given the intended readership of this target-paper, their position is not really one that I need to take seriously. It is mentioned here just to set it aside, and (most importantly) in order that other, more plausible, versions of the cognitive conception of language shouldn’t be confused with it.

I propose, therefore, to take it for granted that thought is conceptually independent of natural language, and that thoughts of many types can actually occur in the absence of such language. But this leaves open the possibility that some types of thought might de facto involve language, given the way in which human cognition is structured. It is on this – weaker but nevertheless still controversial – set of claims that I shall focus. Claims of this type seem to me to have been unjustly under-explored by researchers in the cognitive sciences; partly, no doubt, because they have been run together with the a priori and universalist claims of some philosophers, which have been rightly rejected.

 

3.2       The Joycean machine

Another overly-strong form of cognitive conception of language – which has been endorsed by some philosophers and by many social scientists – is that language is, as a matter of fact, the medium of all human conceptual thinking. Most often it has been associated with a radical empiricism about the mind, according to which virtually all human concepts and ways of thinking, and indeed much of the very structure of the human mind itself, are acquired by young children from adults when they learn their native language – these concepts and structures differing widely depending upon the conceptual resources and structures of the natural language in question. This mind-structuring and social-relativist view of language is still dominant in the social sciences, following the writings early in this century of the amateur linguist Whorf (many of whose papers have been collected together in his 1956) indeed, Pinker (1994) refers to it disparagingly as ‘the Standard Social Science Model’ of the mind.

Perhaps Dennett (1991) provides one of the clearest exponents of this view. He argues that human cognitive powers were utterly transformed following the appearance of natural language, as the mind became colonized by memes (ideas, or concepts, which are transmitted, retained and selected in a manner supposedly analogous to genes; see Dawkins, 1976). Prior to the evolution of language, on this picture, the mind was a bundle of distributed connectionist processors – which conferred on early hominids some degree of flexibility and intelligence, but which were quite limited in their computational powers. The arrival of language then meant that a whole new – serial and compositionally structured – cognitive architecture could be programmed into the system.

This is what Dennett calls the Joycean machine (named after James Joyce’s ‘stream of consciousness’ writing). The idea is that there is a highest-level processor which runs on a stream of natural-language representations, utilizing learned connections between ideas, and patterns of reasoning acquired in and through the acquisition of linguistic memes. On this account, then, the concept-wielding mind is a kind of social construction, brought into existence through the absorption of memes from the surrounding culture. And on this view, the conceptual mind is both dependent upon, and constitutively involves, natural language.

Admittedly, what Dennett will actually say is that animals and pre-linguistic hominids are capable of thought, and engage in much intelligent thinking. But this is because he is not (in my sense) a realist about thoughts. On the contrary, he (like Davidson) is what is sometimes called an ‘interpretationalist’ – he thinks that there is nothing more to thinking than engaging in behavior which is interpretable as thinking. Yet he does seem committed to saying that it is only with the advent of natural language that you get a kind of thinking which involves discrete, structured, semantically-evaluable, causally-effective states – that is, thoughts realistically construed.

            Bickerton’s proposals (1990, 1995) are somewhat similar, but more biological in flavor. He thinks that, before the evolution of language, hominid cognition was extremely limited in its powers. On his view these early forms of hominid cognition consisted largely of a set of relatively simple computational systems, underpinning an array of flexible but essentially behavioristic conditioned responses to stimuli. But then the evolution of language some 100,000 years ago involved a dramatic re-wiring of the hominid brain, giving rise to distinctively human intelligence and conceptual powers.[4]

Bickerton, like Dennett, allows that subsequent to the evolution of language the human mind would have undergone further transformations, as the stock of socially transmitted ideas and concepts changed and increased. But the basic alteration was coincident with, and constituted by, a biological alteration – namely, the appearance of an innately-structured language-faculty. For Bickerton is a nativist about language. (Indeed, his earlier work on the creolization of pidgin languages – 1981 – is often cited as part of an argument for the biological basis of language; see Pinker, 1994.) And it is language which, he supposes, conferred on us the capacity for ‘off-line thinking’ – that is, the capacity to think and reason about topics and problems in the abstract, independent of any particular sensory stimulus.

            These strong views seem very unlikely to be correct. This is so for two reasons. First, they undervalue the cognitive powers of pre-linguistic children, animals, and earlier forms of hominid. Thus Homo erectus and archaic forms of Homo sapiens, for example, were able to survive in extremely harsh tundra environments, presumably without language (see below). It is hard to see how this could have been possible without a capacity for quite sophisticated planning and a good deal of complex social interaction (Mithen, 1996). Second, the views of Dennett and Bickerton are inconsistent with the sort of central-process modularism which has been gaining increasing support in recent decades. On this account the mind contains a variety of conceptual modules – for mind-reading, for doing naďve physics, for reasoning about social contracts, and so on – which are probably of considerable ancestry, pre-dating the appearance of a modular language-faculty.[5] So hominids were already capable of conceptual thought, and of reasoning in a complex, and presumably ‘off-line’, fashion before the arrival of language.

In sections 3.3 and 3.4 which follow I shall elaborate briefly on these points. But first, I want to consider a potential reply which might be made by someone sympathetic to Bickerton’s position. For Bickerton actually thinks that earlier hominids probably used a form of ‘proto-language’ prior to the evolution of syntax, similar to the language used by young children and to pidgin languages. (This is, in fact, a very plausible intermediate stage in the evolution of natural language.) It might be claimed, then, that insofar as hominids are capable of intelligent thought, this is only because those thoughts are framed in proto-language. So the view that thought is dependent upon language can be preserved.

Such a reply would, indeed, give Bickerton a little extra wiggle-room; but only a little. For as we shall see in section 3.3 below, a good deal of the evidence for hominid thinking is provided by the capacities of our nearest relatives, the great apes, who are known to lack even a proto-language (without a good deal of human enculturation and explicit training, at any rate; Savage-Rumbaugh and Lewin, 1994). And some of the other evidence – e.g. provided by hominid stone knapping – is not plausibly seen as underpinned by proto-language. Moreover, the various thought-generating central modules, to be discussed in section 3.4 below, are almost surely independent both of language and proto-language. So it remains the case that much hominid thought is independent even of proto-language.

 

3.3            Hominid intelligence

Since social intelligence is something which we share with the other great apes (especially chimpanzees), it is reasonable to conclude that the common ancestor of all apes - and so, by implication, all earlier forms of hominid - will also have excelled in the social domain. While it is still disputed whether chimpanzees have full-blown mind-reading, or ‘theory of mind’, abilities, of the sort attained by a normal four-year-old child, it is not in dispute that the social behavior of great apes can be extremely subtle and sophisticated (Byrne and Whiten, 1988, 1998; Byrne, 1995; Povinelli, 2000).

Two points are worth stressing in this context. One is that it is well-nigh impossible to see how apes can be capable of representing multiple, complex, and constantly changing social relationships (who is friends with whom, who has recently groomed whom, who has recently fallen out with whom, and so on) unless they are capable of structured propositional thought.[6] This is a development of what Horgan and Tienson (1996) call ‘the tracking argument for Mentalese’ (i.e. an argument in support of the claim that thoughts are structured out of recombinable components). Unless the social thoughts of apes were composed out of elements variously representing individuals and their properties and relationships, then it is very hard indeed to see how they could do the sort of one-off learning of which they are manifestly capable. This surely requires separate representations for individuals and their properties and relations, so that the latter can be varied while the former are held constant. So (contra Dennett and Bickerton) we have reason to think that all earlier forms of hominid would have been capable of sophisticated conceptual thought (realistically construed), at least in the social domain.[7]

            The second point to note is that the social thinking of apes seems sometimes to be genuinely strategic in nature, apparently involving plans which are executed over the course of days or months. Consider, for example, the way in which a band of male chimpanzees will set out quietly and in an organized and purposive manner towards the territory of a neighboring group, apparently with the intention, either of killing some of the males of that group, or of capturing some of its females, or both (Byrne, 1995). Or consider the way in which a lower-ranking male will, over the course of a number of months, build up a relationship with the beta male, until the alliance is strong enough for them to co-operate in ousting the alpha male from his position (de Waal, 1982). Presumably the thinking which would generate such long-term plans and strategies would have to be ‘off-line’, in the sense of not being tied to or driven by current perceptions of the environment.

            We can conclude, then, that all of our hominid ancestors would have had a sophisticated social intelligence. In addition, the stone-tool-making abilities of later species of Homo erectus indicate a sophisticated grasp of fracture dynamics and the properties of stone materials. Making stone tools isn’t easy. It requires judgment, as well as considerable hand-eye co-ordination and upper-body strength. And since it uses a reductive technology (starting from a larger stone and reducing it to the required shape) it cannot be routinized in the way that (presumably) nest-building by weaver birds and dam-building by beavers can be. Stone knappers have to hold in mind the desired shape and plan two or more strikes ahead in order to work towards it using variable and unpredictable materials (Pelegrin, 1993; Mithen, 1996). Moreover, some of the very fine three-dimensional symmetries produced from about half-a-million years ago would almost certainly have required significant capacities for visual imagination - in particular, an ability to mentally rotate an image of the stone product which will result if a particular flake is struck off (Wynn, 2000). And this is surely ‘off-line’ thinking if anything is!

            We can also conclude that early humans were capable of learning and reasoning about their natural environments with a considerable degree of sophistication. They were able to colonize much of the globe, ranging from Southern Africa to North-Western Europe to South-East Asia. And they were able to thrive in a wide variety of habitats (including extremely harsh marginal tundra environments), adapting their life-style to local - and sometimes rapidly changing - circumstances (Mithen, 1990, 1996). This again serves as a premise for a version of the ‘tracking argument’, suggesting that early humans were capable of compositionally-structured thoughts about the biological as well as the social worlds.

 

3.4       The modular mind

The above claims about the cognitive powers of our early ancestors both support, and are in turn supported by, the evidence of modular organization in the minds of contemporary humans. On this account, besides a variety of input and output modules (including early vision, face-recognition, and language, for example), the mind also contains a number of innately channeled conceptual modules, designed to process conceptual information concerning particular domains. Although these would not be modules in Fodor’s classic (1983) sense, in that they wouldn’t have proprietary transducers, might not have dedicated neural hardware, and might not be fully encapsulated, they would still be innately channeled dedicated computational systems, generating information in accordance with algorithms which are not shared with, nor accessible to, other systems.

Plausible candidates for such conceptual modules might include a naďve physics system (Leslie, 1994; Spelke, 1994; Spelke et al., 1995; Baillargeon, 1995), a naďve psychology or ‘mind-reading’ system (Carey, 1985; Leslie, 1994; Baron-Cohen, 1995), a folk-biology system (Atran, 1990, 1998, 2002), an intuitive number system (Wynn, 1990, 1995; Gallistel and Gelman, 1992; Dehaerne, 1997), a geometrical system for re-orienting and navigating in unusual environments (Cheng, 1986; Hermer and Spelke, 1994, 1996) and a system for processing and keeping track of social contracts (Cosmides and Tooby, 1992; Fiddick et al., 2000).

Evidence supporting the existence of at least the first two of these systems (folk-physics and folk-psychology) is now pretty robust. Very young infants already have a set of expectations concerning the behaviors and movements of physical objects, and their understanding of this form of causality develops very rapidly over the first year or two of life. And folk-psychological concepts and expectations also develop very early, and follow a characteristic developmental profile. Indeed, recent evidence from the study of twins suggests that three-quarters of the variance in mind-reading abilities amongst three year olds is both genetic in origin and largely independent of the genes responsible for verbal intelligence, with only one quarter of the variance being contributed by the environment (Hughes and Plomin, 2000).[8]

            Now, of course the thesis of conceptual modularity is still highly controversial, and disputed by many cognitive scientists. And I cannot pretend to have said enough to have established it here; nor is there the space to attempt to do so. This is going to be one of the large assumptions which I need to ask my readers to take on board as background to what follows. However, there is one sort of objection to conceptual modularity which I should like to respond to briefly here. This is that there simply hasn’t been time for all of these modular systems to have evolved (or at any rate, not those of them that are distinctively human – geometry and folk-physics might be the exceptions).

            Tomasello (1999) argues that the mere six million years or so since the hominid line diverged from the common ancestor of ourselves and chimpanzees is just too short a time for the processes of evolution to have sculpted a whole suite of conceptual modules. He thinks that explanations of distinctively-human cognition need to postulate just one – at most two – biological adaptations, in terms of which all the other cognitive differences between us and chimpanzees should be explained. His preferred option is theory of mind ability, which underpins processes of cultural learning and cultural accumulation and transmission. Others might argue in similar fashion that the only major biological difference is the language faculty (Perner, personal communication).

            The premise of this argument is false, however; six million years is a lot of time, particularly if the selection pressures are powerful ones. (Only 10,000 years separate polar bears and grizzlies, for example.) And this is especially so when, as in the present case, many of the systems in question don’t have to be built ab initio, but can result from a deepening and strengthening of pre-existing faculties. Thus theory of mind would surely have developed from some pre-existing social-cognition module; folk-biology from a pre-existing foraging system; and so on. In order to reinforce the point, one just has to reflect on the major, and multiple, physical differences between ourselves and chimpanzees – including upright gait, arm-length, physical stature, brain size, nasal shape, hairlessness, whites of eyes, and so on and so forth. These, too, have all evolved – many of them independently, plainly – over the last six million years.

 

3.5       Taking stock

What has happened in the cognitive sciences in recent decades, then, is this. Many researchers have become increasingly convinced, by neuropsychological and other evidence, that the mind is more or less modular in structure, built up out of isolable, and largely isolated, components (Fodor, 1983; Sachs, 1985; Shallice, 1988; Gallistel, 1990; Barkow et al., 1992; Hirschfeld and Gelman, 1994; Sperber et al., 1995; Pinker, 1997). They have also become convinced that the structure and contents of the mind are substantially innate (Fodor, 1981, 1983; Carey, 1985; Spelke, 1994), and that language is one such isolable and largely innate module (Fodor, 1983; Chomsky, 1988; Pinker, 1994). There has then been, amongst cognitive scientists, a near-universal reaction against the cognitive conception of language, by running it together with the Whorfian hypothesis. Most researchers have assumed, without argument, that if they were to accept any form of cognitive conception of language, then that would commit them to Whorfian linguistic relativism and radical empiricism, and would hence be inconsistent with their well-founded beliefs in modularity and nativism (Pinker, 1994).

            It is important to see, however, that someone endorsing the cognitive conception of language does not have to regard language and the mind as cultural constructs, either socially determined or culturally relative. In fact, some form of cognitive conception of language can equally well be deployed along with a modularist and nativist view of language and mind. There are a range of positions intermediate between the input–output conception of language on the one hand, and Whorfian relativism (the Standard Social Science Model) on the other, which deserve the attention of philosophers and cognitive scientists alike. These views are nativist as opposed to empiricist about language and much of the structure of the mind, but nevertheless hold that language is constitutively employed in many of our thoughts.

 

4            Language and conscious thinking

What is at stake, then, is the question whether language might be constitutively involved in some forms of human thinking. But which forms? In previous work I suggested that language might be the medium in which we conduct our conscious propositional thinking - claiming, that is, that inner speech might be the vehicle of conscious-conceptual (as opposed to conscious visuo-spatial) thinking (Carruthers, 1996). This view takes seriously and literally the bit of folk-wisdom with which this paper began - namely, that much of our conscious thinking (viz. our propositional thinking) is conducted in inner speech.

            Now, if the thesis here is that the cognitive role of language is confined to conscious thinking, then it will have to be allowed that much propositional thinking also takes place independently of natural language - for it would hardly be very plausible to maintain that there is no thinking but conscious thinking. And there are then two significant options regarding the relations between non-conscious language-independent thought, on the one hand, and conscious language-involving thinking, on the other. For either we would have to say that anything which we can think consciously, in language, can also be thought non-consciously, independently of language; or we would have to say that there are some thought-types which can only be entertained at all, by us, when tokened consciously in the form of an imaged natural language sentence.

            Suppose that it is the first - weaker and more plausible - of these options which is taken. Then we had better be able to identify some element of the distinctive causal role of an imaged sentence which is sufficiently thought-like or inference-like for us to be able to say that the sentence in question is partly constitutive of the (conscious) tokening of the thought-type in question, rather than being merely expressive of it. For otherwise - if everything which we can think consciously, in language, we can also think non-consciously, without language - what is to block the conclusion that inner speech is merely the means by which we have access to our occurrent thoughts, without inner speech being in any sense constitutive of our thinking? (On this, at length, see Carruthers 1998b.)

            There would seem to be just two distinct (albeit mutually consistent) possibilities here. One (implicit in Carruthers, 1996) would be to propose a suitably weakened version of Dennett’s Joycean machine hypothesis. While allowing (contra Dennett) that much conceptual thinking (realistically construed) and all conceptual thought-types are independent of language (in the sense of not being constituted by it), we could claim that there are certain learned habits and patterns of thinking and reasoning which are acquired linguistically, and which are then restricted to linguistic (and conscious) tokenings of the thoughts which they govern. It is surely plausible, for example, that exact long-division or multiplication can only be conducted consciously, in imaged manipulations of numerical symbols. Similarly, it may be that the result of taking a course in logic is that one becomes disposed to make transitions between sentences, consciously in language, where one would otherwise not have been disposed to make the corresponding transitions between the thoughts expressed. If these sorts of possibilities are realized, then we would have good reason to say of a token application of a particular inference-form, that the imaged natural language sentences involved are constitutive of the inference in question, since it could not have taken place without them.

A second possibility is proposed and defended by Frankish (1998a, 1998b, and forthcoming; see also Cohen, 1993). This is that the distinctive causal role of inner speech is partly a function of our decisions to accept, reject, or act on the propositions which our imaged sentences express. I can frame a hitherto unconsidered proposition in inner speech and decide that it is worthy of acceptance, thereby committing myself to thinking and acting thereafter as if that sentence were true. Then provided that I remember my commitments and execute them, it will be just as if I believed the proposition in question. (In his published work Frankish describes this level of mentality as the ‘virtual mind’ and the beliefs in question as ‘virtual beliefs’.) But, by hypothesis, I would never have come to believe what I do, nor to reason as I do reason, except via the tokening of sentences in inner speech. Frankish argues, in effect, that there is a whole level of mentality (which he now dubs ‘supermind’) which is constituted by our higher-order decisions and commitments to accept or reject propositions; and that language is constitutive of the thoughts and beliefs which we entertain at this level.

Such views have considerable plausibility; and it may well be that one, or other, or both of these accounts of the causal role of inner speech is correct. Indeed, the dual process theory of human reasoning developed over the years by Evans and colleagues (Wason and Evans, 1975; Evans and Over, 1996), and more recently by Stanovich (1999), combines elements of each of them. On this account, in addition to a suite of computationally powerful, fast, and implicit reasoning systems (from our perspective, a set of conceptual modules), the mind also contains a slow, serial, and explicit reasoning capacity, whose operations are conscious and under personal control, and which is said (by some theorists at least; e.g. Evans and Over, 1996) to involve natural language. The emphasis here on learned rules in the operations of the explicit system is reminiscent of Dennett’s ‘Joycean machine’, whereas the stress on our having personal control over the operations of that system seems very similar to Frankish’s conception of ‘supermind’.

Not only is some form of dual-process theory plausible, but it should also be stressed that these accounts are independent of central-process modularism. Those who deny the existence of any conceptual modules can still accept that there is a level of thinking and reasoning which is both language-involving and conscious. It is surely plain, however, that none of the above accounts can amount to the most fundamental cognitive function of language once conceptual modularity is assumed.

Given conceptual modularity, then unless the above views are held together with the thesis to be developed in section 5 below – namely, that language provides the medium for inter-modular communication and non-domain-specific thinking – then we can set their proponents a dilemma. Either they must claim that a domain-general architecture was in place prior to the evolution of language. Or they must allow that there was no significant domain-general cognition amongst hominids prior to the appearance of language and language-involving conscious thinking; and they must claim that such cognition still evolved as a distinct development, either at the same time or later. Since contemporary humans are manifestly capable of conjoining information across different domains in both their theoretical thought and their planning, then either pre-linguistic humans must already have had domain-general theoretical and practical reasoning faculties, or they must have evolved them separately at the same time or after the evolution of the language faculty (that is, if it isn’t language itself which enables us to combine information across modules).

The problem with the first alternative, however – namely, that domain-general reasoning capacities pre-dated language – is that the evidence from cognitive archaeology suggests that this was not the case. For although the various sub-species of Homo erectus and archaic forms of Homo sapiens were smart, they were not that smart. Let me briefly elaborate.

As Mithen (1996) demonstrates at length and in detail, the evidence from archaeology is that the minds of early humans were in important respects quite unlike our own. While they successfully colonized diverse and rapidly changing environments, the evidence suggests that they were incapable of bringing together information across different cognitive domains. It seems that they could not (or did not) mix information from the biological world (utilized in hunting and gathering) with information about the physical world (used in tool making); and that neither of these sorts of information interacted with their social intelligence. Although they made sophisticated stone tools, they did not use those tools for specialized purposes (with different kinds of arrow-head being used for different kinds of game, for example); and they did not make tools out of animal products such as antler and bone. There is no sign of the use of artifacts as social signals, in the form of body ornaments and such-like, which is so ubiquitous in modern human cultures. And there is no indication of totemization or other sorts of linkages between social and animal domains, such as lion-man figurines, cave-paintings, or the burying of the dead with (presumably symbolic) animal parts - which all emerge onto the scene for the first time with modern humans. As Mithen summarizes the evidence, it would appear that early humans had sophisticated special intelligences, but that these faculties remained largely isolated from one another.

The problem with the second horn of the dilemma sketched above is just that it is hard to believe, either that a domain-general reasoning faculty might have evolved after the appearance of language some 100,000 years ago (in just the 20,000 years or so before the beginning of the dispersal of modern humans around the globe), or that language and domain-general capacities might have co-evolved as distinct faculties. For as we shall see in section 5, the evolution of language would in any case have involved the language faculty taking inputs from, and sending outputs to, the various modular systems, if there wasn’t already a domain-general system for it to be linked to. And it is hard to discern what the separate selection pressures might have been, which would have led to the development of two distinct faculties at about the same time (language and domain-general thought), when just one would serve.

 

5            Language as the medium of non-domain-specific thinking

The hypothesis which I particularly want to explore, then, is that natural language is the medium of non-domain-specific thought and inference. Versions of this hypothesis have been previously proposed by Carruthers (1996, 1998a), by Mithen (1996), and by Spelke and colleagues (Hermer-Vazquez et al., 1999; Spelke and Tsivkin, 2001; Spelke, forthcoming). I shall sketch the thesis itself, outline the existing experimental evidence in its support, and then (in the section following) consider some of its ramifications and possible elaborations. Finally (in section 7) I shall discuss what further evidence needs to be sought as a test of our thesis.

 

5.1       The thesis

The hypothesis in question assumes a form of central-process modularism. That is, it assumes that in addition to the various input and output modules (vision, face-recognition, hearing, language, systems for motor-control, etc.), the mind also contains a range of conceptual modules, which take conceptual inputs and deliver conceptual outputs. Evidence of various sorts has been accumulating in support of central-process modularism in recent decades (some of which has already been noted above). One line of support is provided by evolutionary psychologists, who have argued on both theoretical and empirical grounds that the mind contains a suite of domain-specific cognitive adaptations (Barkow et al., 1992; Sperber, 1996; Pinker, 1997). But many who would not describe themselves as ‘evolutionary psychologists’ have argued for a modular organization of central cognition, on developmental, psychological, and/or neuro-pathological grounds (Carey, 1985; Shallice, 1988; Gallistel, 1990; Carey and Spelke, 1994; Leslie, 1994; Spelke, 1994; Baron-Cohen, 1995; Smith and Tsimpli, 1995; Hauser and Carey, 1998).

            What cognitive resources were antecedently available, then, prior to the evolution of the language faculty? Taking the ubiquitous laboratory rat as a representative example, I shall assume that all mammals, at least, are capable of thought – in the sense that they engage in computations which deliver structured (propositional) belief-like states and desire-like states (Dickinson, 1994; Dickinson and Balleine, 2000). I shall also assume that these computations are largely carried out within modular systems of one sort or another (Gallistel, 1990) – after all, if the project here is to show how cross-modular thinking in humans can emerge out of modular components, then we had better assume that the initial starting-state was a modular one. Furthermore, I shall assume that mammals possess some sort of simple non-domain-specific practical reasoning system, which can take beliefs and desires as input, and figure out what to do.

            I shall assume that the practical reasoning system in animals (and perhaps also in us) is a relatively simple and limited-channel one. Perhaps it receives as input the currently-strongest desire and searches amongst the outputs of the various belief-generating modules for something which can be done in relation to the perceived environment which will satisfy that desire. So its inputs have the form DESIRE [Y] and BELIEF [IF X THEN Y], where X should be something for which an existing motor-program exists. I assume that the practical reasoning system is not capable of engaging in other forms of inference (generating new beliefs from old), nor of combining together beliefs from different modules; though perhaps it is capable of chaining together conditionals to generate a simple plan – e.g. BELIEF [IF W THEN X], BELIEF [IF X THEN Y] ® BELIEF [IF W THEN Y].

            The central modules will take inputs from perception, of course. And my guess is that many of the beliefs and desires generated by the central modules will have partially indexical contents – thus a desire produced as output by the sex module might have the form, ‘I want to mate with that female’, and a belief produced by the causal-reasoning module might have the form, ‘That caused that’. So if the practical reasoning system is to be able to do anything with such contents, then it, too, would need to have access to the outputs of perception, to provide anchoring for the various indexicals. The outputs of the practical reasoning system are often likely to be indexical too, such as an intention of the form, ‘I’ll go that way’.

The inputs to central-process modules can presumably include not only conceptualized perceptions but also propositional descriptions (in the latter case deriving from linguistic input - for we surely use our mind-reading system, for example, when processing a description of someone’s state of mind as well as when observing their behavior). And in some cases, too, the inputs to a module will include the outputs of other central-process modules; for we might expect that there will be cases in which modules are organized into some sort of hierarchy. But what of the outputs from central-process modules? Besides being directed to other modules (in some instances), and also to the practical reasoning system, where is the information which is generated by central-process modules normally sent? And in particular, is there some non-domain-specific central arena where all such information is collated and processed?

            The hypothesis being proposed here is that there is such an arena, but one which crucially implicates natural language, and which cannot operate in the absence of such language. Moreover, the hypothesis is not just that our conscious propositional thinking involves language (as sketched in section 4 above), but that all non-domain-specific reasoning of a non-practical sort (whether conscious or non-conscious) is conducted in language. And as for the question of what a non-conscious tokening of a natural language sentence would be like, we can propose that it would be a representation stripped of all imagistic-phonological features, but still consisting of natural language lexical items and syntactic structures. (The role of syntax in the present account will be further explored in section 6.1 below.)

Chomsky (1995) has maintained, for example, that there is a level of linguistic representation which he calls ‘Logical Form’ (LF), which is where the language faculty interfaces with central cognitive systems. We can then claim that all cross-modular thinking consists in the formation and manipulation of these LF representations. The hypothesis can be that all such thinking operates by accessing and manipulating the representations of the language faculty. Where these representations are only in LF, the thoughts in question will be non-conscious ones. But where the LF representation is used to generate a full-blown phonological representation (an imaged sentence), the thought will nomally be conscious. And crucially for my purposes, the hypothesis is that the language faculty has access to the outputs of the various central-process modules, in such a way that it can build LF representations which combine information across domains.

Let me say a just little more about the conscious / non-conscious distinction as it operates here. As I shall mention again in a moment (and as I shall return to at some length in section 6.2) language is both an input and an output module. Its production sub-system must be capable of receiving outputs from the conceptual modules in order to transform their creations into speech. And its comprehension sub-system must be capable of transforming heard speech into a format suitable for processing by those same conceptual modules. Now when LF representations built by the production sub-system are used to generate a phonological representation, in ‘inner speech’, that representation will be consumed by the comprehension sub-system, and made available to central systems. One of these systems is a theory of mind module. And on the sort of higher-order theory of consciousness which I favor (Carruthers, 2000), perceptual and imagistic states get to be phenomenally conscious by virtue of their availability to the higher-order thoughts generated by the theory of mind system (i.e. thoughts about those perceptual and imagistic states). So this is why inner speech of this sort is conscious: it is because it is available to higher-order thought.

            The hypothesis, then, is that non-domain-specific, cross-modular, propositional thought depends upon natural language - and not just in the sense that language is a necessary condition for us to entertain such thoughts, but in the stronger sense that natural language representations are the bearers of those propositional thought-contents. So language is constitutively involved in (some kinds of) human thinking. Specifically, language is the vehicle of non-modular, non-domain-specific, conceptual thinking which integrates the results of modular thinking.

            Before moving on to discuss the evidence in support of our thesis, consider one further question. Why does it have to be language, and not, for example, visual imagery which serves the integrative function? For visual images, too, can carry contents which cross modular domains. But such visual thinking will access and deploy the resources of a peripheral input module. It cannot, therefore, play a role in integrating information across conceptual modules, because the latter exist down-stream of the input-systems. Vision provides input to conceptual modules, and doesn’t receive output from them. The language faculty, in contrast, while also ‘peripheral’, has both input and output functions. (I shall return to this point again in section 6.2 below.) I would hypothesize, therefore, that in cases where visual images have cross-modular contents (and aren’t memory images), they are always generated from some linguistic representation which originally served to integrate those contents.

 

5.2       The evidence

What evidence is there to support the hypothesis that natural language is the medium of inter-modular communication, or of non-domain-specific integrated thinking? Until recently, the evidence was mostly circumstantial. For example, one indirect line of argument in support of our thesis derives from cognitive archaeology, when combined with the evidence of contemporary central-process modularism (Mithen, 1996). For as we noted above, it seems that we only have significant evidence of cross-modular thought following the emergence of contemporary humans some 100,000 years ago; whereas independent evidence suggests that language, too, was a late evolutionary adaptation, only finally emerging at about the same time (perhaps from an earlier stage of ‘proto-language’ - Bickerton, 1990, 1995). So the simplest hypothesis is that it is language which actually enables cross-modular thinking.

            Another strand of indirect evidence can be provided if we take seriously the idea that the stream of inner verbalization is constitutive of (some forms of) thinking (Carruthers, 1996). For as we saw in section 4 above, such views can only plausibly be held (given the truth of central-process modularism) together with the present hypothesis that language is the main medium of inter-modular communication.

            Much more importantly, however, direct tests of (limited forms of) our hypothesis have now begun to be conducted. The most important of these is Hermer-Vazquez et al. (1999), which provides strong evidence that the integration of geometric properties with other sorts of information (color, smell, patterning, etc.) is dependent upon natural language. The background to their studies with human adults is the apparent discovery of a geometric module in rats by Cheng (1986), as well as the discovery of a similar system in pre-linguistic human children (Hermer and Spelke, 1994, 1996).

            Cheng (1986) placed rats in a rectagonal chamber, and allowed them to discover the location of a food source. They were then removed from the chamber and disoriented, before being placed back into the box with the food now hidden. In each case there were multiple cues available - both geometric and non-geometric - to guide the rats in their search. For example, the different walls might be distinctively colored or patterned, one corner might be heavily scented, and so on. In fact in these circumstances the rats relied exclusively on geometric information, searching with equal frequency, for example, in the two geometrically-equivalent corners having a long wall on the left and a short wall on the right. Yet rats are perfectly well capable of noticing and remembering non-geometric properties of the environment and using them to solve other tasks. So it appears that, not only are they incapable of integrating geometric with non-geometric information in these circumstances, but that geometric information takes priority.

            (This makes perfectly good ecological-evolutionary sense. For in the rat’s natural environment, overall geometrical symmetries in the landscape are extremely rare, and geometrical properties generally change only slowly with time; whereas object-properties of color, scent-markings, and so on will change with the weather and seasons. So a strong preference to orient by geometrical properties is just what one might predict.)

            Hermer and Spelke (1994, 1996) found exactly the same phenomenon in pre-linguistic human children. Young children, too, rely exclusively on geometric information when disoriented in a rectangular room, and appear incapable of integrating geometrical with non-geometrical properties when searching for a previously seen but now-hidden object. Older children and adults are able to solve these problems without difficulty - for example, they go straight to the corner formed with a long wall to the left and a short blue wall to the right. It turns out that success in these tasks isn’t directly correlated with age, nonverbal IQ, verbal working-memory capacity, vocabulary size, or comprehension of spatial vocabulary. In contrast, the only significant predictor of success in these tasks which could be discovered, was spontaneous use of spatial vocabulary conjoined with object-properties (e.g. ‘It’s left of the red one’). Even by themselves, these data strongly suggest that it is language which enables older children and adults to integrate geometric with non-geometric information into a single thought or memory.

            Hermer-Vazquez et al. (1999) set out to test this idea with a series of dual-task experiments with adults. In one condition, subjects were required to solve one of these orientation problems while shadowing (i.e. repeating back) speech played to them through a set of headphones. In another condition, they were set the same problems while shadowing (with their hands) a rhythm played to them in their headphones. The hypothesis was that speech-shadowing would tie up the resources of the language faculty, whereas the rhythm-shadowing tasks would not; and great care was taken to ensure that the latter tasks were equally if not more demanding of the resources of working memory.

            The results of these experiments were striking. Shadowing of speech severely disrupted subjects’ capacity to solve tasks requiring integration of geometric with non-geometric properties. In contrast, shadowing of rhythm disrupted subjects’ performance relatively little. Moreover, a follow-up experiment demonstrated that shadowing of speech didn’t disrupt subjects’ capacities to utilize non-geometric information per se - they were easily able to solve tasks requiring only memory for object-properties. So it would appear that it is language itself which enables subjects to conjoin geometric with non-geometric properties, just as the hypothesis that language is the medium of cross-modular thinking predicts.

            Of course, this is just one set of experiments