Commentary on
Michael Arbib
Word counts:
abstract
60
main
text 1456
references
193
entire
text (total) 1709
Title: The
“Complex First” Paradox: What’s Needed to Lexicalize Neurally Distributed Meanings?
Author name:
Markus Werning
Mailing
address:
Department of Philosophy
Heinrich-Heine
University
Düsseldorf,
D-40225
Germany
Phone:
+49-211-81-11473
Email: werning@phil.uni-duesseldorf.de
URL: www.phil.uni-duesseldorf.de/thphil/werning
Abstract:
From studies on
language acquisition and typology it can be inferred that there is an ontogenetic
and phylogenetic priority for the lexicalization of complex concepts, whose neural
correlates are more widely distributed in the cortex than those of primitive
concepts. It is argued that this poses a problem for the explanation of how an
appropriately distributive syntax-semantics interface could evolve.
Main Text:
One of Arbib’s main claims is that, once
the human brain had biologically evolved to support the features of a
protolanguage (LR1 – LR7), the additional requirements for the possession of a
real language (LA1 – LA4) were a matter merely of social development and
required no further change in the biological hardware. Among these “soft”
features of language, Arbib (with some caution) claims, are symbolization and
compositionality, i.e., the fact that ”symbols become words in the modern
sense, interchangeable and composable in the expression of meaning” (p. 13). I
would like to highlight a problem that concerns the neural syntax-semantics
interface and directly address the issues of symbolization and
compositionality. It isn’t a problem particular to Arbib’s approach, but a
problem for any neurobiological approach towards semantics. I will, however,
ask whether the story Arbib tells on the biological evolution of protolanguage
provides a framework for an adequate solution.
When one is concerned with the search for
the neural correlates of linguistic meanings that are lexical, i.e. verbalized
by syntactically unstructured expressions, one is confronted by what I would
like to call the “complex first” paradox: The paradox arises from the
fact that substance concepts are more frequently lexicalized across languages
and their lexical expressions are ontogenetically and (probably) phylogenetically
earlier acquired than is the case for attribute concepts. This is not what one
would expect from the fact that, according to widely held views (e.g. Barsalou,
1999), the prototypical (first-level) attribute concepts are semantically
primitive and the prototypical substance concepts are semantically complex.
According to those views, the substance concept [mango], e.g., is made
up of the vector of attribute concepts <orange, oval, big, soft, sweet, edible,
...>. This implies that the neural correlates of substance concepts
should be widely distributed in the cortex, whereas those of attribute concepts
should be relatively local. At first glance the effort for a syntax-semantics
interface to address attribute concepts should be less than that for addressing
substance concepts. From this point of view, it is surprising that the latter
nevertheless are lexicalized first.
Substance concepts serve to re-identify
things over time despite of their contingent changes of attributes and so allow
us to gather, store and update information in a systematic and enduring way (Millikan,
1998). They are typically expressed by concrete nouns — in English, e.g., by
names of individuals like mama, names of basic kinds like mouse and
names of stuffs like milk. The great mass of children’s earliest words
are concrete nouns. During the so-called naming explosion, when children
around 18 months of age first systematically organize their concepts by means
of a lexicon, they preponderantly pair substance concepts with concrete nouns,
whereas the assignment of adjectives (red, vertical) and abstract
nouns (redness, verticality ) to the concepts they express, viz.
attribute concepts, comes much later (Ingram, 1989). Some languages even don’t
have adjectives or just a closed set of them (Dixon, 1999), while the class of concrete nouns is (arguably) universal
and always open.
If one shares the widely held assumption
that the understanding of a word must finally result in the (synchronized)
activation of neuronal assemblies in the sensomotoric cortices (for review see Pulvermüller,
1999) and if one identifies those assemblies with the concept the word
expresses (Werning, 2003, 2004), the priority for the lexicalization of
substance concepts calls for an explanation. For, then the neural correlates of
prototypical attribute concepts would be constrained to local feature maps as
they occur for color concepts in V4 or for concepts of orientation in area 17
of V1. The correlates of substance concepts, in contrast, would be widely
distributed over the cortex. We know that the understanding of concrete nouns
for tools like hammer, e.g., results in an activity distributed over the
premotor and the visual cortex (Martin, Wiggs, Ungerleider, & Haxby, 1996).
The assertion that words for substance concepts arouse more widely distributed
activity in sensomotoric cortices than words for attribute concepts is,
furthermore, supported by a study of Rappelsberger, Weiss, and Schack (2000).
They compared the temporal coherence of EEG traces across different regions of
cortex that result from the processing of concrete and abstract nouns. For
concrete nouns they measured much higher and more widely distributed temporal
coherence, than they did for abstract nouns.
Does Arbib’s account provide a framework to
explain the ”complex first” paradox? Arbib indeed sympathizes with the view
that the first syntactically unstructured signs were semantically complex.
Their semantic values, he thinks, were in fact even more complex than those of
concrete nouns and represented entire situations. His approach might also
explain why there is a stronger evolutionary pressure to lexicalize concepts as
complex as those of substances than to lexicalize the less complex attribute
concepts. It evidently is rather economic to lexicalize concepts for often
recurring, highly specific entities of great survival value. Telling someone
that there are mangos nearby is not only shorter, but also more exact than to
tell someone that there are orange, oval, big, soft, sweet, edible things
around.
What Arbib does not answer though is the
following question: How could a mechanism evolve that enables certain regions
of cortex that are involved in representing a word (phonologically, as part of
a syntactically more complex expression, etc.) to address those regions of the sensomotoric
cortices that represent the word’s meaning, i.e., the concept it expresses.
Given that semantically complex words are evolutionary prior such an interface
towards semantics (it’s sometimes attributed to Wernicke’s area) must have
evolved at an early stage in the evolution of language and it must have had
strong distributive capacities from the beginning.
It seems to me that Arbib does not at all
deal with the problem of addressing meanings. For him the phenomenon of
meaning, i.e., the fact that certain signs (words, gestures) stand for certain
other objects, events or properties, apparently is not to be explained neurally
by a mechanism of addressing, through which neuronal representations of words
(or gestures) in Broca’s, Wernicke’s and related areas lead to an activation of
the neuronal representations of objects, events and properties in the sensomotoric
cortices. His view, rather, seems to be that gestures that were originally neurally
represented in F5 (the proposed primate homolog of Broca’s area) and related
regions for the purpose of imitation became conventionalized such that those
neural representations ceased to be representations of those gestures alone,
but of situations that were by convention linked to these gestures. Later on
those “meaningful” gestures were substituted by “meaningful” vocalizations,
which finally became words and sentences with full-blown semantic values. In Arbib’s
framework, still, meanings would remain situated in F5/Broca and closely
related areas. There would be no need to propagate activation from regions
involved in the processing of the syntax and phonology of expressions to sensomotoric
regions that host the concepts semantically expressed by them. But this can’t
be true, I suppose. For, meaning comprises empirical content. The capacity to
understand the meaning of the word mango must be inherently linked to
the capacity to perceptually recognize mangos. F5/Broca, however, is not at all
involved in the recognition of mangos. What’s involved here are the sensomotoric
cortices. These must be addressed somehow when the word mango is
understood.
In the literature on mirror neurons it is
often suggested that, since mirror neurons (in F5 and other regions) are
sensitive to certain action-object frames, say the grasping of a mango, the
neurons represent the grasping of a mango and can hence be identified with the
neural correlate of the (unsaturated) concept [grasp(X, mango)].
But this identification can’t be true, either. For, the concept [grasp(X,
mango)] is semantically composed from the concepts [grasp] and [mango]
plus some specific syntactic structure. The complex concept hence contains the
concept [mango] as its constituent. The neural correlate of this
constituent concept, however, can’t be located in F5 (etc.) for the reasons
mentioned above. Given that conceptual representations are compositional, the
neural correlate of the composed concept [grasp(X, mango)],
consequently, can’t be located exhaustively – i.e., with all constituents – in
F5, either. What strikes me as a better hypothesis about the representational
function of those mirror neurons is that they represent the complex syntactic
structure needed to generate [grasp(X, mango)], something
like [grasp(X, Y)] or even [Z(X, Y)].
The fact that a single mirror neuron prefers mangos over bananas as the Y-argument
might be a contingent fact without any representational function, just like a
redness neuron in V4 still is a representation of redness rather than
redness-on-the-upper-left, although it is sensitive only to red objects on the
upper left corner of the visual field. The alternative hypothesis might then,
too, explain why F5 is indeed the primate homolog of Broca’s area, which, as we
all know, has traditionally been thought of as a syntax processing unit.
References:
Barsalou, L. W.
(1999). Perceptual symbol systems. Behavioral and Brain Sciences, 22,
577-660.
Dixon, R. M.W. (1999). Adjectives. In K. Brown, J. Miller, & R. E.
Asher (Eds.), Concise encyclopedia of grammatical categories. Amsterdam: Elsevier.
Ingram, D.
(1989). First language acquisition; method, description and explanation. Cambridge: Cambridge University Press.
Martin, A., Wiggs,
C. L., Ungerleider, L., & Haxby, J. V. (1996). Neural correlates of category-specific
knowledge. Nature, 379, 649-52.
Millikan, R. G.
(1998). A common structure for concepts of individuals, stuffs and real kinds:
More Mama, more milk, and more mouse. Behavioral and Brain Sciences, 21,
55-100.
Pulvermüller,
F. (1999). Words in the brain’s language. Behavioral and Brain Sciences,
22, 253-279.
Rappelsberger,
P., Weiss, S., & Schack, B. (2000). Coherence and phase relations between
EEG traces recorded from different locations. In R. Müller (Ed.), Time and
the brain (p. 297-330). Harwood Academic Publishers.
Werning, M.
(2003). Synchrony and composition: Toward a cognitive architecture between
classicism and connectionism. In B. Löwe, W. Malzkorn, & T. Raesch (Eds.), Applications of mathematical logic in
philosophy and linguistics (p. 261-278). Dordrecht: Kluwer.
Werning, M. (2004). The temporal dimension
of thought: Cortical foundations of predicative representation. Synthese.
(Forthcoming)