Uncertainty Reduction: Ambiguity Resolution Mechanisms in Language

Ambiguity is a constant problem for any embodied cognitive agent with limited resources. Decisions need to be made, and their consequences understood, despite the probabilistic veil of uncertainty enveloping everything from sensory input to action execution. Clearly, there must be mechanisms for dealing with or resolving such ambiguity.

A nice sample domain for understanding ambiguity resolution is language, where problems of uncertainty have long been appreciated. The meaning of words in general (not to mention referents like "that" or "he") can be highly ambiguous (see "the gavagai problem"). Similar problems abound in grammar, famously in the case of garden path sentences ("the horse raced past the barn fell"), where grammatical ambiguities often go completely unnoticed until a disambiguating word is encountered ("fell").

Most accounts of language emphasize the distinction between semantics (the meanings of words) and syntax (the rules governing how words are put together - essentially, grammar). One might therefore suspect that ambiguity resolution in these two domains is separable. However, a classic Psychological Review article by MacDonald, Pearlmutter and Seidenberg (1994) describes a single ambiguity resolution mechanism that might operate on both semantics and syntax.

MacDonald et al emphasize that the same three issues turn up in both lexical and syntactic explorations of ambiguity resolution: the role of frequency information, the role of contextual constraints, and questions of modularity vs. distributed interactivity. I'll illustrate each with examples from both domains below:

Frequency Information. Words with multiple, approximately equally frequent meanings (e.g., "pitcher") show longer eye fixation times than words with either a single meaning or highly biased meanings (where one meaning is much more frequent than the others). Similarly, in grammatical processing, the interpretation of garden path sentences was presumed by Chomskian theory to be accomplished by a grammatical parser with no access to frequency information, and yet some work demonstrates that the frequency of the words used in garden path sentences may influence the interpretation subjects adopt to resolve those sentences' ambiguity.
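One way to make the frequency idea concrete is to treat each word's meanings as a probability distribution and measure its entropy: balanced multi-meaning words score highest, which is one way to formalize why they might demand longer fixations. This is my own toy sketch with invented counts, not anything from the paper:

```python
from math import log2

# Hypothetical per-meaning frequency counts (illustrative, not corpus data).
meaning_freqs = {
    "pitcher": {"baseball player": 50, "container": 50},       # balanced ambiguity
    "bank":    {"financial institution": 95, "river edge": 5}, # biased ambiguity
    "table":   {"furniture": 100},                             # unambiguous
}

def meaning_entropy(freqs):
    """Shannon entropy (in bits) of a word's meaning distribution.

    Balanced multi-meaning words score highest; single-meaning
    words score 0, mirroring the eye-fixation findings."""
    total = sum(freqs.values())
    probs = [f / total for f in freqs.values()]
    return -sum(p * log2(p) for p in probs if p > 0)

for word, freqs in meaning_freqs.items():
    print(word, round(meaning_entropy(freqs), 2))
```

On these made-up numbers, "pitcher" yields 1 bit of uncertainty, "bank" much less, and "table" none, tracking the reported ordering of fixation times.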

Contextual Information. In research on the influence of context on semantics, some studies have shown that words with multiple meanings have all the potential meanings activated automatically, while other studies have shown that the context in which the word appears does influence the extent to which certain meanings become activated (as determined through priming studies), even when the context doesn't seem to directly prime the ambiguous word's various meanings. Similarly, in research on syntax, context has been shown to influence the interpretation of garden path sentences, contradicting other accounts (e.g., minimal attachment algorithms) of garden path sentence processing.
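One simple way to picture contextual constraint (again a sketch of my own with made-up numbers, not the authors' model) is to treat meaning frequency as a prior and context as a compatibility score that reweights it:

```python
# Hypothetical values: equal frequency-based priors for "pitcher",
# reweighted by how well each meaning fits the preceding context
# (e.g., a sentence beginning "She filled the...").
prior = {"baseball player": 0.5, "container": 0.5}
context_fit = {"baseball player": 0.1, "container": 0.9}

def reweight(prior, fit):
    """Combine frequency and context multiplicatively, then renormalize."""
    raw = {m: prior[m] * fit[m] for m in prior}
    total = sum(raw.values())
    return {m: v / total for m, v in raw.items()}

posterior = reweight(prior, context_fit)
# "container" now dominates even though both meanings are equally frequent.
```

This captures the graded picture from the priming studies: context doesn't switch meanings off wholesale, it shifts how strongly each becomes activated.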

Representation: Modularity vs. Distributed Processing. Although many researchers emphasize that the multiple meanings of words seem to be accessed from memory, as though each meaning comprises a different record in a master database of all meanings (sometimes called the "mental lexicon"), other research has demonstrated that meanings interact with one another through the frequency and contextual effects described above. Thus, lexical access seems compatible with what might be expected from a distributed (i.e., connectionist) rather than modular (database-like, to simplify) representation. Similarly, in grammatical processing, early Chomskian theory presumed that grammatical rules were unrelated to the particular lexical entries of a particular language, whereas later Chomskian and related theories (e.g., Government and Binding theory) proposed a much tighter interaction between semantics and grammar, suggestive again of more distributed and less modular processing.

Rather than reflecting mere coincidence, MacDonald et al propose that the similar theoretical, methodological and empirical issues surrounding lexical and syntactic processing reflect a fundamentally similar mechanism underlying the resolution of ambiguity in all of linguistic processing.

Specifically, they suggest that the cortical representation of words is distributed, such that many neurons participate in the representation of many words, and that those representations differ mostly in the degrees to which various neurons contribute to them. Critically, these networks encode not only semantic information but also syntactic information (e.g., tense, voice, person, and gender). Nodes representing mutually compatible interpretations of a sentence are connected in an excitatory fashion, whereas those representing mutually incompatible interpretations are connected with inhibitory links; thus syntactic structures can be activated in a graded fashion, in contrast to the "all-or-none" selection of grammatical structures implied by other views.

In this system, ambiguity resolution is accomplished by a winner-take-all process, at both the level of individual words (which activate multiple meanings and associated grammatical structures) and at the level of the larger linguistic context (where the "winning" patterns of activity from previous words carry over to influence the activity elicited by the currently-processed word). The authors go on to account for a variety of syntactic ambiguities using this model, and demonstrate that the same lexical and contextual effects hold across these phenomena, as predicted by their unitary model of linguistic processing. It's interesting to note that some later models of past tense formation adopted a dual-route mechanism, one route relying on explicit rules and the other on associative memory. Although MacDonald et al advocate a single mechanism, they do not appear to have implemented their theory at full scope, so it's unclear what kinds of architectural changes might be necessary to make it work properly.
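The constraint-satisfaction dynamics can be sketched in a few lines. This is a toy interactive-activation network of my own construction, not MacDonald et al.'s implementation; the nodes, weights, and initial activations for the garden-path sentence are all illustrative:

```python
# Candidate interpretation nodes for "the horse raced past the barn fell".
nodes = ["raced=main-verb", "raced=reduced-relative", "fell=main-verb"]

# weights[i][j]: influence of node j on node i. Incompatible readings
# inhibit each other (negative); compatible readings excite (positive):
# "raced" as main verb cannot coexist with "fell" as main verb, while the
# reduced-relative reading of "raced" fits with it.
weights = [
    [0.0, -0.8, -0.8],
    [-0.8, 0.0, 0.6],
    [-0.8, 0.6, 0.0],
]

# Initial activation favors the frequent main-verb reading of "raced";
# encountering "fell" injects strong evidence for its own node.
act = [0.7, 0.3, 0.9]

def settle(act, weights, steps=50, rate=0.2):
    """Repeatedly update activations from weighted input, clamped to [0, 1]."""
    for _ in range(steps):
        net = [sum(weights[i][j] * act[j] for j in range(len(act)))
               for i in range(len(act))]
        act = [min(1.0, max(0.0, a + rate * n)) for a, n in zip(act, net)]
    return act

final = settle(act, weights)
```

After settling, the initially-favored main-verb reading of "raced" is driven to zero while the mutually compatible pair wins - a small-scale version of the graded, frequency- and context-sensitive winner-take-all process described above.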


It occurs to me that examining the responses to LOLcat-style sentences might yield interesting information on this issue....

By David Harmon (not verified) on 04 Oct 2007 #permalink

Without ever referring to neurons or the brain, there is an algorithm that removes ambiguity over any set of potential patterns, provided there is at least some criterion to discriminate between the patterns (of course), no matter how entangled their other traits may be (and no matter what the field of experience is or what the patterns are about).
This is the closure operator of Formal Concept Analysis, which in a sense implements our "pars pro toto" recognition capability.

A good (but very technical) paper on this is What Is A Concept? by Chris Hillman (as I remember, the first 4 or 5 pages are enough to grasp the idea behind FCA and closure operators).
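The closure operator is easy to illustrate with a toy formal context (the objects and attributes below are invented for the example, not taken from Hillman's paper): the closure of an attribute set B is B'' - all attributes shared by every object that has all of B.

```python
# A toy formal context: objects mapped to their attribute sets.
context = {
    "sparrow": {"flies", "has-feathers", "lays-eggs"},
    "penguin": {"has-feathers", "lays-eggs", "swims"},
    "bat":     {"flies", "nurses-young"},
}

def extent(attrs):
    """Objects possessing every attribute in `attrs`."""
    return {o for o, a in context.items() if attrs <= a}

def intent(objs):
    """Attributes shared by every object in `objs`."""
    sets = [context[o] for o in objs]
    return set.intersection(*sets) if sets else set.union(*context.values())

def closure(attrs):
    """B'' : the smallest closed attribute set containing `attrs`."""
    return intent(extent(attrs))

closure({"lays-eggs"})  # {"has-feathers", "lays-eggs"}
```

From the partial cue {"lays-eggs"}, the closure recovers everything that reliably co-occurs with it across the matching objects - the "pars pro toto" recognition the comment describes.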

If even computer scientists can do this, why couldn't the brain?

There is no need for any kind of structure in the traits involved, only that they can be probed; that matches the "single mechanism" of MacDonald.

By Kevembuangga (not verified) on 04 Oct 2007 #permalink