Two-streams hypothesis

The two-streams hypothesis is a widely accepted and influential model of the neural processing of vision as well as hearing.[1] The hypothesis, given its most popular characterisation in a paper by David Milner and Melvyn A. Goodale in 1992, argues that humans possess two distinct visual systems.[2] However, recently there seems to be evidence of two distinct auditory systems as well. As visual information exits the occipital lobe, and as sound leaves the phonological network, it follows two main pathways, or "streams". The ventral stream (also known as the "what pathway") is involved with object and visual identification and recognition. The dorsal stream (or, "where pathway") is involved with processing the object's spatial location relative to the viewer and with speech repetition.
History
Several researchers had proposed similar ideas previously. The authors themselves credit the inspiration of work on blindsight by Weiskrantz, and previous neuroscientific vision research. Schneider first proposed the existence of two visual systems for localisation and identification in 1969.[3] Ingle described two independent visual systems in frogs in 1973.[4] Ettlinger reviewed the existing neuropsychological evidence of a distinction in 1990.[5] Moreover, Trevarthen had offered an account of two separate mechanisms of vision in monkeys back in 1968.[6]
In 1982 Ungerleider and Mishkin distinguished the dorsal and ventral streams, as processing spatial and visual features respectively, from their lesion studies of monkeys – proposing the original where vs what distinction.[7] Though this framework was superseded by that of Milner & Goodale, it remains influential.[8]
One hugely influential source of information that has informed the model has been experimental work exploring the extant abilities of visual agnosic patient D.F. The first, and most influential report, came from Goodale and colleagues in 1991[9] and work is still being published on her two decades later.[10] This has been the focus of some criticism of the model due to the perceived over-reliance on findings from a single case.
Two visual systems
Goodale and Milner amassed an array of anatomical, neuropsychological, electrophysiological, and behavioural evidence for their model, according to which the ventral ‘perceptual’ stream provides the rich and detailed representation of the visual world required for cognitive operations whereas the dorsal ‘action’ stream transforms incoming visual information into the required coordinates for skilled motor behaviour.[2] The model also posits that visual perception encodes the object properties (e.g. size) relative to the properties of other visible objects; in other words it utilises relative metrics and scene-based frames of reference. Whereas the visual control of action uses absolute metrics determined via egocentric frames of reference, computing the actual properties of objects relative to the observer. Thus, grasping movements directed towards objects embedded in size-contrast illusions have been shown to escape the effects of these pictorial illusions, as different frames of references and metrics are involved in the production of the illusion versus production of the grasping act.[11]
Norman proposed a similar dual-process model of vision, and described eight main differences between the two systems consistent with other two-system models.[12]
| Factor | Ventral system | Dorsal system | 
|---|---|---|
| Function | Recognition/identification | Visually guided behaviour | 
| Sensitivity | High spatial frequencies - details | High temporal frequencies - motion | 
| Memory | Long term stored representations | Only very short-term storage | 
| Speed | Relatively slow | Relatively fast | 
| Consciousness | Typically high | Typically low | 
| Frame of reference | Allocentric or object-centered | Egocentric or viewer-centered | 
| Visual input | Mainly foveal or parafoveal | Across retina | 
| Monocular vision | Generally reasonably small effects | Often large effects e.g. motion parallax | 
Dorsal stream
The dorsal stream is proposed to be involved in the guidance of actions and recognizing where objects are in space. Also known as the parietal stream, the "where" stream, or the "how" stream, this pathway stretches from the primary visual cortex (V1) in the occipital lobe forward into the parietal lobe. It is interconnected with the parallel ventral stream (the "what" stream) which runs downward from V1 into the temporal lobe.
General features
The dorsal stream is involved in spatial awareness and guidance of actions (e.g., reaching). In this it has two distinct functional characteristics—it contains a detailed map of the visual field, and is also good at detecting and analyzing movements.
The dorsal stream commences with purely visual functions in the occipital lobe before gradually transferring to spatial awareness at its termination in the parietal lobe.
The posterior parietal cortex is essential for "the perception and interpretation of spatial relationships, accurate body image, and the learning of tasks involving coordination of the body in space".[13]
It contains individually functioning lobules. The lateral intraparietal sulcus (LIP) contains neurons that produce enhanced activation when attention is moved onto the stimulus or the animal saccades towards a visual stimulus, and the ventral intraparietal sulcus (VIP) where visual and somatosensory information are integrated.
Effects of damage or lesions
Damage to the posterior parietal cortex causes a number of spatial disorders including:
- Simultanagnosia: where the patient can only describe single objects without the ability to perceive it as a component of a set of details or objects in a context (as in a scenario, e.g. the forest for the trees).
- Optic ataxia: where the patient can't use visuospatial information to guide arm movements.
- Hemispatial neglect: where the patient is unaware of the contralesional half of space (that is, they are unaware of things in their left field of view and focus only on objects in the right field of view; or appear unaware of things in one field of view when they perceive them in the other). For example, a person with this disorder may draw a clock, and then label it from 12, 1, 2, ..., 6, but then stop and consider their drawing complete.
- Akinetopsia: inability to perceive motion.
- Apraxia: inability to produce discretionary or volitional movement in the absence of muscular disorders.
Ventral stream
The ventral stream is associated with object recognition and form representation. Also described as the "what" stream, it has strong connections to the medial temporal lobe (which stores long-term memories), the limbic system (which controls emotions), and the dorsal stream (which deals with object locations and motion).
The ventral stream gets its main input from the parvocellular (as opposed to magnocellular) layer of the lateral geniculate nucleus of the thalamus. These neurons project to V1 sublayers 4Cβ, 4A, 3B and 2/3a[14] successively. From there, the ventral pathway goes through V2 and V4 to areas of the inferior temporal lobe: PIT (posterior inferotemporal), CIT (central inferotemporal), and AIT (anterior inferotemporal). Each visual area contains a full representation of visual space. That is, it contains neurons whose receptive fields together represent the entire visual field. Visual information enters the ventral stream through the primary visual cortex and travels through the rest of the areas in sequence.
Moving along the stream from V1 to AIT, receptive fields increase their size, latency, and the complexity of their tuning.
All the areas in the ventral stream are influenced by extraretinal factors in addition to the nature of the stimulus in their receptive field. These factors include attention, working memory, and stimulus salience. Thus the ventral stream does not merely provide a description of the elements in the visual world—it also plays a crucial role in judging the significance of these elements.
Damage to the ventral stream can cause inability to recognize faces or interpret facial expression [15]
Two auditory systems
Ventral stream

Along with the ventral pathway being important for visual processing, it is also important for processing auditory information. During processing, when sound waves enter the ear, this information is transduced into an unanalyzed auditory object. The subcortical auditory pathway then relays the information to the auditory cortex in the dorsal superior temporal gyrus (dSTG). In the dSTG, it is then divided into its constituent phones. These phones are then recognized into phonetic words in the superior temporal sulcus (STS). The information first enters the ventral stream at the posterior middle temporal gyrus and the posterior inferior temporal sulcus p(MTG+ITS). Here the auditory words are converted into semantic words. These words are then converted into a semantic phrase by the first combinatorial net at the anterior portion of the middle temporal lobe and then converted into a syntactic noun phrase at the second combinatorial net. This second combinatorial net is location in the anterior inferior temporal gyrus.[16]
Dorsal stream
The function of the dorsal pathway is to map auditory sensory representations onto articulatory motor representations. Hickok & Poeppel claim that the dorsal pathway is necessary because, "learning to speak is essentially a motor learning task. The primary input to this is sensory, speech in particular. So, there must be a neural mechanism that both codes and maintains instances of speech sounds, and can use these sensory traces to guide the tuning of speech gestures so that the sounds are accurately reproduced." [17]

Similarly to the ventral stream's auditory processing, information enters the ear and then into the superior temporal gyrus and then finally the superior temporal sulcus. From there the information moves to the beginning of the dorsal pathway, which is located at the boundary of the temporal and parietal lobes near the Sylvian fissure. The first step of the dorsal pathway begins in the sensorimotor interface, located in the left Sylvian parietal temporal (Spt) (within the Sylvian fissure at the parietal-temporal boundary). The spt is important for perceiving and reproducing sounds. This is evident because its ability to acquire new vocabulary, be disrupted by lesions and auditory feedback on speech production, articulatory decline in late-onset deafness and the non-phonological residue of Wernicke's aphasia; deficient self-monitoring. It is also important for the basic neuronal mechanisms for phonological short-term memory. Without the Spt, language acquisition is impaired. The information then moves onto the articulatory network, which is divided into two separate parts. The articulatory network 1, which processes motor syllable programs, is located in the left posterior inferior temporal gyrus and Brodmann's area 44 (pIFG-BA44). The articulatory network 2 is for motor phoneme programs and is located in the left M1-vBA6.[18]
Conduction aphasia affects a subject's ability to reproduce speech (typically by repetition), though it has no influence on the subject's ability to comprehend spoken language. This shows that conduction aphasia must reflect an impairment of the ventral pathway but instead of the dorsal pathway. Hickok and Poeppel found that conduction aphasia can be the result of damage, particularly lesions, to the Spt (Sylvian parietal temporal). This is shown by the Spt's involvement in acquiring new vocabulary, for while experiments have shown that most conduction aphasiacs can repeat high-frequency, simple words, their ability to repeat low-frequency, complex words is impaired. The Spt is responsible for connecting the motor and auditory systems by making auditory code accessible to the motor cortex. It appears that the motor cortex recreates high-frequency, simple words (like cup) in order to more quickly and efficiently access them, while low-frequency, complex words (like Sylvian parietal temporal) require more active, online regulation by the Spt. This explains why conduction aphasiacs have particular difficulty with low-frequency words which requires a more hands-on process for speech production. "Functionally, conduction aphasia has been characterized as a deficit in the ability to encode phonological information for production," namely because of a disruption in the motor-auditory interface.[19] Conduction aphasia has been more specifically related to damage of the arcuate fasciculus, which is vital for both speech and language comprehension, as the arcuate fasiculus makes up the connection between Broca and Wernicke's areas.[19]
Criticisms
Goodale & Milner's innovation was to shift the perspective from an emphasis on input distinctions, such as object location versus properties, to an emphasis on the functional relevance of vision to behaviour, for perception or for action. Contemporary perspectives however, informed by empirical work over the past two decades, offer a more complex account than a simple separation of function into two-streams.[20] Recent experimental work for instance has challenged these findings, and has suggested that the apparent dissociation between the effects of illusions on perception and action is due to differences in attention, task demands, and other confounds.[21][22] There are other empirical findings, however, that cannot be so easily dismissed which provide strong support for the idea that skilled actions such as grasping are not affected by pictorial illusions.[23][24][25][26]
Moreover, recent neuropsychological research has questioned the validity of the dissociation of the two streams that has provided the cornerstone of evidence for the model. The dissociation between visual agnosia and optic ataxia has been challenged by several researchers as not as strong as originally portrayed; Hesse and colleagues demonstrated dorsal stream impairments in patient DF;[27] Himmelbach and colleagues reassessed DF's abilities and applied more rigorous statistical analysis demonstrating that the dissociation wasn't as strong as first thought.[10]
A 2010 review of the accumulated evidence for the model concluded that whilst the spirit of the model has been vindicated the independence of the two streams has been overemphasised.[28] Goodale & Milner themselves have proposed the analogy of tele-assistance, one of the most efficient schemes devised for the remote control of robots working in hostile environments. In this account, the dorsal stream is viewed as a semi-autonomous function that operates under guidance of executive functions which themselves are informed by ventral stream processing.[29]
Thus the emerging perspective within neuropsychology and neurophysiology is that, whilst a two-systems framework was a necessary advance to stimulate study of the highly complex and differentiated functions of the two neural pathways; the reality is more likely to involve considerable interaction between vision-for-action and vision-for-perception. Rob McIntosh and Thomas Schenk summarize this position as follows:
We should view the model not as a formal hypothesis, but as a set of heuristics to guide experiment and theory. The differing informational requirements of visual recognition and action guidance still offer a compelling explanation for the broad relative specializations of dorsal and ventral streams. However, to progress the field, we may need to abandon the idea that these streams work largely independently of one other, and to address the dynamic details of how the many visual brain areas arrange themselves from task to task into novel functional networks.[28]:62
References
- ↑ Eyesenck MW, Keane MT (2010). "Cognitive Psychology: A Student's Handbook". Hove, UK: Psychology Press.
- 1 2 Goodale MA, Milner AD (1992). "Separate visual pathways for perception and action". Trends Neurosci. 15 (1): 20–5. doi:10.1016/0166-2236(92)90344-8. PMID 1374953.
- ↑ Schneider, GE. (Feb 1969). "Two visual systems.". Science. 163 (3870): 895–902. doi:10.1126/science.163.3870.895. PMID 5763873.
- ↑ Ingle, D. (Sep 1973). "Two visual systems in the frog.". Science. 181 (4104): 1053–5. doi:10.1126/science.181.4104.1053. PMID 4542178.
- ↑ Ettlinger G. (1990). ""Object vision" and "spatial vision": the neuropsychological evidence for the distinction.". Cortex. 26 (3): 319–41. doi:10.1016/s0010-9452(13)80084-6. PMID 2123426.
- ↑ Trevarthen, CB. (1968). "Two mechanisms of vision in primates.". Psychol Forsch. 31 (4): 299–348. doi:10.1007/bf00422717. PMID 4973634.
- ↑ Mishkin M, Ungerleider LG (1982). "Contribution of striate inputs to the visuospatial functions of parieto-preoccipital cortex in monkeys.". Behav Brain Res. 6 (1): 57–77. doi:10.1016/0166-4328(82)90081-X. PMID 7126325.
- ↑ Schenk, Thomas; McIntosh, Robert D. (2010). "Do we have independent visual streams for perception and action?". Cognitive Neuroscience. 1 (1): 52–62. doi:10.1080/17588920903388950. ISSN 1758-8928.
- ↑ Goodale, MA.; Milner, AD.; Jakobson, LS.; Carey, DP. (Jan 1991). "A neurological dissociation between perceiving objects and grasping them.". Nature. 349 (6305): 154–6. doi:10.1038/349154a0. PMID 1986306.
- 1 2 Himmelbach, M.; Boehme, R.; Karnath, HO. (Jan 2012). "20 years later: a second look on DF's motor behaviour.". Neuropsychologia. 50 (1): 139–44. doi:10.1016/j.neuropsychologia.2011.11.011. PMID 22154499.
- ↑ Aglioti S, DeSouza JF, Goodale MA (1995). "Size-contrast illusions deceive the eye but not the hand.". Curr. Biol. 5 (6): 679–85. doi:10.1016/S0960-9822(95)00133-3. PMID 7552179.
- ↑ Norman J. (2002). "Two visual systems and two theories of perception: An attempt to reconcile the constructivist and ecological approaches". Behav Brain Sci. 25: 73–144. doi:10.1017/s0140525x0200002x.
- ↑ Mark F Bear; Barry Connors; Michael Paradiso (2007). Neuroscience: Exploring the Brain. Hagerstown, MD: Lippincott Williams & Wilkins. ISBN 0-7817-6003-8.
- ↑ Lamme, Victor AF; Supèr, Hans; Spekreijse, Henk (1998). "Feedforward, horizontal, and feedback processing in the visual cortex". Current Opinion in Neurobiology. 8 (4): 529–535. doi:10.1016/S0959-4388(98)80042-1. PMID 9751656.
- ↑ http://www.ssc.education.ed.ac.uk/courses/vi&multi/vdec082ii.html
- ↑ Howard, Harry. "The ventral pathway". Brain and Language. Retrieved 5 December 2015.
- ↑ Hickok, Gregory. "The cortical organization of speech processing: Feedback control and predictive coding the context of a dual-stream model". NCBI. PMC. Retrieved 5 December 2015.
- ↑ Howard, Harry. "The dorsal stream". Brain and Language. Retrieved 5 December 2015.
- 1 2 Howard, Harry. "The sensorimotor interface". Brain and Language.
- ↑ Milner, AD.; Goodale, MA. (February 2008). "Two visual systems re-viewed.". Neuropsychologia. 46 (3): 774–85. doi:10.1016/j.neuropsychologia.2007.10.005. PMID 18037456.
- ↑ Franz VH, Gegenfurtner KR, Bülthoff HH, Fahle M (2000). "Grasping visual illusions: no evidence for a dissociation between perception and action.". Psychol Sci. 11 (1): 20–5. doi:10.1111/1467-9280.00209. PMID 11228838.
- ↑ Franz VH, Scharnowski F, Gegenfurtner (2005). "Illusion effects on grasping are temporally constant not dynamic.". J Exp Psychol Hum Percept Perform. 31 (6): 1359–78. doi:10.1037/0096-1523.31.6.1359. PMID 16366795.
- ↑ Ganel T, Goodale MA (2003). "Visual control of action but not perception requires analytical processing of object shape.". Nature. 426 (6967): 664–7. doi:10.1038/nature02156. PMID 14668865.
- ↑ Ganel T, Tanzer M, Goodale MA (2008). "A double dissociation between action and perception in the context of visual illusions: opposite effects of real and illusory size.". Psych. Sci. 19 (3): 221–5. doi:10.1111/j.1467-9280.2008.02071.x. PMID 18315792.
- ↑ Cardoso-Leite, Pedro; Gorea, Andrei (2010). "On the Perceptual/Motor Dissociation: A Review of Concepts, Theory, Experimental Paradigms and Data Interpretations". Seeing and Perceiving. 23 (2): 89–151. doi:10.1163/187847510X503588. PMID 20550823.
- ↑ Goodale MA. (2011). "Transforming vision into action.". Vision Res. 51 (14): 1567–87. doi:10.1016/j.visres.2010.07.027. PMID 20691202.
- ↑ Hesse, C.; Ball, K.; Schenk, T. (Jan 2012). "Visuomotor performance based on peripheral vision is impaired in the visual form agnostic patient DF.". Neuropsychologia. 50 (1): 90–7. doi:10.1016/j.neuropsychologia.2011.11.002. PMID 22085864.
- 1 2 McIntosh, RD.; Schenk, T. (May 2009). "Two visual streams for perception and action: current trends.". Neuropsychologia. 47 (6): 1391–6. doi:10.1016/j.neuropsychologia.2009.02.009. PMID 19428404.
- ↑ Milner, A.D.; Goodale, M.A. (2006), The Visual Brain in Action, ISBN 978-0-19-852472-4, retrieved 2012-12-06
|  | Wikimedia Commons has media related to Ventral and dorsal stream. |