Perceptual Surface Reconstruction

Månsson, Jens LU (2005) In Lund University Cognitive Studies 129.
Abstract
How does the brain transform the 2-D light arrays in our eyes into a meaningful 3-D description of the surfaces around us? What assumptions does the visual system make about the world when information is incomplete? And how are these assumptions computationally expressed in this perceptual reconstruction process? These questions, and other aspects of binocular depth perception, are analysed from a theoretical and computational perspective, as well as through empirical investigations.



In paper one, the fundamentals of stereopsis are briefly reviewed, with particular attention to the difficulties of resolving the (stereo) correspondence problem. A computational model of stereopsis is then proposed that seeks (binocularly) matching left-right image regions by finding the highest area correlation, computed on a derivative of the original images, at three different scales. A number of simulations are presented and discussed.



In the second paper, the computational difficulties associated with identifying object boundaries are addressed. A computational model is proposed that, given contextual information, selects image primitives that are (statistically) common along occluding edges and connects them into smooth contours. The selection is guided by a set of simple heuristics, based on findings that the response of cortical cells tuned to a certain image primitive can be modulated by information from outside their "classical" receptive field.



In paper three, the justification for using the uniqueness constraint as an absolute constraint in stereo models is questioned. A stereo algorithm is proposed that uses a relaxed form of this constraint and allows multiple matches when a one-to-one correspondence does not exist between the left and right image primitives. The central mechanism in the model produces binocular matches that preserve the relative ordering of image primitives.



Paper four describes an empirical study in which sparse random-dot stereograms were used to investigate how depth is perceived in ambiguous image regions that lack explicit disparity information. The results suggest that both the binocular disparity content and interocularly unpaired image elements affect the interpolation of depth in stereoscopic displays.
author
  • Månsson, Jens
opponent
  • Professor Åström, Karl, Matematikcentrum, Lunds universitet
publishing date
2005
type
Thesis
publication status
published
keywords
Psychology, Computer science, systems, control, numerical analysis, Surface, Contour, Depth, Vision, Binocular
in
Lund University Cognitive Studies
volume
129
pages
167 pages
publisher
Department of Cognitive Science, Lund University
defense location
Sal 318, Kungshuset, Lundagård, Lund
defense date
2005-12-20 09:00
external identifiers
  • other:LUHFDA/HFKO-1016-SE
ISSN
1101-8453
ISBN
91-974741-5-0
language
English
LU publication?
yes
id
9b17d264-e6fc-48d9-95b8-72eaf69bb811 (old id 545895)
date added to LUP
2007-09-11 14:35:27
date last changed
2016-09-19 08:44:53
@phdthesis{9b17d264-e6fc-48d9-95b8-72eaf69bb811,
  abstract     = {How does the brain transform the 2-D light arrays in our eyes into a meaningful 3-D description of the surfaces around us? What assumptions does the visual system make about the world when information is incomplete? And how are these assumptions computationally expressed in this perceptual reconstruction process? These questions, and other aspects of binocular depth perception, are analysed from a theoretical and computational perspective, as well as through empirical investigations.

In paper one, the fundamentals of stereopsis are briefly reviewed, with particular attention to the difficulties of resolving the (stereo) correspondence problem. A computational model of stereopsis is then proposed that seeks (binocularly) matching left-right image regions by finding the highest area correlation, computed on a derivative of the original images, at three different scales. A number of simulations are presented and discussed.

In the second paper, the computational difficulties associated with identifying object boundaries are addressed. A computational model is proposed that, given contextual information, selects image primitives that are (statistically) common along occluding edges and connects them into smooth contours. The selection is guided by a set of simple heuristics, based on findings that the response of cortical cells tuned to a certain image primitive can be modulated by information from outside their ``classical'' receptive field.

In paper three, the justification for using the uniqueness constraint as an absolute constraint in stereo models is questioned. A stereo algorithm is proposed that uses a relaxed form of this constraint and allows multiple matches when a one-to-one correspondence does not exist between the left and right image primitives. The central mechanism in the model produces binocular matches that preserve the relative ordering of image primitives.

Paper four describes an empirical study in which sparse random-dot stereograms were used to investigate how depth is perceived in ambiguous image regions that lack explicit disparity information. The results suggest that both the binocular disparity content and interocularly unpaired image elements affect the interpolation of depth in stereoscopic displays.},
  author       = {Månsson, Jens},
  isbn         = {91-974741-5-0},
  issn         = {1101-8453},
  keyword      = {Psychology,Computer science,systems,control,numerical analysis,Surface,Contour,Depth,Vision,Binocular},
  language     = {eng},
  pages        = {167},
  publisher    = {Department of Cognitive Science, Lund University},
  school       = {Lund University},
  series       = {Lund University Cognitive Studies},
  title        = {Perceptual Surface Reconstruction},
  volume       = {129},
  year         = {2005},
}