Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Off-line Foveated Compression and Scene Perception: An Eye-Tracking Approach

Nyström, Marcus LU orcid (2008)
Abstract
With the continued growth of digital services offering storage and communication of pictorial information, the need to efficiently represent this information has become increasingly important, both from an information theoretic and a perceptual point of view.



There has been a recent interest to design systems for efficient representation and compression of image and video data that take the features of the human visual system into account. One part of this thesis investigates whether knowledge about viewers' gaze positions as measured by an eye-tracker can be used to improve compression efficiency of digital video; regions not directly looked at by a number of previewers are lowpass filtered. This type of video... (More)
With the continued growth of digital services offering storage and communication of pictorial information, the need to efficiently represent this information has become increasingly important, both from an information theoretic and a perceptual point of view.



There has been a recent interest to design systems for efficient representation and compression of image and video data that take the features of the human visual system into account. One part of this thesis investigates whether knowledge about viewers' gaze positions as measured by an eye-tracker can be used to improve compression efficiency of digital video; regions not directly looked at by a number of previewers are lowpass filtered. This type of video manipulation is called off-line foveation. The amount of compression due to off-line foveation is assessed along with how it affects new viewers' gazing behavior as well as subjective quality. We found additional bitrate savings up to 50% (average 20%) due to off-line foveation prior to compression, without decreasing the subjective quality.



In off-line foveation, it would be of great benefit to algorithmically predict where viewers look without having to perform eye-tracking measurements. In the first part of this thesis, new experimental paradigms combined with eye-tracking are used to understand the mechanisms behind gaze control during scene perception, thus investigating the prerequisites for such algorithms. Eye-movements are recorded from observers viewing contrast manipulated images depicting natural scenes under a neutral task. We report that image semantics, rather than the physical image content itself, largely dictates where people choose to look. Together with recent work on gaze prediction in video, the results in this thesis give only moderate support for successful applicability of algorithmic gaze prediction for off-line foveated video compression. (Less)
Please use this url to cite or link to this publication:
author
supervisor
opponent
  • PD Dr.-Ing. Barth, Erhardt, University of Lübeck, Germany
organization
publishing date
type
Thesis
publication status
published
subject
keywords
gaze prediction, off-line foveation, scene perception, eye-tracking, compression
pages
139 pages
publisher
Department of Electrical and Information Technology, Lund University
defense location
Room E:C, E-building, Ole Römers väg 3, Faculty of Engineering, Lund university
defense date
2008-09-26 13:15:00
language
English
LU publication?
yes
id
0f01bc03-ce12-44d5-9b73-3fc448102aeb (old id 1222627)
date added to LUP
2016-04-01 13:13:51
date last changed
2022-10-14 13:30:55
@phdthesis{0f01bc03-ce12-44d5-9b73-3fc448102aeb,
  abstract     = {{With the continued growth of digital services offering storage and communication of pictorial information, the need to efficiently represent this information has become increasingly important, both from an information theoretic and a perceptual point of view.<br/><br>
<br/><br>
There has been a recent interest to design systems for efficient representation and compression of image and video data that take the features of the human visual system into account. One part of this thesis investigates whether knowledge about viewers' gaze positions as measured by an eye-tracker can be used to improve compression efficiency of digital video; regions not directly looked at by a number of previewers are lowpass filtered. This type of video manipulation is called off-line foveation. The amount of compression due to off-line foveation is assessed along with how it affects new viewers' gazing behavior as well as subjective quality. We found additional bitrate savings up to 50% (average 20%) due to off-line foveation prior to compression, without decreasing the subjective quality.<br/><br>
<br/><br>
In off-line foveation, it would be of great benefit to algorithmically predict where viewers look without having to perform eye-tracking measurements. In the first part of this thesis, new experimental paradigms combined with eye-tracking are used to understand the mechanisms behind gaze control during scene perception, thus investigating the prerequisites for such algorithms. Eye-movements are recorded from observers viewing contrast manipulated images depicting natural scenes under a neutral task. We report that image semantics, rather than the physical image content itself, largely dictates where people choose to look. Together with recent work on gaze prediction in video, the results in this thesis give only moderate support for successful applicability of algorithmic gaze prediction for off-line foveated video compression.}},
  author       = {{Nyström, Marcus}},
  keywords     = {{gaze prediction; off-line foveation; scene perception; eye-tracking; compression}},
  language     = {{eng}},
  publisher    = {{Department of Electrical and Information Technology, Lund University}},
  school       = {{Lund University}},
  title        = {{Off-line Foveated Compression and Scene Perception: An Eye-Tracking Approach}},
  url          = {{https://lup.lub.lu.se/search/files/3245521/1222635.pdf}},
  year         = {{2008}},
}