Advanced

Empirical Observations regarding predictability in user access behavior in a distributed digital library system

Hollmann, J.; Ardö, Anders LU and Stenström, P. (2002) 16th IEEE International Parallel and Distributed Processing Symposium In Proceedings of the 16th IEEE International Parallel and Distributed Processing Symposium p.221-228
Abstract
Today document archives are geographically distributed

but often not replicated. This can potentially result in a

low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that... (More)
Today document archives are geographically distributed

but often not replicated. This can potentially result in a

low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that some of them will be eventually downloaded. This suggests that prefetching can be used to hide the article transfer latency. We have also found that 80% of the article downloads are confined to less

than 20% of the journals, so preloading a small fraction of the digital library database could significantly shorten the access latency and improve the availability. (Less)
Please use this url to cite or link to this publication:
author
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
Proceedings of the 16th IEEE International Parallel and Distributed Processing Symposium
pages
221 - 228
publisher
IEEE--Institute of Electrical and Electronics Engineers Inc.
conference name
16th IEEE International Parallel and Distributed Processing Symposium
external identifiers
  • scopus:0036020883
ISBN
0-7695-1573-8
language
English
LU publication?
no
id
c5fb8ae0-f289-4953-b529-45e1554bf78f (old id 606259)
alternative location
http://ieeexplore.ieee.org/iel5/7926/21854/01016636.pdf?tp=&isnumber=21854&arnumber=1016636
http://www.eit.lth.se/fileadmin/eit/home/hs.aar/Publ/PDP2002.pdf
date added to LUP
2007-12-03 15:56:59
date last changed
2017-07-30 04:59:31
@inproceedings{c5fb8ae0-f289-4953-b529-45e1554bf78f,
  abstract     = {Today document archives are geographically distributed<br/><br>
but often not replicated. This can potentially result in a<br/><br>
low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that some of them will be eventually downloaded. This suggests that prefetching can be used to hide the article transfer latency. We have also found that 80% of the article downloads are confined to less<br/><br>
than 20% of the journals, so preloading a small fraction of the digital library database could significantly shorten the access latency and improve the availability.},
  author       = {Hollmann, J. and Ardö, Anders and Stenström, P.},
  booktitle    = {Proceedings of the 16th IEEE International Parallel and Distributed Processing Symposium},
  isbn         = {0-7695-1573-8},
  language     = {eng},
  pages        = {221--228},
  publisher    = {IEEE--Institute of Electrical and Electronics Engineers Inc.},
  title        = {Empirical Observations regarding predictability in user access behavior in a distributed digital library system},
  year         = {2002},
}