Empirical Observations regarding predictability in user access behavior in a distributed digital library system
(2002) 16th IEEE International Parallel and Distributed Processing Symposium p.221-228- Abstract
- Today document archives are geographically distributed
but often not replicated. This can potentially result in a
low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that... (More) - Today document archives are geographically distributed
but often not replicated. This can potentially result in a
low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that some of them will be eventually downloaded. This suggests that prefetching can be used to hide the article transfer latency. We have also found that 80% of the article downloads are confined to less
than 20% of the journals, so preloading a small fraction of the digital library database could significantly shorten the access latency and improve the availability. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/606259
- author
- Hollmann, J. ; Ardö, Anders LU and Stenström, P.
- publishing date
- 2002
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- host publication
- Proceedings of the 16th IEEE International Parallel and Distributed Processing Symposium
- pages
- 221 - 228
- publisher
- IEEE - Institute of Electrical and Electronics Engineers Inc.
- conference name
- 16th IEEE International Parallel and Distributed Processing Symposium
- conference dates
- 2002-04-15 - 2002-04-19
- external identifiers
-
- scopus:0036020883
- ISBN
- 0-7695-1573-8
- language
- English
- LU publication?
- no
- id
- c5fb8ae0-f289-4953-b529-45e1554bf78f (old id 606259)
- alternative location
- http://ieeexplore.ieee.org/iel5/7926/21854/01016636.pdf?tp=&isnumber=21854&arnumber=1016636
- http://www.eit.lth.se/fileadmin/eit/home/hs.aar/Publ/PDP2002.pdf
- date added to LUP
- 2016-04-04 10:33:56
- date last changed
- 2022-01-29 20:31:30
@inproceedings{c5fb8ae0-f289-4953-b529-45e1554bf78f, abstract = {{Today document archives are geographically distributed<br/><br> but often not replicated. This can potentially result in a<br/><br> low quality of service in terms of reduced availability and long user-perceived access times. Instead of indiscriminate replication we study the effectiveness of caching techniques such as prefetching and selective preloading. Our technique analyzes whether user access behavior is predictable enough to guess what articles to prefetch or to preload based on access logs from DADS, a digital library system for scientific journal articles developed at DTV, the Technical Knowledge Center of Denmark. We have found that once a literature search has been narrowed to up to ten articles, there is a high likelihood that some of them will be eventually downloaded. This suggests that prefetching can be used to hide the article transfer latency. We have also found that 80% of the article downloads are confined to less<br/><br> than 20% of the journals, so preloading a small fraction of the digital library database could significantly shorten the access latency and improve the availability.}}, author = {{Hollmann, J. and Ardö, Anders and Stenström, P.}}, booktitle = {{Proceedings of the 16th IEEE International Parallel and Distributed Processing Symposium}}, isbn = {{0-7695-1573-8}}, language = {{eng}}, pages = {{221--228}}, publisher = {{IEEE - Institute of Electrical and Electronics Engineers Inc.}}, title = {{Empirical Observations regarding predictability in user access behavior in a distributed digital library system}}, url = {{http://ieeexplore.ieee.org/iel5/7926/21854/01016636.pdf?tp=&isnumber=21854&arnumber=1016636}}, year = {{2002}}, }