
Semidirect parallel self-consistent field: the load balancing problem in the input/output intensive self-consistent field iterations

Lindh, Roland LU ; Krogh, Jesper Wisborg LU ; Schutz, M and Hirao, K (2003) In Theoretical Chemistry Accounts 110(3). p.156-164
Abstract
The full capacity of contemporary parallel computers can, in the context of iterative ab initio procedures like, for example, self-consistent field (SCF) and multiconfigurational SCF, only be utilized if the disk and input/output (I/O) capacity are fully exploited before the implementation turns to an integral direct strategy. In a recent report on parallel semidirect SCF http://www.tc.cornell.edu/er/media/1996/collabrate.html, http://www.fp.mcs.anl.gd/grand-challenges/chem/non-direct/index.html it was demonstrated that super-linear speedups are achievable for algorithms that exploit scalable parallel I/O. In the I/O-intensive SCF iterations of this implementation a static load balancing, however, was employed, dictated by the initial iteration in which integral evaluation dominates the central processing unit activity and thus determines the load balancing. In the present paper we present the first implementation in which load balancing is achieved throughout the whole SCF procedure, i.e. also in subsequent iterations. The improved scalability of our new algorithm is demonstrated in some test calculations; for example, for a 63-node calculation a speedup of 104 was observed in the computation of the two-electron integral contribution to the Fock matrix.
author
Lindh, Roland; Krogh, Jesper Wisborg; Schutz, M and Hirao, K
organization
publishing date
2003
type
Contribution to journal
publication status
published
subject
keywords
parallel, integral direct, semidirect, input/output
in
Theoretical Chemistry Accounts
volume
110
issue
3
pages
156 - 164
publisher
Springer
external identifiers
  • wos:000186604300007
  • scopus:0242658782
ISSN
1432-881X
DOI
language
English
LU publication?
yes
id
654d41a9-3b22-4a6f-b717-229f4141d78b (old id 294743)
date added to LUP
2007-09-17 10:21:38
date last changed
2018-05-29 11:57:27
@article{654d41a9-3b22-4a6f-b717-229f4141d78b,
  abstract     = {The full capacity of contemporary parallel computers can, in the context of iterative ab initio procedures like, for example, self-consistent field (SCF) and multiconfigurational SCF, only be utilized if the disk and input/output (I/O) capacity are fully exploited before the implementation turns to an integral direct strategy. In a recent report on parallel semidirect SCF http://www.tc.cornell.edu/er/media/1996/collabrate.html, http://www.fp.mcs.anl.gd/grand-challenges/chem/non-direct/index.html it was demonstrated that super-linear speedups are achievable for algorithms that exploit scalable parallel I/O. In the I/O-intensive SCF iterations of this implementation a static load balancing, however, was employed, dictated by the initial iteration in which integral evaluation dominates the central processing unit activity and thus determines the load balancing. In the present paper we present the first implementation in which load balancing is achieved throughout the whole SCF procedure, i.e. also in subsequent iterations. The improved scalability of our new algorithm is demonstrated in some test calculations; for example, for a 63-node calculation a speedup of 104 was observed in the computation of the two-electron integral contribution to the Fock matrix.},
  author       = {Lindh, Roland and Krogh, Jesper Wisborg and Schutz, M and Hirao, K},
  issn         = {1432-881X},
  keyword      = {parallel,integral direct,semidirect,input/output},
  language     = {eng},
  number       = {3},
  pages        = {156--164},
  publisher    = {Springer},
  journal      = {Theoretical Chemistry Accounts},
  title        = {Semidirect parallel self-consistent field: the load balancing problem in the input/output intensive self-consistent field iterations},
  url          = {http://dx.doi.org/},
  volume       = {110},
  year         = {2003},
}