TY - JOUR
T1 - Grid-based algorithm to search critical points, in the electron density, accelerated by graphics processing units
AU - Hernández-Esparza, Raymundo
AU - Mejía-Chica, Sol Milena
AU - Zapata-Escobar, Andy D.
AU - Guevara-García, Alfredo
AU - Martínez-Melchor, Apolinar
AU - Hernández-Pérez, Julio M.
AU - Vargas, Rubicelia
AU - Garza, Jorge
N1 - Publisher Copyright:
© 2014 Wiley Periodicals, Inc.
PY - 2014/12/5
Y1 - 2014/12/5
N2 - Using a grid-based method to search the critical points in electron density, we show how to accelerate such a method with graphics processing units (GPUs). When the GPU implementation is contrasted with that used on central processing units (CPUs), we found a large difference between the time elapsed by both implementations: the smallest time is observed when GPUs are used. We tested two GPUs, one related with video games and other used for high-performance computing (HPC). By the side of the CPUs, two processors were tested, one used in common personal computers and other used for HPC, both of last generation. Although our parallel algorithm scales quite well on CPUs, the same implementation on GPUs runs around 10× faster than 16 CPUs, with any of the tested GPUs and CPUs. We have found what one GPU dedicated for video games can be used without any problem for our application, delivering a remarkable performance, in fact; this GPU competes against one HPC GPU, in particular when single-precision is used.
AB - Using a grid-based method to search the critical points in electron density, we show how to accelerate such a method with graphics processing units (GPUs). When the GPU implementation is contrasted with that used on central processing units (CPUs), we found a large difference between the time elapsed by both implementations: the smallest time is observed when GPUs are used. We tested two GPUs, one related with video games and other used for high-performance computing (HPC). By the side of the CPUs, two processors were tested, one used in common personal computers and other used for HPC, both of last generation. Although our parallel algorithm scales quite well on CPUs, the same implementation on GPUs runs around 10× faster than 16 CPUs, with any of the tested GPUs and CPUs. We have found what one GPU dedicated for video games can be used without any problem for our application, delivering a remarkable performance, in fact; this GPU competes against one HPC GPU, in particular when single-precision is used.
KW - atoms in molecules
KW - electron density
KW - graphics processing units
KW - parallel computing
UR - http://www.scopus.com/inward/record.url?scp=84922479003&partnerID=8YFLogxK
U2 - 10.1002/jcc.23752
DO - 10.1002/jcc.23752
M3 - Article
AN - SCOPUS:84922479003
SN - 0192-8651
VL - 35
SP - 2272
EP - 2278
JO - Journal of Computational Chemistry
JF - Journal of Computational Chemistry
IS - 31
ER -