Feb 9 2009, 03:53 PM
Post
#1
|
|
![]() ![]() ![]() ![]() ![]() Group: Extranet Users Posts: 107 Joined: 6-October 07 From: Berkeley, CA Member No.: 72,970 Org.: UC Berkeley |
I'd like to share an implementation of LAPACK's routines SGETRF, SPOTRF, and SGEQRF that is accelerated using GPU. This implementation is limited to factorization of square matrices that reside in the host memory (i.e. at the CPU side). The following figure shows the sustained performance on the following platform: Intel Core2 Quad 2.83 GHz (Q9550), PCIe 2.0 x16, Intel MKL 10.1, Windows XP 64-bit, NVIDIA driver 181.20, CUDA 2.1:
![]() The implementation follows the description given in the following paper; however, some of the finer tunings described, such as recursive and variable blocking, are not included in this release: Volkov, V., and Demmel, J. W. 2008. Benchmarking GPUs to tune dense linear algebra, SC08.Regards, Vasily 05/02/09 edit: updated dead URL to the paper. This post has been edited by vvolkov: May 2 2009, 12:30 PM
Attached File(s)
|
|
|
|
vvolkov LU, QR and Cholesky factorizations using GPU Feb 9 2009, 03:53 PM
zhenyu Thank you very much!
For the QR decomposition,... Feb 9 2009, 04:36 PM
vvolkov QUOTE (zhenyu @ Feb 9 2009, 08:36 AM) Tha... Feb 9 2009, 04:58 PM
kerrmudgeon QUOTE (zhenyu @ Feb 9 2009, 12:36 PM) Tha... Aug 3 2009, 04:08 AM
zhenyu QUOTE (kerrmudgeon @ Aug 3 2009, 06:08 AM... Aug 3 2009, 07:39 AM
zhenyu In fact, I am working on a Givens rotation version... Feb 9 2009, 04:40 PM
VictorGre Many thanks! You could make and lay out too mo... Feb 11 2009, 03:23 PM
Boxed Cylon With routines such as these we are ever so close t... Feb 15 2009, 01:29 PM
vvolkov As far as I see, GTX260 has 3/4 peak arithmetic th... Feb 15 2009, 01:50 PM
Boxed Cylon QUOTE (vvolkov @ Feb 15 2009, 05:50 AM) A... Feb 15 2009, 11:03 PM
vvolkov QUOTE (Boxed Cylon @ Feb 15 2009, 03:03 P... Feb 15 2009, 11:27 PM
Boxed Cylon QUOTE (vvolkov @ Feb 15 2009, 03:27 PM) I... Feb 16 2009, 12:04 AM
frea vvolkov could you also post how much time does eve... Feb 15 2009, 08:24 PM
vvolkov QUOTE (frea @ Feb 15 2009, 12:24 PM) vvol... Feb 15 2009, 09:10 PM
Boxed Cylon It so happens that I just today reconfigured my sm... Feb 17 2009, 01:32 PM
vvolkov QUOTE (Boxed Cylon @ Feb 17 2009, 05:32 A... Feb 17 2009, 02:34 PM
Boxed Cylon The short answer as to why I get sub-standard band... Feb 17 2009, 04:00 PM
Boxed Cylon Ah ha! It turns out that on this Gigabyte mot... Feb 18 2009, 06:20 AM
vvolkov QUOTE (Boxed Cylon @ Feb 17 2009, 10:20 P... Feb 18 2009, 06:25 AM
marcof Hi, great work here!
Do you think there is a ... Jun 25 2009, 06:19 PM
vvolkov QUOTE (marcof @ Jun 25 2009, 11:19 AM) Do... Jun 25 2009, 06:46 PM
tmurray QUOTE (vvolkov @ Jun 25 2009, 11:46 AM) I... Jun 25 2009, 06:52 PM
Sarnath Vasily,
Congrats on the Good work!
I have a ... Jun 26 2009, 12:33 PM
vvolkov QUOTE (tmurray @ Jun 25 2009, 11:52 AM) I... Jun 26 2009, 03:17 PM
jam1 Hi,
Is there a way to use the ATLAS package instea... Jun 26 2009, 03:23 PM
vvolkov QUOTE (jam1 @ Jun 26 2009, 08:23 AM) Hi,
... Jun 26 2009, 03:47 PM
Sarnath Thank you, Vasily! Jun 28 2009, 02:40 PM
avidday Now that downloads are working again, I finally wa... Sep 26 2009, 11:55 AM
MMB QUOTE (avidday @ Sep 26 2009, 07:55 AM) N... Sep 26 2009, 12:44 PM
avidday QUOTE (MMB @ Sep 26 2009, 03:44 PM) Hi Av... Sep 26 2009, 02:35 PM![]() ![]() |
| Copyright 2008 NVIDIA Corporation. Terms of Use | Legal Info | Privacy Policy | Time is now: 9th February 2010 - 11:26 PM |