
Why does CuBlas not use a 1d array for triangular matrices?

Original post 2018-07-05 08:44:52 4 1 cuda/cublas

This might be a throwback to the old BLAS library design, but I was surprised just now to find that CuBlas wastes memory by using regular 2d arrays for triangular matrices. I suppose this makes interfacing with the rest of the API more convenient.

1 answer

I was surprised just now to find that CuBlas wastes memory by using regular 2d arrays for triangular matrices

That isn't strictly true.

If you look at the Level 2 BLAS routines, you will see that several of them (the packed variants such as tpmv, tpsv, spmv and hpmv) operate on triangular, symmetric or Hermitian matrices stored in a packed 1D format.
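As a rough illustration of what "packed" means here, the sketch below maps a 2D element position to its offset in a column-major packed triangle, following the layout described in the BLAS and cuBLAS documentation. The function names (`packed_upper_index`, `packed_lower_index`, `packed_size`) are made up for this example and are not part of any library; indices are 0-based.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helpers (not a cuBLAS API): offsets into a packed,
// column-major triangle, using 0-based row i and column j.

// Upper triangle: column j holds rows 0..j, so j*(j+1)/2 elements precede it.
std::size_t packed_upper_index(std::size_t i, std::size_t j) {
    assert(i <= j);  // only the upper triangle is stored
    return i + j * (j + 1) / 2;
}

// Lower triangle of an n x n matrix: column j holds rows j..n-1.
// Adding the row i to j*(2n-j-1)/2 gives the offset of A(i, j).
std::size_t packed_lower_index(std::size_t i, std::size_t j, std::size_t n) {
    assert(j <= i && i < n);  // only the lower triangle is stored
    return i + j * (2 * n - j - 1) / 2;
}

// Number of elements in a packed n x n triangle (versus n*n when dense).
std::size_t packed_size(std::size_t n) {
    return n * (n + 1) / 2;
}
```

For n = 3 the packed array holds 6 elements instead of 9, which is the memory saving the question is asking about.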

The Level 3 BLAS routines don't offer packed variants, but there are two good reasons why their triangular operands are stored in full dense format.

1. BLAS does it that way.
2. Those routines were mostly added to BLAS as support for LAPACK solvers. And those solvers typically store the results of factorizations in-situ in supplied full dense inputs, so it is logical to use that format in BLAS.
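To make the in-situ point concrete, here is a minimal sketch of an unblocked Cholesky factorization that overwrites the lower triangle of a dense column-major matrix and never touches the strict upper triangle. It is a textbook loop written for illustration only, not how LAPACK or cuSOLVER implement it, and `cholesky_lower_in_place` is a hypothetical name.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Illustrative only: factor a symmetric positive definite n x n matrix
// stored dense, column-major, with leading dimension lda. On success the
// lower triangle of `a` holds L with A = L * L^T; the strict upper
// triangle is neither read nor written. Returns false if a pivot is
// not positive.
bool cholesky_lower_in_place(std::vector<double>& a, std::size_t n,
                             std::size_t lda) {
    assert(lda >= n && a.size() >= lda * n);
    for (std::size_t j = 0; j < n; ++j) {
        // Diagonal entry: subtract squares of the already computed row j of L.
        double d = a[j + j * lda];
        for (std::size_t k = 0; k < j; ++k) {
            d -= a[j + k * lda] * a[j + k * lda];
        }
        if (d <= 0.0) {
            return false;
        }
        d = std::sqrt(d);
        a[j + j * lda] = d;
        // Entries below the diagonal in column j.
        for (std::size_t i = j + 1; i < n; ++i) {
            double s = a[i + j * lda];
            for (std::size_t k = 0; k < j; ++k) {
                s -= a[i + k * lda] * a[j + k * lda];
            }
            a[i + j * lda] = s / d;
        }
    }
    return true;
}
```

Because the factor lands in the same dense buffer with the same leading dimension, a Level 3 triangular routine can consume it directly without any repacking step.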

I guess if you don't like the design choice you can always try writing to Jack Dongarra to complain.


