“invalid configuration argument” error when calling CUDA kernel

Question

I have a GeForce 620M and my code is:

int threadsPerBlock = 256;                  
int blocksPerGrid = Number_AA_GPU / threadsPerBlock; 
for(it=0;it<Number_repeatGPU;it++)
{
    Kernel_Update<<<blocksPerGrid,threadsPerBlock>>>(A, B, C, D, rand(), rand());     
}

I get:

invalid configuration argument.

What could be the reason?

Answer 1

The kernel configuration arguments are the arguments between the <<<...>>> symbols.

Your GeForce 620M is a compute capability 2.1 device.

A compute capability 2.1 device is limited to 65535 blocks in a 1-dimensional grid, i.e. the blocks-per-grid parameter (the first of the two arguments you are passing) must not exceed 65535.

Since the other parameter you are passing (threadsPerBlock, 256) is definitely in bounds, I conclude that your first parameter is out of bounds:

int blocksPerGrid = Number_AA_GPU / threadsPerBlock;

That is, Number_AA_GPU is either too large (65536*256 = 16777216 or more, which makes blocksPerGrid exceed 65535), too small (less than 256, which makes blocksPerGrid zero because of integer division), or negative.
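As a rough illustration of those three cases, here is a host-side sketch that computes the block count and checks it before the launch. The names kMaxGridDimX, blocksFor and gridSizeIsValid are hypothetical helpers, not part of the CUDA API, and the 65535 limit is the compute capability 2.x figure discussed above. Note that this sketch rounds the division up, unlike the code in the question, so it assumes the kernel guards against indices past the end of the data.

```cpp
#include <cassert>

// Assumed limit: maximum number of blocks in the x-dimension of the grid
// on a compute capability 2.x device.
const long long kMaxGridDimX = 65535;

// Hypothetical helper: number of blocks needed to cover n elements,
// rounding up so that a partial last block is still launched.
long long blocksFor(long long n, int threadsPerBlock)
{
    assert(threadsPerBlock > 0);
    return (n + threadsPerBlock - 1) / threadsPerBlock;
}

// Hypothetical helper: true if a 1-dimensional grid of this many blocks
// is within the assumed device limit (at least 1, at most kMaxGridDimX).
bool gridSizeIsValid(long long blocks)
{
    return blocks >= 1 && blocks <= kMaxGridDimX;
}
```

Before the loop in the question, one could call gridSizeIsValid(blocksFor(Number_AA_GPU, threadsPerBlock)) and print Number_AA_GPU if it returns false, which would show immediately which of the cases applies.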

In the future, you can write more easily decipherable questions if you provide a complete example. In this case, telling us what Number_AA_GPU is could make my answer more definite.
