
How can I launch a kernel with "as much dynamic shared mem as is possible"?

We know CUDA devices have very limited shared memory capacities, in the tens of kilobytes only. We also know kernels won't launch (typically? ever?) if you ask for too much shared memory. And we also know that the available shared memory is consumed both by the static allocations in your code and by dynamically-allocated shared memory.

Now, cudaGetDeviceProperties() gives us the overall space we have. But, given a function symbol, is it possible to determine how much statically-allocated shared memory it would use, so that I can "fill up" the shared mem to full capacity on launch? If not, is there a possibility of having CUDA take care of this for me somehow?
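For reference, a minimal sketch of the first part (querying the per-block shared memory capacity via cudaGetDeviceProperties(); device 0 is assumed here):

    #include <cstdio>
    #include <cuda_runtime.h>

    int main() {
        // Query device 0 (assumed); sharedMemPerBlock is the total
        // shared memory available to a single thread block, in bytes.
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, 0);
        std::printf("Shared memory per block: %zu bytes\n", prop.sharedMemPerBlock);
        return 0;
    }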

The runtime API has a function cudaFuncGetAttributes which will allow you to retrieve the attributes of any kernel in the current context, including the amount of static shared memory per block which the kernel will consume. You can do the math yourself with that information.
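A minimal sketch of "doing the math yourself": query the kernel's static shared memory with cudaFuncGetAttributes, subtract it from the device's per-block capacity, and pass the remainder as the dynamic shared memory size at launch. The kernel my_kernel below is hypothetical, purely for illustration:

    #include <cstdio>
    #include <cuda_runtime.h>

    // Hypothetical kernel mixing static and dynamic shared memory.
    __global__ void my_kernel(float *out) {
        __shared__ float static_buf[256];        // static allocation (counted by cudaFuncGetAttributes)
        extern __shared__ float dynamic_buf[];   // dynamic allocation, sized at launch
        static_buf[threadIdx.x % 256] = 1.0f;
        dynamic_buf[threadIdx.x] = static_buf[threadIdx.x % 256];
        out[threadIdx.x] = dynamic_buf[threadIdx.x];
    }

    int main() {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, 0);

        // Static shared memory the kernel already consumes per block.
        cudaFuncAttributes attr;
        cudaFuncGetAttributes(&attr, my_kernel);

        // Whatever is left can be requested as dynamic shared memory.
        size_t dynamic_bytes = prop.sharedMemPerBlock - attr.sharedSizeBytes;
        std::printf("static: %zu bytes, dynamic available: %zu bytes\n",
                    attr.sharedSizeBytes, dynamic_bytes);

        float *out;
        cudaMalloc(&out, 256 * sizeof(float));
        my_kernel<<<1, 256, dynamic_bytes>>>(out);
        cudaDeviceSynchronize();
        cudaFree(out);
        return 0;
    }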

You can also get the static shared memory allocation from the nvcc compilation output.
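For example, compiling with the --ptxas-options=-v flag makes ptxas print per-kernel resource usage, including static shared memory ("smem"). The file name and the exact numbers below are illustrative only:

    nvcc --ptxas-options=-v my_kernel.cu
    ptxas info    : Used 10 registers, 1024 bytes smem, 352 bytes cmem[0]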
