简体繁体 English

如何在编译的程序中获取函数和全局变量的 CUDA 驱动模块句柄？

[英]How can I get the CUDA driver module handle for functions and globals in the compiled program?

原文 2021-12-06 21:53:11 0 1 cuda/ globals/ nvrtc/ cuda-driver

The CUDA Runtime API has the functions cudaGetSymbolAddress() and cudaGetSymbolSize() for working with device-side globals from host-side code, using their names (source-code identifiers) as handles. CUDA 运行时 API 具有函数cudaGetSymbolAddress()和cudaGetSymbolSize()用于处理来自主机端代码的设备端全局变量，使用它们的名称（源代码标识符）作为句柄。

In the Driver API, we have cuModuleGetGlobal() , which lets us do the same thing... except that it takes a CUmodule which the global symbol is situated in. If you're working with code that you dynamically compiled and loaded/added into a module then you're all set.在驱动程序 API 中，我们有cuModuleGetGlobal() ，它可以让我们做同样的事情......除了它需要一个全局符号所在的 CUmodule。如果您正在使用动态编译和加载/添加的代码进入一个模块，然后你就准备好了。 But what if those globals are part of your program, compiled statically using NVCC rather than loaded dynamically?但是，如果这些全局变量是您程序的一部分，使用 NVCC 静态编译而不是动态加载呢？

I would assume that there's some sort of "primary module" or "default module" for each compiled program, with its baked-in globals and functions.我会假设每个编译的程序都有某种“主模块”或“默认模块”，其中包含内置的全局变量和函数。 Can I get a handle for it?我能得到它的句柄吗？

1 个解决方案

I would assume that there's some sort of "primary module" or "default module" for each compiled program, with its baked-in globals and functions.我会假设每个编译的程序都有某种“主模块”或“默认模块”，其中包含内置的全局变量和函数。

There is, and if you pull apart the runtime API emitted host boilerplate code which makes it work and some runtime traces, you will see it relies on a lot of statically defined symbols and a couple of undocumented runtime API functions which internally maintain the module the runtime API uses.有，如果你拆开运行时 API 发出的主机样板代码，使其工作和一些运行时跟踪，你会看到它依赖于许多静态定义的符号和几个未记录的运行时 API 函数，它们在内部维护模块运行时 API 使用。