cost 513 ms
dram_write_bytes result on P100

I used nvprof to profile a simple vecadd example (n=1024) on P100 but observed the dram_write_bytes is only 256 (rather than 1024*4 that I expected). ...

2020-07-14 03:45:28   1   62    cuda / nvprof  

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM