Is there a way to measure cache line utilization? By cache line utilization I mean the fraction of a cache line's bytes that are read or written before the line is evicted. My target is an Intel CPU running Linux. The measurements don't necessarily need to come from the physical CPU itself. A simulation, such as the one Valgrind uses, is fine too.
The motivation is that this could be a useful way to skim code for areas where changing memory layout is worth considering.
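To make the metric concrete, here is a minimal sketch of how it could be computed from a memory access trace. Everything in it is an assumption for illustration: the function name `line_utilization`, the `(address, size)` trace format, and the tiny fully associative LRU cache model. Real Intel caches are set associative and have prefetchers, so this shows the definition rather than reproducing what any particular tool reports.

```python
from collections import OrderedDict

LINE_SIZE = 64  # bytes per cache line (assumed; typical for Intel CPUs)


def line_utilization(accesses, num_lines=4, line_size=LINE_SIZE):
    """Return (line number, fraction of bytes touched) for each eviction.

    `accesses` is an iterable of (address, size) pairs. The cache model is a
    hypothetical fully associative LRU cache holding `num_lines` lines.
    """
    cache = OrderedDict()  # line number -> set of touched byte offsets
    results = []

    for addr, size in accesses:
        for byte in range(addr, addr + size):
            line, offset = divmod(byte, line_size)
            if line in cache:
                cache.move_to_end(line)  # mark as most recently used
            else:
                if len(cache) >= num_lines:
                    # Evict the least recently used line and record its utilization.
                    old_line, touched = cache.popitem(last=False)
                    results.append((old_line, len(touched) / line_size))
                cache[line] = set()
            cache[line].add(offset)

    # Lines still resident at the end of the trace are reported as well.
    for line, touched in cache.items():
        results.append((line, len(touched) / line_size))
    return results


if __name__ == "__main__":
    trace = [(0, 4), (64, 8), (128, 64), (0, 4)]
    for line, used in line_utilization(trace, num_lines=2):
        print(f"line {line}: {used:.1%} of bytes touched")
```

Lines that are evicted with a low fraction of bytes touched would be the candidates for a layout change, for example packing hot fields together.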
I think you may want Intel® VTune™ Amplifier XE 2016.