mirror of
https://github.com/wolfpld/tracy.git
synced 2026-06-08 08:33:48 +00:00
Document known MEMORY2 limitation for graph-launched alloc nodes
CUpti_ActivityMemory3 has no graphId field, so matchGraphActivityToAPICall cannot be applied. Graph-launched cudaGraphAddMemAllocNode emits multiple MEMORY2 records sharing the launch correlationId; only the first is tracked, subsequent ones fire a spurious matchError.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@@ -1110,6 +1110,11 @@ namespace tracy
 {
 ZoneNamedN(kernel, "tracy::CUDACtx::DoProcessDeviceEvent[malloc/free]", instrument);
 CUpti_ActivityMemory3* memory3 = (CUpti_ActivityMemory3*)record;
+// NOTE: CUpti_ActivityMemory3 has no graphId field, so matchGraphActivityToAPICall
+// cannot be used here. Graph-launched memory alloc nodes (cudaGraphAddMemAllocNode)
+// share the launch's correlationId and CUPTI emits multiple MEMORY2 records per node.
+// The first record consumes the cudaCallSiteInfo entry; subsequent ones will fire a
+// spurious matchError and skip memory tracking. This is a known limitation.
 APICallInfo apiCall;
 if (!matchActivityToAPICall(memory3->correlationId, apiCall)) {
 return matchError(memory3->correlationId, "MEMORY");
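The limitation described in the added comment can be reproduced in isolation with a small model. The sketch below is not Tracy's code: `CallSiteTable`, `ReplayResult`, and `replayMemoryRecords` are hypothetical names, the `APICallInfo` here is a stand-in struct, and the only behavior taken from the commit is that a successful match consumes the call-site entry for that correlationId.

```cpp
#include <cassert>
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for the profiler's per-call bookkeeping.
struct APICallInfo {
    std::uint64_t startNs;
};

// Hypothetical call-site table keyed by correlation ID.
// Assumption: a successful match removes the entry, as the commit describes.
class CallSiteTable {
public:
    void registerApiCall(std::uint32_t correlationId, APICallInfo info) {
        bool inserted = table_.emplace(correlationId, info).second;
        assert(inserted && "correlationId registered twice");
        (void)inserted;  // silence unused-variable warnings when NDEBUG is set
    }

    // Consuming lookup: returns false if no entry exists for this ID.
    bool matchActivityToAPICall(std::uint32_t correlationId, APICallInfo& out) {
        auto it = table_.find(correlationId);
        if (it == table_.end()) {
            return false;
        }
        out = it->second;
        table_.erase(it);  // the first record uses up the entry
        return true;
    }

private:
    std::unordered_map<std::uint32_t, APICallInfo> table_;
};

struct ReplayResult {
    int tracked = 0;      // records matched to an API call
    int matchErrors = 0;  // records with no remaining entry
};

// Feeds a sequence of memory-record correlation IDs through the table.
ReplayResult replayMemoryRecords(CallSiteTable& table,
                                 const std::vector<std::uint32_t>& recordIds) {
    ReplayResult result;
    for (std::uint32_t id : recordIds) {
        APICallInfo apiCall{};
        if (table.matchActivityToAPICall(id, apiCall)) {
            ++result.tracked;
        } else {
            ++result.matchErrors;
        }
    }
    return result;
}
```

Registering one API call and replaying three records that all carry its correlationId leaves one record tracked and two counted as match errors in this model, which mirrors the spurious matchError the comment warns about.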