• Kajol Jain's avatar
    powerpc/perf: Add encodings to represent data based on newer composite PERF_MEM_LVLNUM* fields · 4a20ee10
    Kajol Jain authored
    The code represent data coming from L1/L2/L3 cache hits based on
    PERF_MEM_LVL_* namespace, which is in the process of deprecation in
    the favour of newer composite PERF_MEM_{LVLNUM_,REMOTE_,SNOOPX_,HOPS_}
    fields.
    
    Add data source encodings to represent L1/L2/L3 cache hits based on
    newer composite PERF_MEM_{LVLNUM_,REMOTE_,SNOOPX_,HOPS_} fields for
    power10 and older platforms
    
    Result in power9 system without patch changes:
    
    localhost:# ./perf mem report --sort="mem,sym,dso" --stdio
     # Overhead       Samples  Memory access             Symbol                             Shared Object
     # ........  ............  ........................  .................................  ................
     #
        29.51%             1  L2 hit                    [k] perf_event_exec                [kernel.vmlinux]
        27.05%             1  L1 hit                    [k] perf_ctx_unlock                [kernel.vmlinux]
        13.93%             1  L1 hit                    [k] vtime_delta                    [kernel.vmlinux]
        13.11%             1  L1 hit                    [k] prepend_path.isra.11           [kernel.vmlinux]
         8.20%             1  L1 hit                    [.] 00000038.plt_call.__GI_strlen  libc-2.28.so
         8.20%             1  L1 hit                    [k] perf_event_interrupt           [kernel.vmlinux]
    
    Result in power9 system with patch changes:
    
    localhost:# ./perf mem report --sort="mem,sym,dso" --stdio
     # Overhead       Samples  Memory access             Symbol                      Shared Object
     # ........  ............  ........................  ..........................  ................
     #
        36.63%             1  L2 or L2 hit              [k] perf_event_exec         [kernel.vmlinux]
        25.50%             1  L1 or L1 hit              [k] vtime_delta             [kernel.vmlinux]
        13.12%             1  L1 or L1 hit              [k] unmap_region            [kernel.vmlinux]
        12.62%             1  L1 or L1 hit              [k] perf_sample_event_took  [kernel.vmlinux]
         6.93%             1  L1 or L1 hit              [k] perf_ctx_unlock         [kernel.vmlinux]
         5.20%             1  L1 or L1 hit              [.] __memcpy_power7         libc-2.28.so
    Signed-off-by: default avatarKajol Jain <kjain@linux.ibm.com>
    Reviewed-by: default avatarMadhavan Srinivasan <maddy@linux.ibm.com>
    Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
    Link: https://lore.kernel.org/r/20211206091749.87585-4-kjain@linux.ibm.com
    4a20ee10
isa207-common.c 20.3 KB