• Ian Rogers's avatar
    perf stat: Combine the -A/--no-aggr and --no-merge options · 6f33e6fa
    Ian Rogers authored
    The -A or --no-aggr option disables aggregation of core events:
    
      $ perf stat -A -e cycles,data_total -a true
    
       Performance counter stats for 'system wide':
    
      CPU0            1,287,665      cycles
      CPU1            1,831,681      cycles
      CPU2           27,345,998      cycles
      CPU3            1,964,799      cycles
      CPU4              236,174      cycles
      CPU5            3,302,825      cycles
      CPU6            9,201,446      cycles
      CPU7            1,403,043      cycles
      CPU0               110.90 MiB  data_total
    
             0.008961761 seconds time elapsed
    
    The --no-merge option disables the aggregation of uncore events:
    
      $ perf stat --no-merge -e cycles,data_total -a true
    
       Performance counter stats for 'system wide':
    
              38,482,778      cycles
                   15.04 MiB  data_total [uncore_imc_free_running_1]
                   15.00 MiB  data_total [uncore_imc_free_running_0]
    
             0.005915155 seconds time elapsed
    
    Having two options confuses users who generally don't appreciate the
    difference in PMUs. Keep all the options but make it so they all
    disable aggregation both of core and uncore events:
    
      $ perf stat -A -e cycles,data_total -a true
    
       Performance counter stats for 'system wide':
    
      CPU0               85,878      cycles
      CPU1               88,179      cycles
      CPU2               60,872      cycles
      CPU3            3,265,567      cycles
      CPU4               82,357      cycles
      CPU5               83,383      cycles
      CPU6               84,156      cycles
      CPU7              220,803      cycles
      CPU0                 2.38 MiB  data_total [uncore_imc_free_running_0]
      CPU0                 2.38 MiB  data_total [uncore_imc_free_running_1]
    
             0.001397205 seconds time elapsed
    
    Update the relevant 'perf stat' man page information.
    Reviewed-by: default avatarKan Liang <kan.liang@linux.intel.com>
    Signed-off-by: default avatarIan Rogers <irogers@google.com>
    Cc: Adrian Hunter <adrian.hunter@intel.com>
    Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
    Cc: Athira Jajeev <atrajeev@linux.vnet.ibm.com>
    Cc: Changbin Du <changbin.du@huawei.com>
    Cc: Ingo Molnar <mingo@redhat.com>
    Cc: James Clark <james.clark@arm.com>
    Cc: Jiri Olsa <jolsa@kernel.org>
    Cc: John Garry <john.g.garry@oracle.com>
    Cc: K Prateek Nayak <kprateek.nayak@amd.com>
    Cc: Kaige Ye <ye@kaige.org>
    Cc: Mark Rutland <mark.rutland@arm.com>
    Cc: Namhyung Kim <namhyung@kernel.org>
    Cc: Nick Desaulniers <ndesaulniers@google.com>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Link: https://lore.kernel.org/r/20231214060256.2094017-1-irogers@google.comSigned-off-by: default avatarArnaldo Carvalho de Melo <acme@redhat.com>
    6f33e6fa
perf-stat.txt 22.7 KB