• Mark Rutland's avatar
    perf/core: Fix group {cpu,task} validation · bde6608d
    Mark Rutland authored
    commit 64aee2a9 upstream.
    
    Regardless of which events form a group, it does not make sense for the
    events to target different tasks and/or CPUs, as this leaves the group
    inconsistent and impossible to schedule. The core perf code assumes that
    these are consistent across (successfully intialised) groups.
    
    Core perf code only verifies this when moving SW events into a HW
    context. Thus, we can violate this requirement for pure SW groups and
    pure HW groups, unless the relevant PMU driver happens to perform this
    verification itself. These mismatched groups subsequently wreak havoc
    elsewhere.
    
    For example, we handle watchpoints as SW events, and reserve watchpoint
    HW on a per-CPU basis at pmu::event_init() time to ensure that any event
    that is initialised is guaranteed to have a slot at pmu::add() time.
    However, the core code only checks the group leader's cpu filter (via
    event_filter_match()), and can thus install follower events onto CPUs
    violating thier (mismatched) CPU filters, potentially installing them
    into a CPU without sufficient reserved slots.
    
    This can be triggered with the below test case, resulting in warnings
    from arch backends.
    
      #define _GNU_SOURCE
      #include <linux/hw_breakpoint.h>
      #include <linux/perf_event.h>
      #include <sched.h>
      #include <stdio.h>
      #include <sys/prctl.h>
      #include <sys/syscall.h>
      #include <unistd.h>
    
      static int perf_event_open(struct perf_event_attr *attr, pid_t pid, int cpu,
    			   int group_fd, unsigned long flags)
      {
    	return syscall(__NR_perf_event_open, attr, pid, cpu, group_fd, flags);
      }
    
      char watched_char;
    
      struct perf_event_attr wp_attr = {
    	.type = PERF_TYPE_BREAKPOINT,
    	.bp_type = HW_BREAKPOINT_RW,
    	.bp_addr = (unsigned long)&watched_char,
    	.bp_len = 1,
    	.size = sizeof(wp_attr),
      };
    
      int main(int argc, char *argv[])
      {
    	int leader, ret;
    	cpu_set_t cpus;
    
    	/*
    	 * Force use of CPU0 to ensure our CPU0-bound events get scheduled.
    	 */
    	CPU_ZERO(&cpus);
    	CPU_SET(0, &cpus);
    	ret = sched_setaffinity(0, sizeof(cpus), &cpus);
    	if (ret) {
    		printf("Unable to set cpu affinity\n");
    		return 1;
    	}
    
    	/* open leader event, bound to this task, CPU0 only */
    	leader = perf_event_open(&wp_attr, 0, 0, -1, 0);
    	if (leader < 0) {
    		printf("Couldn't open leader: %d\n", leader);
    		return 1;
    	}
    
    	/*
    	 * Open a follower event that is bound to the same task, but a
    	 * different CPU. This means that the group should never be possible to
    	 * schedule.
    	 */
    	ret = perf_event_open(&wp_attr, 0, 1, leader, 0);
    	if (ret < 0) {
    		printf("Couldn't open mismatched follower: %d\n", ret);
    		return 1;
    	} else {
    		printf("Opened leader/follower with mismastched CPUs\n");
    	}
    
    	/*
    	 * Open as many independent events as we can, all bound to the same
    	 * task, CPU0 only.
    	 */
    	do {
    		ret = perf_event_open(&wp_attr, 0, 0, -1, 0);
    	} while (ret >= 0);
    
    	/*
    	 * Force enable/disble all events to trigger the erronoeous
    	 * installation of the follower event.
    	 */
    	printf("Opened all events. Toggling..\n");
    	for (;;) {
    		prctl(PR_TASK_PERF_EVENTS_DISABLE, 0, 0, 0, 0);
    		prctl(PR_TASK_PERF_EVENTS_ENABLE, 0, 0, 0, 0);
    	}
    
    	return 0;
      }
    
    Fix this by validating this requirement regardless of whether we're
    moving events.
    Signed-off-by: default avatarMark Rutland <mark.rutland@arm.com>
    Signed-off-by: default avatarPeter Zijlstra (Intel) <peterz@infradead.org>
    Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
    Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
    Cc: Linus Torvalds <torvalds@linux-foundation.org>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Cc: Thomas Gleixner <tglx@linutronix.de>
    Cc: Zhou Chengming <zhouchengming1@huawei.com>
    Link: http://lkml.kernel.org/r/1498142498-15758-1-git-send-email-mark.rutland@arm.comSigned-off-by: default avatarIngo Molnar <mingo@kernel.org>
    Signed-off-by: default avatarGreg Kroah-Hartman <gregkh@linuxfoundation.org>
    bde6608d
core.c 256 KB