    s390/cpum_cf: Reject request for sampling in event initialization · 250dea0d
    Thomas Richter authored
    [ Upstream commit 613a41b0 ]
    
    On s390 the command perf top fails:
    [root@s35lp76 perf] # ./perf top -F100000  --stdio
       Error:
       cycles: PMU Hardware doesn't support sampling/overflow-interrupts.
       	Try 'perf stat'
    [root@s35lp76 perf] #
    
    Using event -e rb0000 works as designed.  Event rb0000 is the event
    number of the sampling facility for basic sampling.
    
    During system start up the following PMUs are installed in the kernel's
    PMU list (from head to tail):
       cpum_cf --> s390 PMU counter facility device driver
       cpum_sf --> s390 PMU sampling facility device driver
       uprobe
       kprobe
       tracepoint
       task_clock
       cpu_clock
    
    perf top executes the following functions and calls the perf_event_open(2)
    system call many times with different parameters:
    
    cmd_top
    --> __cmd_top
        --> perf_evlist__add_default
            --> __perf_evlist__add_default
                --> perf_evlist__new_cycles (creates event type:0 (HW),
                                             config:0 (CPU_CYCLES))
                    --> perf_event_attr__set_max_precise_ip
                        Uses perf_event_open(2) to detect the correct
                        precise_ip level. Fails 3 times on s390, which is ok.
    
    Then perf top continues with the following functions:

    cmd_top
    --> __cmd_top
        --> perf_top__start_counters
            --> perf_evlist__config
                --> perf_can_comm_exec
                    --> perf_probe_api
                        This function tests support for the following
                        events: "cycles:u", "instructions:u", "cpu-clock:u"
                        using
                        --> perf_do_probe_api
                            --> perf_event_open_cloexec
                                Tests the close-on-exec flag support with
                                perf_event_open(2).
                            perf_do_probe_api returns true if the event is
                            supported.
                            The function returns true because event cpu-clock
                            is supported by the PMU cpu_clock.
                            This is achieved by many calls to
                            perf_event_open(2).
    
    Function perf_top__start_counters now calls perf_evsel__open() for every
    event. Here that is the default event cpu_cycles (config:0) of type
    HARDWARE (type:0) with a predefined frequency of 4000.
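
    For illustration, the default event described above corresponds roughly
    to the following perf_event_attr setup. This is a sketch under
    assumptions: the helper names are hypothetical, only the type, config and
    frequency values come from the text, and the perf tool sets further
    fields that are not shown here.

```c
#include <assert.h>
#include <string.h>
#include <linux/perf_event.h>

/* Fill *attr with the default "cycles" event described above.
 * The helper name is hypothetical; the field values come from the text. */
static void init_default_cycles_attr(struct perf_event_attr *attr)
{
	assert(attr != NULL);
	memset(attr, 0, sizeof(*attr));
	attr->size = sizeof(*attr);
	attr->type = PERF_TYPE_HARDWARE;		/* type:0 */
	attr->config = PERF_COUNT_HW_CPU_CYCLES;	/* config:0 */
	attr->freq = 1;		/* interpret sample_freq as a frequency */
	attr->sample_freq = 4000;
}

/* User-space counterpart of the kernel's is_sampling_event() test:
 * sample_period shares a union with sample_freq, so a non-zero value
 * in either means that sampling is requested. */
static int attr_requests_sampling(const struct perf_event_attr *attr)
{
	return attr->sample_period != 0;
}
```

    Because a frequency is set, attr_requests_sampling() is true for this
    event, which is why the sampling check in the kernel applies to it.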
    
    Given the above order of the PMU list, the PMU cpum_cf gets called first
    and returns 0, which indicates support for this sampling. The event is
    fully allocated in the function perf_event_open (file kernel/event/core.c,
    near line 10521) and the following check fails:
    
    	event = perf_event_alloc(&attr, cpu, task, group_leader, NULL,
    				 NULL, NULL, cgroup_fd);
    	if (IS_ERR(event)) {
    		err = PTR_ERR(event);
    		goto err_cred;
    	}

    	if (is_sampling_event(event)) {
    		if (event->pmu->capabilities & PERF_PMU_CAP_NO_INTERRUPT) {
    			err = -EOPNOTSUPP;
    			goto err_alloc;
    		}
    	}
    
    The check for the interrupt capabilities fails and the system call
    perf_event_open() returns -EOPNOTSUPP (-95).
    
    Add a check to return -ENODEV when sampling is requested in PMU cpum_cf.
    This allows common kernel code in the perf_event_open() system call to
    test the next PMU in the above list.
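
    The effect of the change can be illustrated with a small user-space
    model of the PMU list walk. This is a sketch, not kernel code: all
    model_* names, CAP_NO_INTERRUPT and MODEL_DECLINE are hypothetical
    stand-ins, the "decline" code simply follows the wording of this
    message, and the assumption that cpum_sf accepts the sampling event is
    taken from the fact that perf top is expected to work afterwards.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

#define CAP_NO_INTERRUPT 0x1	/* stands in for PERF_PMU_CAP_NO_INTERRUPT */
#define MODEL_DECLINE (-ENODEV)	/* "not my event, try the next PMU" (model only) */

struct model_event {
	unsigned long long sample_period;	/* non-zero: sampling requested */
};

struct model_pmu {
	const char *name;
	unsigned int capabilities;
	int (*event_init)(const struct model_event *ev);
};

static int is_sampling(const struct model_event *ev)
{
	return ev->sample_period != 0;
}

/* cpum_cf before the change: claims the event even if sampling is requested. */
static int cf_init_old(const struct model_event *ev)
{
	(void)ev;
	return 0;
}

/* cpum_cf after the change: declines sampling requests. */
static int cf_init_new(const struct model_event *ev)
{
	return is_sampling(ev) ? MODEL_DECLINE : 0;
}

/* cpum_sf in this model: accepts sampling requests only. */
static int sf_init(const struct model_event *ev)
{
	return is_sampling(ev) ? 0 : MODEL_DECLINE;
}

/* Walk the list from head to tail. The first PMU that does not decline
 * owns the event; the interrupt capability check is applied afterwards.
 * *claimed receives the index of the owning PMU, or -1 if there is none. */
static int model_open(const struct model_pmu *pmus, size_t n,
		      const struct model_event *ev, int *claimed)
{
	size_t i;

	assert(pmus != NULL && ev != NULL && claimed != NULL);
	*claimed = -1;
	for (i = 0; i < n; i++) {
		int ret = pmus[i].event_init(ev);

		if (ret == MODEL_DECLINE)
			continue;
		if (ret)
			return ret;
		*claimed = (int)i;
		if (is_sampling(ev) && (pmus[i].capabilities & CAP_NO_INTERRUPT))
			return -EOPNOTSUPP;
		return 0;
	}
	return MODEL_DECLINE;
}

static const struct model_pmu pmus_before[] = {
	{ "cpum_cf", CAP_NO_INTERRUPT, cf_init_old },
	{ "cpum_sf", 0, sf_init },
};

static const struct model_pmu pmus_after[] = {
	{ "cpum_cf", CAP_NO_INTERRUPT, cf_init_new },
	{ "cpum_sf", 0, sf_init },
};
```

    With a sampling event (sample_period 4000), model_open() on pmus_before
    stops at cpum_cf and returns -EOPNOTSUPP, while on pmus_after the walk
    moves on to cpum_sf and returns 0.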
    
    Fixes: 97b1198f ("s390, perf: Use common PMU interrupt disabled code")
    Signed-off-by: Thomas Richter <tmricht@linux.ibm.com>
    Reviewed-by: Hendrik Brueckner <brueckner@linux.ibm.com>
    Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>