• Thomas Gleixner's avatar
    sched: Queue RT tasks to head when prio drops · aaedb090
    Thomas Gleixner authored
    commit 81a44c54 upstream.
    
    The following scenario does not work correctly:
    
    Runqueue of CPUx contains two runnable and pinned tasks:
    
     T1: SCHED_FIFO, prio 80
     T2: SCHED_FIFO, prio 80
    
    T1 is on the cpu and executes the following syscalls (classic priority
    ceiling scenario):
    
     sys_sched_setscheduler(pid(T1), SCHED_FIFO, .prio = 90);
     ...
     sys_sched_setscheduler(pid(T1), SCHED_FIFO, .prio = 80);
     ...
    
    Now T1 gets preempted by T3 (SCHED_FIFO, prio 95). After T3 goes back
    to sleep the scheduler picks T2. Surprise!
    
    The same happens w/o actual preemption when T1 is forced into the
    scheduler due to a sporadic NEED_RESCHED event. The scheduler invokes
    pick_next_task() which returns T2. So T1 gets preempted and scheduled
    out.
    
    This happens because sched_setscheduler() dequeues T1 from the prio 90
    list and then enqueues it on the tail of the prio 80 list behind T2.
    This violates the POSIX spec and surprises user space which relies on
    the guarantee that SCHED_FIFO tasks are not scheduled out unless they
    give the CPU up voluntarily or are preempted by a higher priority
    task. In the latter case the preempted task must get back on the CPU
    after the preempting task schedules out again.
    
    We fixed a similar issue already in commit 60db48ca (sched: Queue a
    deboosted task to the head of the RT prio queue). The same treatment
    is necessary for sched_setscheduler(). So enqueue to head of the prio
    bucket list if the priority of the task is lowered.
    
    It might be possible that existing user space relies on the current
    behaviour, but it can be considered highly unlikely due to the corner
    case nature of the application scenario.
    Signed-off-by: default avatarThomas Gleixner <tglx@linutronix.de>
    Signed-off-by: default avatarSebastian Andrzej Siewior <bigeasy@linutronix.de>
    Signed-off-by: default avatarPeter Zijlstra <peterz@infradead.org>
    Link: http://lkml.kernel.org/r/1391803122-4425-6-git-send-email-bigeasy@linutronix.deSigned-off-by: default avatarIngo Molnar <mingo@kernel.org>
    Signed-off-by: default avatarZefan Li <lizefan@huawei.com>
    aaedb090
core.c 204 KB