• Cruz Zhao's avatar
    sched/core: introduce sched_core_idle_cpu() · 548796e2
    Cruz Zhao authored
    As core scheduling introduced, a new state of idle is defined as
    force idle, running idle task but nr_running greater than zero.
    
    If a cpu is in force idle state, idle_cpu() will return zero. This
    result makes sense in some scenarios, e.g., load balance,
    showacpu when dumping, and judge the RCU boost kthread is starving.
    
    But this will cause error in other scenarios, e.g., tick_irq_exit():
    When force idle, rq->curr == rq->idle but rq->nr_running > 0, results
    that idle_cpu() returns 0. In function tick_irq_exit(), if idle_cpu()
    is 0, tick_nohz_irq_exit() will not be called, and ts->idle_active will
    not become 1, which became 0 in tick_nohz_irq_enter().
    ts->idle_sleeptime won't update in function update_ts_time_stats(), if
    ts->idle_active is 0, which should be 1. And this bug will result that
    ts->idle_sleeptime is less than the actual value, and finally will
    result that the idle time in /proc/stat is less than the actual value.
    
    To solve this problem, we introduce sched_core_idle_cpu(), which
    returns 1 when force idle. We audit all users of idle_cpu(), and
    change idle_cpu() into sched_core_idle_cpu() in function
    tick_irq_exit().
    
    v2-->v3: Only replace idle_cpu() with sched_core_idle_cpu() in
    function tick_irq_exit(). And modify the corresponding commit log.
    Signed-off-by: default avatarCruz Zhao <CruzZhao@linux.alibaba.com>
    Signed-off-by: default avatarPeter Zijlstra (Intel) <peterz@infradead.org>
    Reviewed-by: default avatarPeter Zijlstra <peterz@infradead.org>
    Reviewed-by: default avatarFrederic Weisbecker <frederic@kernel.org>
    Reviewed-by: default avatarJoel Fernandes <joel@joelfernandes.org>
    Link: https://lore.kernel.org/r/1688011324-42406-1-git-send-email-CruzZhao@linux.alibaba.com
    548796e2
softirq.c 23.6 KB