1. 20 Aug, 2024 2 commits
    • Yipeng Zou's avatar
      sched_ext: Allow dequeue_task_scx to fail · 9ad2861b
      Yipeng Zou authored
      Since dequeue_task() allowed to fail, there is a compile error:
      
      kernel/sched/ext.c:3630:19: error: initialization of ‘bool (*)(struct rq*, struct task_struct *, int)’ {aka ‘_Bool (*)(struct rq *, struct task_struct *, int)’} from incompatible pointer type ‘void (*)(struct rq*, struct task_struct *, int)’
        3630 |  .dequeue_task  = dequeue_task_scx,
             |                   ^~~~~~~~~~~~~~~~
      
      Allow dequeue_task_scx to fail too.
      
      Fixes: 863ccdbb ("sched: Allow sched_class::dequeue_task() to fail")
      Signed-off-by: default avatarYipeng Zou <zouyipeng@huawei.com>
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      9ad2861b
    • Tejun Heo's avatar
      Merge branch 'tip/sched/core' into for-6.12 · 5ac99857
      Tejun Heo authored
      To receive 863ccdbb ("sched: Allow sched_class::dequeue_task() to fail")
      which makes sched_class.dequeue_task() return bool instead of void. This
      leads to compile breakage and will be fixed by a follow-up patch.
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      5ac99857
  2. 17 Aug, 2024 23 commits
  3. 13 Aug, 2024 2 commits
    • Tejun Heo's avatar
      sched_ext: Don't use double locking to migrate tasks across CPUs · 89909296
      Tejun Heo authored
      consume_remote_task() and dispatch_to_local_dsq() use
      move_task_to_local_dsq() to migrate the task to the target CPU. Currently,
      move_task_to_local_dsq() expects the caller to lock both the source and
      destination rq's. While this may save a few lock operations while the rq's
      are not contended, under contention, the double locking can exacerbate the
      situation significantly (refer to the linked message below).
      
      Update the migration path so that double locking is not used.
      move_task_to_local_dsq() now expects the caller to be locking the source rq,
      drops it and then acquires the destination rq lock. Code is simpler this way
      and, on a 2-way NUMA machine w/ Xeon Gold 6138, 'hackbench 100 thread 5000`
      shows ~3% improvement with scx_simple.
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      Suggested-by: default avatarPeter Zijlstra <peterz@infradead.org>
      Link: http://lkml.kernel.org/r/20240806082716.GP37996@noisy.programming.kicks-ass.netAcked-by: default avatarDavid Vernet <void@manifault.com>
      89909296
    • Manu Bretelle's avatar
      sched_ext: define missing cfi stubs for sched_ext · 33d031ec
      Manu Bretelle authored
      `__bpf_ops_sched_ext_ops` was missing the initialization of some struct
      attributes. With
      
        https://lore.kernel.org/all/20240722183049.2254692-4-martin.lau@linux.dev/
      
      every single attributes need to be initialized programs (like scx_layered)
      will fail to load.
      
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_init not found in kernel, skipping it as it's set to zero
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_exit not found in kernel, skipping it as it's set to zero
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_prep_move not found in kernel, skipping it as it's set to zero
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_move not found in kernel, skipping it as it's set to zero
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_cancel_move not found in kernel, skipping it as it's set to zero
        05:26:48 [INFO] libbpf: struct_ops layered: member cgroup_set_weight not found in kernel, skipping it as it's set to zero
        05:26:48 [WARN] libbpf: prog 'layered_dump': BPF program load failed: unknown error (-524)
        05:26:48 [WARN] libbpf: prog 'layered_dump': -- BEGIN PROG LOAD LOG --
        attach to unsupported member dump of struct sched_ext_ops
        processed 0 insns (limit 1000000) max_states_per_insn 0 total_states 0 peak_states 0 mark_read 0
        -- END PROG LOAD LOG --
        05:26:48 [WARN] libbpf: prog 'layered_dump': failed to load: -524
        05:26:48 [WARN] libbpf: failed to load object 'bpf_bpf'
        05:26:48 [WARN] libbpf: failed to load BPF skeleton 'bpf_bpf': -524
        Error: Failed to load BPF program
      Signed-off-by: default avatarManu Bretelle <chantr4@gmail.com>
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      33d031ec
  4. 08 Aug, 2024 3 commits
    • Tejun Heo's avatar
      sched_ext: Improve logging around enable/disable · 344576fa
      Tejun Heo authored
      sched_ext currently doesn't generate messages when the BPF scheduler is
      enabled and disabled unless there are errors. It is useful to have paper
      trail. Improve logging around enable/disable:
      
      - Generate info messages on enable and non-error disable.
      
      - Update error exit message formatting so that it's consistent with
        non-error message. Also, prefix ei->msg with the BPF scheduler's name to
        make it clear where the message is coming from.
      
      - Shorten scx_exit_reason() strings for SCX_EXIT_UNREG* for brevity and
        consistency.
      
      v2: Use pr_*() instead of KERN_* consistently. (David)
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      Suggested-by: default avatarPhil Auld <pauld@redhat.com>
      Reviewed-by: default avatarPhil Auld <pauld@redhat.com>
      Acked-by: default avatarDavid Vernet <void@manifault.com>
      344576fa
    • Tejun Heo's avatar
      sched_ext: Make scx_rq_online() also test cpu_active() in addition to SCX_RQ_ONLINE · 991ef53a
      Tejun Heo authored
      scx_rq_online() currently only tests SCX_RQ_ONLINE. This isn't fully correct
      - e.g. consume_dispatch_q() uses task_run_on_remote_rq() which tests
      scx_rq_online() to see whether the current rq can run the task, and, if so,
      calls consume_remote_task() to migrate the task to @rq. While the test
      itself was done while locking @rq, @rq can be temporarily unlocked by
      consume_remote_task() and nothing prevents SCX_RQ_ONLINE from going offline
      before the migration takes place.
      
      To address the issue, add cpu_active() test to scx_rq_online(). There is a
      synchronize_rcu() between cpu_active() being cleared and the rq going
      offline, so if an on-going scheduling operation sees cpu_active(), the
      associated rq is guaranteed to not go offline until the scheduling operation
      is complete.
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      Fixes: 60c27fb5 ("sched_ext: Implement sched_ext_ops.cpu_online/offline()")
      Acked-by: default avatarDavid Vernet <void@manifault.com>
      991ef53a
    • Tejun Heo's avatar
      sched_ext: Fix unsafe list iteration in process_ddsp_deferred_locals() · 72763ea3
      Tejun Heo authored
      process_ddsp_deferred_locals() executes deferred direct dispatches to the
      local DSQs of remote CPUs. It iterates the tasks on
      rq->scx.ddsp_deferred_locals list, removing and calling
      dispatch_to_local_dsq() on each. However, the list is protected by the rq
      lock that can be dropped by dispatch_to_local_dsq() temporarily, so the list
      can be modified during the iteration, which can lead to oopses and other
      failures.
      
      Fix it by popping from the head of the list instead of iterating the list.
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      Fixes: 5b26f7b9 ("sched_ext: Allow SCX_DSQ_LOCAL_ON for direct dispatches")
      Acked-by: default avatarDavid Vernet <void@manifault.com>
      72763ea3
  5. 07 Aug, 2024 5 commits
  6. 06 Aug, 2024 5 commits