Commits · 5768f143072855298bf3b4c41097b4e980719530 · Kirill Smelkov / linux

24 May, 2004 7 commits
- [CPUFREQ] Make longhaul debug a module option. · 5768f143
  Dave Jones authored May 25, 2004
  
  5768f143
- [CPUFREQ] Fix leak in powernow-k8 · 78b17ed0
  Dave Jones authored May 24, 2004
```
Spotted by Yury Umanets
```
  78b17ed0
- [CPUFREQ] Remove a bunch of trailing whitespace from the powernow-k8 driver. · d75801fd
  Dave Jones authored May 24, 2004
  
  d75801fd
- [CPUFREQ] Quieten the powernow-k7 init printk a little. · 3c892fff
  Dave Jones authored May 24, 2004
```
There seem to be quite a few desktop K7 processors which support the
powernow cpuid call, but don't actually offer any powernow scaling.
In which case the driver prints out
"PowerNOW! Technology present. Can scale: nothing" which looks a bit odd,
and just adds to confusion of end-users.
Change things so that we don't print anything at all if we can't do anything.

Also kill some trailing whitespace gremlins that crept in.
```
  3c892fff
- [CPUFREQ] Use correct printk prefix in p4-clockmod driver · a936d500
  Dave Jones authored May 24, 2004
  
  a936d500
- [CPUFREQ] Remove some unneeded includes. · 32f20199
  Dave Jones authored May 24, 2004
  
  32f20199
- [CPUFREQ] Convert longhaul driver to use module_param · bea1ff66
  Dave Jones authored May 24, 2004
  
  bea1ff66
23 May, 2004 1 commit
- [CPUFREQ] Silence noisy debugging printk in longhaul driver. · 0a032839
  Dave Jones authored May 23, 2004
  
  0a032839
15 May, 2004 5 commits

[CPUFREQ] Make powernow-k8 work right when ACPI is built as a module. · 6deb6e5b
Dave Jones authored May 16, 2004
```
From: Tony Lindgren <tony@atomide.com>
```
6deb6e5b
[CPUFREQ] Fix an invalid comment in speedstep-ich · 383b816c
Dave Jones authored May 16, 2004
```
This driver is for ICH only, not for PIIX4. Thanks to Christian Hilberg for noting this.
```
383b816c

[CPUFREQ] Sync p4-clockmod MSR access across logical CPUs. · 9dba63f4

Dave Jones authored May 16, 2004

As noted and debugged by Rutger Nijlunsing and verified in section 13.15.3 of Intel's
IA32 Intel Architecture Software Developer's Manual, Volume 3, the p4-clockmod
msr needs to be set to the same value on all logical CPUs ("siblings") to
function "properly".

This patch implements this, and uses cpufreq_p4_get instead of a local copy in
cpufreq_p4_setdc. The latter function now only does the actual setting, all
other (notification, verification and set_cpus_allowed()) stuff is done in
cpufreq_p4_target.

9dba63f4

[CPUFREQ] Makefile reordering issues. · a2bd9297

Dave Jones authored May 16, 2004

As several cpufreq drivers are late_initcalls now [dependency on acpi/processor.c
which is module_init()], we need to use Makefile ordering to assert that
- speedstep-centrino is loaded before acpi [faster: msr instead of io]
- speedstep-centrino, speedstep-ich and acpi are loaded before p4-clockmod
  [frequency and voltage scaling instead of throttling]

a2bd9297

[CPUFREQ] Fix several operator precedence bugs. · 9ae46eef
Dave Jones authored May 16, 2004

9ae46eef

11 May, 2004 6 commits
- [CPUFREQ] Nehemiah improvements for longhaul driver. · 6ded2df9
  Dave Jones authored May 11, 2004
```
From Andreas Meisinger 
```
  6ded2df9
- [CPUFREQ] Fix for longrun.c for degenerate case · a7615aeb
  Dave Jones authored May 11, 2004
```
From H. Peter Anvin

I ran into a system the other day which had a Transmeta processor, but
configured in a degenerate, fixed-frequency configuration.  It crashed
booting Fedora Core 2 test 3 due to a division by zero in the longrun
cpufreq driver.
```
  a7615aeb
- [CPUFREQ] Avoid scheduling cpufreq_delayed_get_work() twice; but do call it a bit earlier. · 06393d0f
  Dave Jones authored May 11, 2004
  
  06393d0f
- [CPUFREQ] Latency is in nanoseconds -- speedstep-centrino got it wrong · b4ffaea1
  Dave Jones authored May 11, 2004
  
  b4ffaea1
- [CPUFREQ] cpu_sibling_mask fixup. · d2b4affa
  Dave Jones authored May 11, 2004
  
  d2b4affa
- Merge delerium.codemonkey.org.uk:/mnt/nfs/neologic/bar/src/kernel/2.6/trees/bk-linus · 86a15ee0
  Dave Jones authored May 11, 2004
```
into delerium.codemonkey.org.uk:/mnt/nfs/neologic/bar/src/kernel/2.6/trees/cpufreq
```
  86a15ee0
10 May, 2004 21 commits

[CPUFREQ] Warning fixes. · 78d1c35b

Dave Jones authored May 10, 2004

On sparc64:
                                                                                                           
drivers/cpufreq/cpufreq.c: In function `cpufreq_add_dev':
drivers/cpufreq/cpufreq.c:394: warning: cast to pointer from integer of different size
drivers/cpufreq/cpufreq.c: In function `handle_update':
drivers/cpufreq/cpufreq.c:507: warning: cast from pointer to integer of different size

78d1c35b

[PATCH] migration_thread() race fix · 74499d32

Andrew Morton authored May 09, 2004

From: Srivatsa Vaddagiri <vatsa@in.ibm.com>

Noticed that migration_thread can examine "kthread_should_stop()?" without
setting its state to TASK_INTERRUPTIBLE first.  This can cause kthread_stop
on that thread to block forever ...

P.S 	- I assumed that having the task state set to TASK_INTERRUTIBLE
	  while it is doing active_load_balance is fine. It seemed to be
	  the case earlier also.

74499d32

[PATCH] sched_getaffinity vs cpu hotplug race fix · 870d3c0a

Andrew Morton authored May 09, 2004

From: Srivatsa Vaddagiri <vatsa@in.ibm.com>

Fix the race in sys_sched_getaffinity.  Patch below takes cpu_hotplug lock
before reading cpus_allowed mask of a task.

870d3c0a

[PATCH] Move migrate_all_tasks to CPU_DEAD handling · ddea677b

Andrew Morton authored May 09, 2004

From: Srivatsa Vaddagiri <vatsa@in.ibm.com>

migrate_all_tasks is currently run with rest of the machine stopped.
It iterates thr' the complete task table, turning off cpu affinity of any task
that it finds affine to the dying cpu. Depending on the task table
size this can take considerable time. All this time machine is stopped, doing
nothing.

Stopping the machine for such extended periods can be avoided if we do
task migration in CPU_DEAD notification and that's precisely what this patch
does.

The patch puts idle task to the _front_ of the dying CPU's runqueue at the 
highest priority possible. This cause idle thread to run _immediately_ after
kstopmachine thread yields. Idle thread notices that its cpu is offline and
dies quickly. Task migration can then be done at leisure in CPU_DEAD
notification, when rest of the CPUs are running.

Some advantages with this approach are:

	- More scalable. Predicatable amout of time that machine is stopped.
	- No changes to hot path/core code. We are just exploiting scheduler
	  rules which runs the next high-priority task on the runqueue. Also
	  since I put idle task to the _front_ of the runqueue, there
	  are no races when a equally high priority task is woken up
	  and added to the runqueue. It gets in at the back of the runqueue,
	  _after_ idle task!
	- cpu_is_offline check that is presenty required in try_to_wake_up,
	  idle_balance and rebalance_tick can be removed, thus speeding them
	  up a bit

From: Srivatsa Vaddagiri <vatsa@in.ibm.com>

  Rusty mentioned that the unlikely hints against cpu_is_offline is
  redundant since the macro already has that hint.  Patch below removes those
  redundant hints I added.

ddea677b

[PATCH] sched: Look at another CPU's domain · 4197ad87

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

The SMT wake_idle code really wants to look at a non-local CPU's domain in
order to check for idle siblings.

So change the domain attachment code a little bit so we continue to hold a
runqueue's lock while attaching a new domain. This means the locking rules
have changed to: you may access your own domain without any lock, you must
hold a remote runqueue's lock in order to view its domain.

4197ad87

[PATCH] sched: micro-optimisation for wake_up · 25de0902

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

This actually does produce better code, especially under the locked
section.

Turns a conditional + unconditional jump under the lock in the unlikely
case into a cmov outside the lock.

25de0902

[PATCH] sched: reduce idle time · 85841fc0

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

It makes NEWLY_IDLE balances cause find_busiest_group return the busiest
available group even if there isn't an imbalance.  Basically - try a bit
harder to prevent schedule emptying the runqueue.

It is quite aggressive, but that isn't so bad because we don't (by default)
do NEWLY_IDLE balancing across NUMA nodes, and NEWLY_IDLE balancing is always
restricted to cache_hot tasks.

It picked up a little bit of idle time that dbt2-pgsql was seeing...

85841fc0

[PATCH] sched: balance-on-clone · 8c8cfc36

Andrew Morton authored May 09, 2004

From: Ingo Molnar <mingo@elte.hu>

Implement balancing during clone().  It does the following things:

- introduces SD_BALANCE_CLONE that can serve as a tool for an
  architecture to limit the search-idlest-CPU scope on clone().
  E.g. the 512-CPU systems should rather not enable this.

- uses the highest sd for the imbalance_pct, not this_rq (which didnt
  make sense).

- unifies balance-on-exec and balance-on-clone via the find_idlest_cpu()
  function. Gets rid of sched_best_cpu() which was still a bit
  inconsistent IMO, it used 'min_load < load' as a condition for
  balancing - while a more correct approach would be to use half of the
  imbalance_pct, like passive balancing does.

- the patch also reintroduces the possibility to do SD_BALANCE_EXEC on
  SMP systems, and activates it - to get testing.

- NOTE: there's one thing in this patch that is slightly unclean: i
  introduced wake_up_forked_thread. I did this to make it easier to get
  rid of this patch later (wake_up_forked_process() has lots of
  dependencies in various architectures). If this capability remains in
  the kernel then i'll clean it up and introduce one function for
  wake_up_forked_process/thread.

- NOTE2: i added the SD_BALANCE_CLONE flag to the NUMA CPU template too.
  Some NUMA architectures probably want to disable this.

8c8cfc36

[PATCH] sched: cpu load management cleanup · a690c9b7

Andrew Morton authored May 09, 2004

From: Ingo Molnar <mingo@elte.hu>

This does the source/target cleanup.  This is a no-functionality patch which
also adds more comments to explain these functions.

a690c9b7

[PATCH] sched: passive balancing damping · df65cdbf

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

This patch starts to balance woken processes when half the relevant domain's
imbalance_pct is reached. Previously balancing would start after a small,
constant difference in waker/wakee runqueue loads was reached, which would
cause too much process movement when there are lots of processes running.

It also turns wake balancing into a domain flag while previously it was always
on. Now sched domains can "soft partition" an SMP system without using
processor affinities.

df65cdbf

[PATCH] sched: cleanups · 237eaf03

Andrew Morton authored May 09, 2004

From: Ingo Molnar <mingo@elte.hu>

This re-adds cleanups which were lost in splitups of an earlier patch.

237eaf03

[PATCH] sched: lock cpu_attach_domain for hotplug · 2ce2e329

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

The attached patch is required to work correctly with the CPU hotplug
framework.  John Hawkes reports successful booting with this.

2ce2e329

[PATCH] sched: extend sync wakeups · 7dc12702

Andrew Morton authored May 09, 2004

From: Ingo Molnar <mingo@elte.hu>

The attached patch extends sync wakeups to the process sys_exit() path too:
the chldwait wakeup can be done sync, since we know that the process is
going to exit (and thus deschedule).

The most visible effect of this change is strace's behavior on SMP systems:
it now stays on a single CPU, together with the traced child.  (previously
it would run in parallel to the child, bouncing around madly.)

7dc12702

[PATCH] sched: add enqueeu_task_head() · 3c6f29aa
Andrew Morton authored May 09, 2004
```
From: Ingo Molnar <mingo@elte.hu>

Helper function for later patches
```
3c6f29aa
[PATCH] sched: uninlinings · 78650e1b
Andrew Morton authored May 09, 2004
```
From: Ingo Molnar <mingo@elte.hu>

Uninline things
```
78650e1b

[PATCH] sched: minor cleanups · 2f16618a

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

Minor cleanups from Ingo's patch including task_hot (do it right in
try_to_wake_up too).

2f16618a

[PATCH] sched: fix setup races · 80b19256

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

De-racify the sched domain setup code.  This involves creating a dummy
"init" domain during sched_init (which is called early).

When topology information becomes available, the sched domains are then
built and attached.  The attach mechanism is asynchronous and uses the
migration threads, which perform the switch with interrupts off.  This is a
quiescent state, so domains can still be lockless on the read side.  It
also allows us to change the domains at runtime without much more work. 
This is something SGI is interested in to elegantly do soft partitioning of
their systems without having to use hard cpu affinities (which cause
balancing problems of their own).

The current setup code also has a race somewhere because it is unable to
boot on a 384 CPU system.



From: Anton Blanchard <anton@samba.org>

   This is basically a mindless ppc64 merge of the x86 changes to sched
   domain init code.

   Actually if I produce a sibling_map[] then the x86 code and the ppc64
   will be identical.  Maybe we can merge it.

80b19256

[PATCH] ARCH_HAS_SCHED_WAKE_BALANCE doesnt exist · 17d66773

Andrew Morton authored May 09, 2004

From: Anton Blanchard <anton@samba.org>

It seems someone has been making trivial changes without using grep.

17d66773

[PATCH] ppc64: sched-domain support · 019bc3be

Andrew Morton authored May 09, 2004

From: Anton Blanchard <anton@samba.org>

Below are the diffs between the current ppc64 sched init stuff and x86.

- Ignore the POWER5 specific stuff, I dont set up a sibling map yet.
- What should I set cache_hot_time to?

large cpumask typechecking requirements (perhaps useful on x86 as well):
- cpu->cpumask = CPU_MASK_NONE -> cpus_clear(cpu->cpumask);
- cpus_and(nodemask, node_to_cpumask(i), cpu_possible_map) doesnt work,
  need to use a temporary

019bc3be

[PATCH] sched: oops fix · a65fb1d0

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

After the for_each_domain change, the warn here won't trigger, instead it
will oops in the if statement.  Also, make sure we don't pass an empty
cpumask to for_each_cpu.

a65fb1d0

[PATCH] sched: altix tuning · fd7b7b0f

Andrew Morton authored May 09, 2004

From: Nick Piggin <nickpiggin@yahoo.com.au>

From: John Hawkes

The following brings up performance on a 64-way Altix.  This system being on
the smaller end of the scale should also be applicable to other NUMA systems.

fd7b7b0f