Commits · 3cab0c3e8636d5005041aa52224f796c3a4ef872 · Kirill Smelkov / linux

20 Mar, 2006 40 commits

[SPARC64]: More SUN4V cpu mondo bug fixing. · 3cab0c3e

David S. Miller authored Mar 02, 2006

This cpu mondo sending interface isn't all that easy to
use correctly...

We were clearing out the wrong bits from the "mask" after getting
something other than EOK from the hypervisor.

It turns out the hypervisor can just be resent the same cpu_list[]
array, with the 0xffff "done" entries still in there, and it will do
the right thing.

So don't update or try to rebuild the cpu_list[] array to condense it.

This requires the "forward_progress" check to be done slightly
differently, but this new scheme is less bug prone than what we were
doing before.
Signed-off-by: David S. Miller <davem@davemloft.net>

3cab0c3e

[SPARC64]: Fix sun4v mna winfixup handling. · bcc28ee0

David S. Miller authored Mar 02, 2006

We were clobbering a base register before we were done
using it.  Fix a comment typo while we're here.
Signed-off-by: David S. Miller <davem@davemloft.net>

bcc28ee0

[SPARC64]: Fix mini RTC driver reading. · c4f8ef77

David S. Miller authored Mar 02, 2006

Need to subtract 1900 from year and 1 from month before
giving it back to userspace.
Signed-off-by: David S. Miller <davem@davemloft.net>

c4f8ef77

[SPARC64]: Do not allow mapping pages within 4GB of 64-bit VA hole. · 8bcd1741

David S. Miller authored Mar 02, 2006

The UltraSPARC T1 manual recommends this because the chip
could instruction prefetch into the VA hole, and this would
also make decoding  certain kinds of memory access traps
more difficult (because the chip sign extends certain pieces
of trap state).
Signed-off-by: David S. Miller <davem@davemloft.net>

8bcd1741

[SPARC64]: Fix _PAGE_EXEC handling. · 45f791eb

David S. Miller authored Mar 01, 2006

First of all, use the known _PAGE_EXEC_{4U,4V} value instead
of loading _PAGE_EXEC from memory.  We either know which one
to use by context, or we can code patch the test.

Next, we need to check executability of a PTE in the generic
TSB miss handler.
Signed-off-by: David S. Miller <davem@davemloft.net>

45f791eb

[SPARC64]: Fix typo in SUN4V D-TLB miss handler. · 92daa77e

David S. Miller authored Mar 01, 2006

Should put FAULT_CODE_DTLB into %g3 not FAULT_CODE_ITLB.
Signed-off-by: David S. Miller <davem@davemloft.net>

92daa77e

[SPARC64]: Kill bogus function externs in asm/pgtable.h · e2299045
David S. Miller authored Mar 01, 2006
```
These are all implemented inline earlier in the file.
Signed-off-by: David S. Miller <davem@davemloft.net>
```
e2299045
[SPARC64]: Add mini-RTC driver for Starfire and SUN4V. · 8ba706a9
David S. Miller authored Mar 01, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
8ba706a9

[SPARC64]: Fix bugs in SUN4V cpu mondo dispatch. · b830ab66

David S. Miller authored Feb 28, 2006

There were several bugs in the SUN4V cpu mondo dispatch code.

In fact, if we ever got a EWOULDBLOCK or other error from
the hypervisor call, we'd potentially send a cpu mondo multiple
times to the same cpu and even worse we could loop until the
timeout resending the same mondo over and over to such cpus.

So let's bulletproof this thing as follows:

1) Implement cpu_mondo_send() and cpu_state() hypervisor calls
   in arch/sparc64/kernel/entry.S, add prototypes to asm/hypervisor.h

2) Don't build and update the cpulist using inline functions, this
   was causing the cpu mask to not get updated in the caller.

3) Disable interrupts during the entire mondo send, otherwise our
   cpu list and/or mondo block could get overwritten if we take
   an interrupt and do a cpu mondo send on the current cpu.

4) Check for all possible error return types from the cpu_mondo_send()
   hypervisor call.  In particular:

   HV_EOK) Our work is done, all cpus have received the mondo.
   HV_CPUERROR) One or more of the cpus in the cpu list we passed
                to the hypervisor are in error state.  Use cpu_state()
                calls over the entries in the cpu list to see which
		ones.  Record them in "error_mask" and report this
		after we are done sending the mondo to cpus which are
		not in error state.
   HV_EWOULDBLOCK) We need to keep trying.

   Any other error we consider fatal, we report the event and exit
   immediately.

5) We only timeout if forward progress is not made.  Forward progress
   is defined as having at least one cpu get the mondo successfully
   in a given cpu_mondo_send() call.  Otherwise we bump a counter
   and delay a little.  If the counter hits a limit, we signal an
   error and report the event.

Also, smp_call_function_mask() error handling reports the number
of cpus incorrectly.
Signed-off-by: David S. Miller <davem@davemloft.net>

b830ab66

[SPARC64]: Fix bugs in SMP TLB context version expiration handling. · aac0aadf

David S. Miller authored Feb 27, 2006

1) We must flush the TLB, duh.

2) Even if the sw context was seen to be valid, the local cpu's
   hw context can be out of date, so reload it unconditionally.
Signed-off-by: David S. Miller <davem@davemloft.net>

aac0aadf

[SPARC64]: Fix indexing into kpte_linear_bitmap. · 6889331a

David S. Miller authored Feb 26, 2006

Need to shift back up by 3 bits to get 8-byte entry
index.
Signed-off-by: David S. Miller <davem@davemloft.net>

6889331a

[SPARC64]: Use 13-bit context size always. · 97c4b6f9

David S. Miller authored Feb 26, 2006

We no longer have the problems that require using the smaller
sizes.
Signed-off-by: David S. Miller <davem@davemloft.net>

97c4b6f9

[SPARC64]: Avoid dcache-dirty page state management on sun4v. · 7a591cfe

David S. Miller authored Feb 26, 2006

It is totally wasted work, since we have no D-cache aliasing
issues on sun4v.
Signed-off-by: David S. Miller <davem@davemloft.net>

7a591cfe

[SPARC64]: Bulletproof hypervisor TLB flushing. · 2a3a5f5d

David S. Miller authored Feb 26, 2006

Check TLB flush hypervisor calls for errors and report them.

Pass HV_MMU_ALL always for now, we can add back the optimization
to avoid the I-TLB flush later.

Always explicitly page align the virtual address arguments.
Signed-off-by: David S. Miller <davem@davemloft.net>

2a3a5f5d

[SPARC64]: Report mondo error correctly in hypervisor_xcall_deliver(). · 6cc80cfa
David S. Miller authored Feb 26, 2006
```
It's in "arg0" not "func".
Signed-off-by: David S. Miller <davem@davemloft.net>
```
6cc80cfa
[SPARC64]: Niagara optimized XOR functions for RAID. · 36344762
David S. Miller authored Feb 25, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
36344762

[SPARC64]: Fix binfmt_aout32.c build. · c4e9249b

Andrew Morton authored Feb 24, 2006

Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: David S. Miller <davem@davemloft.net>

c4e9249b

[SPARC64]: destroy_context() needs to disable interrupts. · 77b838fa

David S. Miller authored Feb 23, 2006

get_new_mmu_context() can be invoked from interrupt context
now for the new SMP version wrap handling.

So disable interrupt while taking ctx_alloc_lock in destroy_context()
so we don't deadlock.
Signed-off-by: David S. Miller <davem@davemloft.net>

77b838fa

[SPARC64]: Fix TLB context allocation with SMT style shared TLBs. · a0663a79

David S. Miller authored Feb 23, 2006

The context allocation scheme we use depends upon there being a 1<-->1
mapping from cpu to physical TLB for correctness.  Chips like Niagara
break this assumption.

So what we do is notify all cpus with a cross call when the context
version number changes, and if necessary this makes them allocate
a valid context for the address space they are running at the time.

Stress tested with make -j1024, make -j2048, and make -j4096 kernel
builds on a 32-strand, 8 core, T2000 with 16GB of ram.
Signed-off-by: David S. Miller <davem@davemloft.net>

a0663a79

[SPARC64]: Put syscall tables after trap table. · 074d82cf

David S. Miller authored Feb 23, 2006

Otherwise with too much stuff enabled in the kernel config
we can end up with an unaligned trap table.
Signed-off-by: David S. Miller <davem@davemloft.net>

074d82cf

[SPARC64]: Export _PAGE_E and _PAGE_CACHE to modules. · b2bef442
David S. Miller authored Feb 23, 2006
```
SBUS flash driver needs it.

Noticed by Fabbione.
Signed-off-by: David S. Miller <davem@davemloft.net>
```
b2bef442

[SPARC64]: Fix %tstate ASI handling in start_thread{,32}() · 0f05da6d

David S. Miller authored Feb 22, 2006

Niagara helps us find a ancient bug in the sparc64 port :-)

The ASI_* values are plain constant defines, thus signed 32-bit
on sparc64.  To put shift this into the regs->tstate value we were
doing or'ing "(ASI_PNF << 24)" into there.

ASI_PNF is 0x82 and shifted left by 24 makes that topmost bit the
sign bit in a 32-bit value.  This would get sign extended to 64-bits
and thus corrupt the top-half of the reg->tstate value.

This never caused problems in pre-Niagara cpus because the only thing
up there were the condition code values.  But Niagara has the global
register level field, and this all 1's value is illegal there so
Niagara gives an illegal instruction trap due to this bug.

I'm pretty sure this bug is about as old as the sparc64 port itself.

This also points out that we weren't setting ASI_PNF for 32-bit tasks.
We should, so fix that while we're here.
Signed-off-by: David S. Miller <davem@davemloft.net>

0f05da6d

[SPARC64]: Drop %gl to 0 before re-enabling PSTATE_IE in rtrap · fc504928

David S. Miller authored Feb 22, 2006

If we take a window fault, on SUN4V set %gl to zero before we
turn PSTATE_IE back on in %pstate.  Otherwise if we take an
interrupt we'll end up with corrupt register state.
Signed-off-by: David S. Miller <davem@davemloft.net>

fc504928

[SPARC64]: Create a seperate kernel TSB for 4MB/256MB mappings. · d7744a09

David S. Miller authored Feb 21, 2006

It can map all of the linear kernel mappings with zero TSB hash
conflicts for systems with 16GB or less ram.  In such cases, on
SUN4V, once we load up this TSB the first time with all the
mappings, we never take a linear kernel mapping TLB miss ever
again, the hypervisor handles them all.
Signed-off-by: David S. Miller <davem@davemloft.net>

d7744a09

[SPARC64]: Make use of Niagara 256MB PTEs for kernel mappings. · 9cc3a1ac

David S. Miller authored Feb 21, 2006

We use a bitmap, one bit for every 256MB of memory.  If the
bit is set we can use a 256MB PTE for linear mappings, else
we have to use a 4MB PTE.

SUN4V support is there, and we can very easily add support
for Panther cpu 256MB PTEs in the future.
Signed-off-by: David S. Miller <davem@davemloft.net>

9cc3a1ac

[SPARC64]: Use sun4v_cpu_idle() in cpu_idle() on SUN4V. · 30c91d57

David S. Miller authored Feb 21, 2006

We have to turn off the "polling nrflag" bit when we sleep
the cpu like this, so that we'll get a cross-cpu interrupt
to wake the processor up from the yield.

We also have to disable PSTATE_IE in %pstate around the yield
call and recheck need_resched() in order to avoid any races.
Signed-off-by: David S. Miller <davem@davemloft.net>

30c91d57

[SPARC64] math-emu: Delete debugging printk left by previous commit. · 689126a4
David S. Miller authored Feb 21, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
689126a4
[SPARC64]: Add sun4v_cpu_yield(). · 6f5374c9
David S. Miller authored Feb 21, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
6f5374c9

[SPARC64]: Kill cpudata->idle_volume. · 1bd0cd74

David S. Miller authored Feb 21, 2006

Set, but never used.

We used to use this for dynamic IRQ retargetting, but that
code died a long time ago.
Signed-off-by: David S. Miller <davem@davemloft.net>

1bd0cd74

[SPARC64]: Niagara optimized memset/bzero/clear_user. · 8ca2557c
David S. Miller authored Feb 21, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
8ca2557c
[SPARC64]: Pass multiple CPUs at once to hypervisor cross-call API. · d371c0c1
David S. Miller authored Feb 21, 2006
```
Signed-off-by: David S. Miller <davem@davemloft.net>
```
d371c0c1

[SPARC64]: Args to SUNW,set-trap-table are 64-bit. · c79f7677

David S. Miller authored Feb 20, 2006

They were getting truncated to 32-bit and this is very bad
when your MMU fault status area is in physical memory above
4GB on SUN4V.
Signed-off-by: David S. Miller <davem@davemloft.net>

c79f7677

[SPARC64]: Handle unimplemented FPU square-root on Niagara. · 4e74ae80

David S. Miller authored Feb 20, 2006

The math-emu code only expects unfinished fpop traps when
emulating FPU sqrt instructions on pre-Niagara chips.
On Niagara we can get unimplemented fpop, so handle that.
Signed-off-by: David S. Miller <davem@davemloft.net>

4e74ae80

[SPARC] serial: Make sure sysfs nodes get named correctly. · f5deb807

David S. Miller authored Feb 20, 2006

Because we play this trick where we use ttyS? in increasing minor
numbers for different sunfoo.c drivers, we have to inform the TTY
layer of this.

Do so by setting the tty->name_base appropriately.

Probably there should be a generic way to do this in the serial core,
but for now...
Signed-off-by: David S. Miller <davem@davemloft.net>

f5deb807

[SPARC64]: Typo in sun4v_data_access_exception log message. · 55555633
David S. Miller authored Feb 20, 2006
```
Should be "Dax" not "Iax".
Signed-off-by: David S. Miller <davem@davemloft.net>
```
55555633

[SPARC64]: Handle zero-length map requests in pci_sun4v.c · d82965c1

David S. Miller authored Feb 20, 2006

By simply changing the do-while loop into a plain
while loop.
Signed-off-by: David S. Miller <davem@davemloft.net>

d82965c1

[SPARC64]: Kill stray PGLIST_NENTS check in pci_sun4v.c · abf3b7bd

David S. Miller authored Feb 20, 2006

I forgot to remove the one in pci_4v_map_sg() during the
iommu batching commit.
Signed-off-by: David S. Miller <davem@davemloft.net>

abf3b7bd

[SPARC64]: Fix typo in dump_tl1_traplog() · 39334a4b

David S. Miller authored Feb 20, 2006

Actually make use of the 'limit' we compute.
Signed-off-by: David S. Miller <davem@davemloft.net>

39334a4b

[SPARC64]: Disable smp_report_regs() for now. · 37133c00

David S. Miller authored Feb 20, 2006

It's extremely noisy and causes much grief on slow
consoles with large numbers of cpus.

We'll have to provide this some saner way in order
to re-enable this.
Signed-off-by: David S. Miller <davem@davemloft.net>

37133c00

[SPARC64]: Remove PGLIST_NENTS PCI IOMMU mapping limitation on SUN4V. · 6a32fd4d

David S. Miller authored Feb 19, 2006

Use a batching queue system for IOMMU mapping setup,
with a page sized batch.
Signed-off-by: David S. Miller <davem@davemloft.net>

6a32fd4d