Commits · 2b76c05794e66655e10633d2d78287854c991f63 · nexedi / linux

11 Jan, 2011 35 commits

Merge branches 'cxgb4', 'ipath', 'ipoib', 'mlx4', 'mthca', 'nes', 'qib' and 'srp' into for-next · 2b76c057
Roland Dreier authored Jan 10, 2011

2b76c057

IB/qib: Fix refcount leak in lkey/rkey validation · 4db62d47

Mike Marciniszyn authored Jan 10, 2011

The mr optimization introduced a reference count leak on an exception
test.  The lock/refcount manipulation is moved down and the problematic
exception test now calls bail to insure that the lock is released.

Additional fixes as suggested by Ralph Campbell <ralph.campbell@qlogic.org>:
- reduce lock scope of dma regions
- use explicit values on returns vs. automatic ret value
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

4db62d47

IB/qib: Improve SERDES tunning on QMH boards · f2d255a0

Mike Marciniszyn authored Jan 10, 2011

Improve the QMH SERDES tunning on initial driver load by having the
driver go through a link state change.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

f2d255a0

IB/qib: Unnecessary delayed completions on RC connection · dd04e43d

Mike Marciniszyn authored Jan 10, 2011

Currently on receipt of a response message (ACKs, RDMA Response,
Atomic Responses etc.) if the SDMA completion counter is not advanced
the driver delays the completion of the WQE.  In most cases this is
overly pessimistic as the response (ACK) to a previously transmitted
send implies that the send is complete.  Ensure that SDMA queue is
progressed appropriately before determining if a send has delayed
completions.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

dd04e43d

IB/qib: Issue pre-emptive NAKs on eager buffer overflow · 994bcd28

Mike Marciniszyn authored Jan 10, 2011

Under congestion resulting in eager buffer overflow attempt to send
pre-emptive NAKs if header queue entries with TID errors are generated
and a valid header is present.  This prevents long timeouts and flow
restarts if a trailing set of packets are dropped due to eager
overflows.  Pre-emptive NAKs are currently only supported for RDMA
writes.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

994bcd28

IB/qib: RDMA lkey/rkey validation is inefficient for large MRs · 2a600f14

Mike Marciniszyn authored Jan 10, 2011

The current code loops during rkey/lkey validiation to isolate the MR
for the RDMA, which is expensive when the current operation is inside
a very large memory region.

This fix optimizes rkey/lkey validation routines for user memory
regions and fast memory regions.  The MR entry can be isolated by
shifts/mods instead of looping.  The existing loop is preserved for
phys memory regions for now.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

2a600f14

IB/qib: Change QPN increment · 7c3edd3f

Mike Marciniszyn authored Jan 10, 2011

Changing from +1 to +2 allows for better QP distribution across
receive contexts.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

7c3edd3f

IB/qib: Add fix missing from earlier patch · 057ae62f

Mike Marciniszyn authored Jan 10, 2011

The upstream code was missing part of a receive/error race fix from
the internal tree.  Add the missing part, which makes future merges
possible.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

057ae62f

IB/qib: Change receive queue/QPN selection · 2528ea60

Mike Marciniszyn authored Jan 10, 2011

The basic idea is that on SusieQ, the difficult part of mapping QPN to
context is handled by the mapping registers so the generic QPN
allocation doesn't need to worry about chip specifics.  For Monty and
Linda, there is no mapping table so the qpt->mask (same as
dd->qpn_mask), is used to see if the QPN to context falls within
[zero..dd->n_krcv_queues).
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

2528ea60

IB/qib: Fix interrupt mitigation · 19ede2e4

Mike Marciniszyn authored Jan 10, 2011

For SusieQ we need to write to the interrupt timer register before
updating the header queue head with interrupt count. This is to
ensure that the timer is enabled properly and a receive available
interrupt is delivered. Otherwise this interrupt can be lost if the
receiver header/eager queues are full before the timer is enabled.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

19ede2e4

IB/qib: Avoid duplicate writes to the rcv head register · aa7374ac

Mike Marciniszyn authored Jan 10, 2011

Avoid duplicate writes to the head register as this can lead to lost
interrupts if the context goes full before the second write is done.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

aa7374ac

IB/qib: Add a few new SERDES tunings · e706203c

Mike Marciniszyn authored Jan 10, 2011

Add new SERDES tuning to aid manufacturing.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

e706203c

IB/qib: Reset packet list after freeing · f73df408

Mike Marciniszyn authored Jan 10, 2011

Reset the list pointers after freeing the SDMA packet list.  This is
done to any potential double-free cases.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

f73df408

IB/qib: New SERDES init routine and improvements to SI quality · a0a234d4

Mike Marciniszyn authored Jan 10, 2011

Implement new SERDES initialization routine and improvements to signal
integrity -- disable LE1 adaptation, disable LOS after link-up, set
better SERDES parameters.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

a0a234d4

IB/qib: Clear WAIT_SEND flags when setting QP to error state · 16028f27

Mike Marciniszyn authored Jan 10, 2011

If these flags are set when the QP is transitioned to the error state,
it will wait until the flags are cleared, which may never happen if
the error transition is due to a link going down.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

16028f27

IB/qib: Fix context allocation with multiple HCAs · 6676b3f7

Mike Marciniszyn authored Jan 10, 2011

The driver was incorrectly choosing HCAs on which to allocate new user
contexts based on overall count of usable ports regardless whether the
usable port was on the currently selected HCA.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

6676b3f7

IB/qib: Fix multi-Florida HCA host panic on reboot · 5dbbcb97

Mike Marciniszyn authored Jan 10, 2011

Add check when setting configured contexts that the value does not
exceed the number of contexts allocated for the card.  If the value
exceeds the already allocated count, set it to what is already
allocated.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

5dbbcb97

IB/qib: Handle transitions from ACTIVE_DEFERRED to ACTIVE better · b3d5cb2f

Mike Marciniszyn authored Jan 10, 2011

When the link transitions from ACTIVE_DEFERRED to ACTIVE, the driver
only sees the ACTIVE state. With this change, it will check whether
the state was already ACTIVE and if so, it will not generated IB
events and will not clear symbol error counts.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

b3d5cb2f

IB/qib: UD send with immediate receive completion has wrong size · c7665e5a

Mike Marciniszyn authored Jan 10, 2011

The code to generate receive completion entries for UD send with
immediate contains the wrong payload length.  This is because when the
code to compute the payload size was moved, the value of hdrsize
didn't get moved too.  The fix is to update tlen directly.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

c7665e5a

IB/qib: Set port physical state even if other fields are invalid · 3c9e5f4d

Mike Marciniszyn authored Jan 10, 2011

The IBTA vol. 1 release 1.2.1 spec. says:
C14-24.2.1: If PortInfo:Portstate=Down, then a SubnSet(PortInfo) shall
make any changes it specifies to PortInfo:PortPhysicalState; any other
result is vendor-dependent.

The patch changes the error handling so that the reply says there are
invalid fields but still attempts to set fields that are in range
including PortInfo:PortPhysicalState.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

3c9e5f4d

IB/qib: Generate completion callback on errors · a377acd1

Mike Marciniszyn authored Jan 10, 2011

According to IBTA vol. 1, C11-30.1.1, a notification callback is
invoked if the CQ is armed for the next solicited completion event or
an error completion. The error case wasn't being generated correctly.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

a377acd1

IB/qib: Add support for the new QME7362 card · f509f9c1

Mike Marciniszyn authored Jan 10, 2011

Add support to recognize another board variation named QME7362.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

f509f9c1

IB/qib: Add receive header queue size module parameters · 0a43e117

Mike Marciniszyn authored Jan 10, 2011

The receive header queue sizes need to modified for performance
tuning.  Three module parameters are added to support this.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

0a43e117

IB/qib: Remove IB latency turnoff · 9d5b243f

Mike Marciniszyn authored Jan 10, 2011

This is required for hardware testing.
Signed-off-by: Mike Marciniszyn <mike.marciniszyn@qlogic.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

9d5b243f

RDMA/nes: Fix string continuation line · 601d87b0

Joe Perches authored Jan 10, 2011

Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

601d87b0

IB/mthca: Handle -ENOMEM in forward_trap() · d0444f15

Dan Carpenter authored Jan 10, 2011

ib_create_send_mad() can return ERR_PTR(-ENOMEM) here.
Signed-off-by: Dan Carpenter <error27@gmail.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

d0444f15

IB/mlx4: Handle -ENOMEM in forward_trap() · 13974909

Dan Carpenter authored Jan 10, 2011

ib_create_send_mad() can return ERR_PTR(-ENOMEM) here.
Signed-off-by: Dan Carpenter <error27@gmail.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

13974909

mlx4_core: Avoid vunmap() of invalid pointer if allocation fails · 030b4b33
Ali Ayoub authored Jan 10, 2011
```
Signed-off-by: Ali Ayoub <ali@mellanox.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
```
030b4b33

IB/mlx4: Don't call dma_free_coherent() with irqs disabled · 3afa9f19

Vladimir Sokolovsky authored Jan 10, 2011

mlx4_ib_free_cq_buf() should not be called under spin_lock_irq() since
it calls dma_free_coherent(), which needs irqs enabled.  Fix this by
deferring the free to outside the locked region.

This was found due to the

	WARN_ON(irqs_disabled());

in swiotlb_free_coherent().
Signed-off-by: Vladimir Sokolovsky <vlad@mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

3afa9f19

mlx4_core: Remove warning message about firmware bug · f5a49539

Roland Dreier authored Jan 10, 2011

The kernel warning message added in commit 58d74bb1 ("mlx4_core:
Workaround firmware bug in query dev cap") about mlx4 reporting the
wrong number of "blue flame registers" doesn't really help anyone, since
the firmware bug is known and fixed and the bug is pretty much harmless
to users.  So just get rid of the warning.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

f5a49539

IPoIB: Add GRO support · 8ae31e5b

Or Gerlitz authored Jan 10, 2011

Signed-off-by: Or Gerlitz <ogerlitz@voltaire.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

8ae31e5b

IPoIB: Remove LRO support · 19e364f6

Or Gerlitz authored Jan 10, 2011

As a first step in moving from LRO to GRO, revert commit af40da89
("IPoIB: add LRO support").  Also eliminate the ethtool set_flags
callback which isn't needed anymore.  Finally, we need to include
<linux/sched.h> directly to get the declaration of restart_syscall()
(which used to be included implicitly through <linux/inet_lro.h>).

Cc: Ben Hutchings <bhutchings@solarflare.com>
Cc: Eric W. Biederman <ebiederm@xmission.com>
Cc: Vladimir Sokolovsky <vlad@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@voltaire.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

19e364f6

IB/ipath: Use printf extension %pR for struct resource · 1eba27e8

Joe Perches authored Jan 10, 2011

Using %pR standardizes the struct resource output.
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

1eba27e8

RDMA/cxgb4: Don't re-init wait object in init/fini paths · db8b1016

Steve Wise authored Jan 10, 2011

Re-initializing the wait object in rdma_init()/rdma_fini() causes a
timing window which can lead to a deadlock during close.  Once this
deadlock hits, all RDMA activity over the T4 device will be stuck.

There's no need to re-init the wait object, so remove it.
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

db8b1016

RDMA/cxgb3,cxgb4: Remove dead code · c9431091

Stephen Hemminger authored Jan 10, 2011

This removes unused code found by running 'make namespacecheck';
compile tested only.
Signed-off-by: Stephen Hemminger <shemminger@vyatta.com>
Acked-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

c9431091

10 Jan, 2011 5 commits

IB/srp: consolidate hot-path variables into cache lines · 9af76271

David Dillow authored Nov 26, 2010

Put the variables accessed together in the hot-path into common
cachelines, and separate them by RW vs RO to avoid false dirtying.
We keep a local copy of the lkey and rkey in the target to avoid
traversing pointers (and associated cache lines) to find them.
Reviewed-by: Bart Van Assche <bvanassche@acm.org>
Signed-off-by: David Dillow <dillowda@ornl.gov>

9af76271

IB/srp: stop sharing the host lock with SCSI · e9684678

Bart Van Assche authored Nov 26, 2010

We don't need protection against the SCSI stack, so use our own lock to
allow parallel progress on separate CPUs.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
[ broken out and small cleanups by David Dillow ]
Signed-off-by: David Dillow <dillowda@ornl.gov>

e9684678

IB/srp: reduce lock coverage of command completion · 94a9174c

Bart Van Assche authored Nov 26, 2010

We only need the lock to cover list and credit manipulations, so push
those into srp_remove_req() and update the call chains.

We reorder the request removal and command completion in
srp_process_rsp() to avoid the SCSI mid-layer sending another command
before we've released our request and added any credits returned by the
target. This prevents us from returning HOST_BUSY unneccesarily.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
[ broken out, small cleanups, and modified to avoid potential extraneous
  HOST_BUSY returns by David Dillow ]
Signed-off-by: David Dillow <dillowda@ornl.gov>

94a9174c

IB/srp: reduce local coverage for command submission and EH · 76c75b25

Bart Van Assche authored Nov 26, 2010

We only need locks to protect our lists and number of credits available.
By pre-consuming the credit for the request, we can reduce our lock
coverage to just those areas. If we don't actually send the request,
we'll need to put the credit back into the pool.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
[ broken out and small cleanups by David Dillow ]
Signed-off-by: David Dillow <dillowda@ornl.gov>

76c75b25

IB/srp: don't move active requests to their own list · 536ae14e

Bart Van Assche authored Nov 26, 2010

We use req->scmnd != NULL to indicate an active request, so there's no
need to keep a separate list for them. We can afford the array iteration
during error handling, and dropping it gives us one less item that needs
lock protection.
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
[ broken out and small cleanups by David Dillow ]
Signed-off-by: David Dillow <dillowda@ornl.gov>

536ae14e