Commit e4afd2e5 authored by Matthew Wilcox's avatar Matthew Wilcox Committed by Linus Torvalds

lib/find_bit.c: micro-optimise find_next_*_bit

This saves 32 bytes on my x86-64 build, mostly due to alignment
considerations and sharing more code between find_next_bit and
find_next_zero_bit, but it does save a couple of instructions.

There's really two parts to this commit:
 - First, the first half of the test: (!nbits || start >= nbits) is
   trivially a subset of the second half, since nbits and start are both
   unsigned
 - Second, while looking at the disassembly, I noticed that GCC was
   predicting the branch taken. Since this is a failure case, it's
   clearly the less likely of the two branches, so add an unlikely() to
   override GCC's heuristics.

[mawilcox@microsoft.com: v2]
  Link: http://lkml.kernel.org/r/1483709016-1834-1-git-send-email-mawilcox@linuxonhyperv.com
Link: http://lkml.kernel.org/r/1483709016-1834-1-git-send-email-mawilcox@linuxonhyperv.comSigned-off-by: default avatarMatthew Wilcox <mawilcox@microsoft.com>
Acked-by: default avatarYury Norov <ynorov@caviumnetworks.com>
Acked-by: default avatarRasmus Villemoes <linux@rasmusvillemoes.dk>
Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
parent 55ded955
...@@ -33,7 +33,7 @@ static unsigned long _find_next_bit(const unsigned long *addr, ...@@ -33,7 +33,7 @@ static unsigned long _find_next_bit(const unsigned long *addr,
{ {
unsigned long tmp; unsigned long tmp;
if (!nbits || start >= nbits) if (unlikely(start >= nbits))
return nbits; return nbits;
tmp = addr[start / BITS_PER_LONG] ^ invert; tmp = addr[start / BITS_PER_LONG] ^ invert;
...@@ -151,7 +151,7 @@ static unsigned long _find_next_bit_le(const unsigned long *addr, ...@@ -151,7 +151,7 @@ static unsigned long _find_next_bit_le(const unsigned long *addr,
{ {
unsigned long tmp; unsigned long tmp;
if (!nbits || start >= nbits) if (unlikely(start >= nbits))
return nbits; return nbits;
tmp = addr[start / BITS_PER_LONG] ^ invert; tmp = addr[start / BITS_PER_LONG] ^ invert;
......
...@@ -34,7 +34,7 @@ static unsigned long _find_next_bit(const unsigned long *addr, ...@@ -34,7 +34,7 @@ static unsigned long _find_next_bit(const unsigned long *addr,
{ {
unsigned long tmp; unsigned long tmp;
if (!nbits || start >= nbits) if (unlikely(start >= nbits))
return nbits; return nbits;
tmp = addr[start / BITS_PER_LONG] ^ invert; tmp = addr[start / BITS_PER_LONG] ^ invert;
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment