Commits · a51842dcd77200159e091da8b2e38e428652532d · Kirill Smelkov / iproute2

20 Apr, 2015 3 commits

cbq: fix find syntax in example · a51842dc

Pavel Šimerda authored Apr 13, 2015

Without modification, using the example resulted in the following error:

[root@localhost sbin]# cbq restart
find: warning: you have specified the -maxdepth option after a
non-option argument (, but options are not positional (-maxdepth affects
tests specified before it as well as those specified after it).  Please
specify options before other arguments.

find: warning: you have specified the -maxdepth option after a
non-option argument (, but options are not positional (-maxdepth affects
tests specified before it as well as those specified after it).  Please
specify options before other arguments.

**CBQ: failed to compile CBQ configuration!

See also:

 * https://bugzilla.redhat.com/show_bug.cgi?id=539232Reported-by: Mads Kiilerich <mads@kiilerich.com>
Signed-off-by: Pavel Šimerda <psimerda@redhat.com>

a51842dc

ip-xfrm: support 'proto any' with 'sport' and 'dport' · 11a3e5c4

Pavel Šimerda authored Apr 13, 2015

When creating an IPsec SA that sets 'proto any' (IPPROTO_IP) and
specifies 'sport' and 'dport' at the same time in selector, the
following error is issued:

"sport" and "dport" are invalid with proto=ip

However using IPPROTO_IP with ports is completely legal and necessary
when one wants to share the SA on both TCP and UDP. One of the
applications requiring sharing SAs is 3GPP IMS AKA authentication.

See also:

 * https://bugzilla.redhat.com/show_bug.cgi?id=497355Reported-by: Jiří Klimeš <jklimes@redhat.com>
Signed-off-by: Pavel Šimerda <psimerda@redhat.com>

11a3e5c4

turn Makefile more distribution friendly · 06ec9039

Pavel Šimerda authored Apr 13, 2015

Changes:

 * Accept directory settings from environment.
 * Remove redundant ROOTDIR variable.
 * Set KERNEL_INCLUDE default to '/usr/include'.
 * Use CFLAGS from environemnt.

Note: In the long term it might be better to improve the configure
script to generate those parts of the Makefile in a manner similar
to autoconf. It might be even practical to autotoolize the package.
Signed-off-by: Pavel Šimerda <psimerda@redhat.com>

06ec9039

13 Apr, 2015 9 commits

tc: add support for connmark action · b8d5c9a7

Felix Fietkau authored Feb 15, 2015

Add ability to add the netfilter connmark support.

Typical usage:
...lets tag outgoing icmp with mark 0x10..
iptables -tmangle -A PREROUTING -p icmp -j CONNMARK --set-mark 0x10
..add on ingress of $ETH an extractor for connmark...
tc filter add dev $ETH parent ffff: prio 4 protocol ip \
u32 match ip protocol 1 0xff \
flowid 1:1 \
action connmark continue
...if the connmark was 0x11, we police to a ridic rate of 10Kbps
tc filter add dev $ETH parent ffff: prio 5 protocol ip \
handle 0x11 fw flowid 1:1 \
action police rate 10kbit burst 10k

Other ways to use the connmark is to supply the zone, index and
branching choice. Refer to help.
Signed-off-by: Felix Fietkau <nbd@openwrt.org>
Signed-off-by: Jamal Hadi Salim <jhs@mojatatu.com>

b8d5c9a7

update kernel headers and add tc_connmark.h · 94f66538
Stephen Hemminger authored Apr 13, 2015
```
Needed for later tc action patches
```
94f66538

iproute2: unify naming for entries offloaded to hardware · aa05b988

Andy Gospodarek authored Apr 10, 2015

The kernel now has the capability to offload FDB and FIB entries to hardware.
It is important to let users know if table entries are also offloaded to
hardware.  Currently offloaded FDB entries are indicated by the existence of
the flag 'external' on the entry as of the following commit:

commit 28467b7f
Author: Scott Feldman <sfeldma@gmail.com>
Date:   Thu Dec 4 09:57:15 2014 +0100

    bridge/fdb: add flag/indication for FDB entry synced from offload device

When the patch to add support for indicating that FIB entries were also
offloaded as posted to netdev by Scott Feldman it became clear that 'external'
would not be an ideal name for routes.  There could definitely be confusion
about what this might mean since many routes are to external networks -- a
collision/confusion that did not happen with FDB.

Scott Feldman asked me to check with others and build concensus around a name.
After speaking with several people about this I am proposing we refer to both
FDB and FIB entries that are currently backed by hardware (based on the work
done in rocker) with the flag 'offload' appended to the end ofthe entry.

Some people liked the string 'external,' others liked 'hardware,' but the point
is to communicate that these routes are available to something that will will
offload the forwarding normally done by the kernel.  Since the term 'offload'
is used so frequently it seems appropriate to use the same language in
ip/bridge output.

The term 'offload' also seems to resonate with many of the people who have
responded on Scott's original thread or to those who I reached out to directly
and did respond to my query, so it seems we have reached consensus that it
should be the term used going forward.

v2: rebased against net-next branch
Signed-off-by: Andy Gospodarek <gospo@cumulusnetworks.com>
CC: Jamal Hadi Salim <jhs@mojatatu.com>
CC: Jeff Kirsher <jeffrey.t.kirsher@intel.com>
CC: Jiri Pirko <jiri@resnulli.us>
CC: John W. Linville <linville@tuxdriver.com>
CC: Roopa Prabhu <roopa@cumulusnetworks.com>
CC: Scott Feldman <sfeldma@gmail.com>
CC: Stephen Hemminger <stephen@networkplumber.org>

aa05b988

Merge branch 'master' into net-next · 93531fac
Stephen Hemminger authored Apr 13, 2015

93531fac
fix whitespace · 672acc72
Stephen Hemminger authored Apr 13, 2015

672acc72
v4.0.0 · aed6d85d
Stephen Hemminger authored Apr 13, 2015

aed6d85d

ipnetns: add a runtime check for RTM_GETNSID support · 4c7d9a58

Nicolas Dichtel authored Apr 13, 2015

The goal of this patch is to test during the runtime if the command RTM_GETNSID
is supported by the kernel.
Signed-off-by: Nicolas Dichtel <nicolas.dichtel@6wind.com>

4c7d9a58

Revert "ip netns: Fix rtnl error while print netns list" · 5a2ce868
Nicolas Dichtel authored Apr 13, 2015
```
This reverts commit d116ff34.
Signed-off-by: Nicolas Dichtel <nicolas.dichtel@6wind.com>
```
5a2ce868
Revert "configure: add missing INCLUDE to netnsid detection" · 694ed195
Nicolas Dichtel authored Apr 13, 2015
```
This reverts commit d059de70.
Signed-off-by: Nicolas Dichtel <nicolas.dichtel@6wind.com>
```
694ed195

10 Apr, 2015 8 commits

tc, bpf: finalize eBPF support for cls and act front-end · 6256f8c9

Daniel Borkmann authored Apr 01, 2015

This work finalizes both eBPF front-ends for the classifier and action
part in tc, it allows for custom ELF section selection, a simplified tc
command frontend (while keeping compat), reusing of common maps between
classifier and actions residing in the same object file, and exporting
of all map fds to an eBPF agent for handing off further control in user
space.

It also adds an extensive example of how eBPF can be used, and a minimal
self-contained example agent that dumps map data. The example is well
documented and hopefully provides a good starting point into programming
cls_bpf and act_bpf.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Cc: Alexei Starovoitov <ast@plumgrid.com>
Cc: Jiri Pirko <jiri@resnulli.us>
Cc: Jamal Hadi Salim <jhs@mojatatu.com>
Acked-by: Alexei Starovoitov <ast@plumgrid.com>
Acked-by: Thomas Graf <tgraf@suug.ch>
Acked-by: Jiri Pirko <jiri@resnulli.us>
Acked-by: Jamal Hadi Salim <jhs@mojatatu.com>

6256f8c9

Merge branch 'master' into net-next · f0eb8da5
Stephen Hemminger authored Apr 10, 2015

f0eb8da5
man tc: Add description about class name option · 943c30de
Vadim Kochan authored Apr 08, 2015
```
Signed-off-by: Vadim Kochan <vadim4j@gmail.com>
```
943c30de

configure: add missing INCLUDE to netnsid detection · d059de70

Jiri Benc authored Apr 08, 2015

Fixes: d116ff34 ("ip netns: Fix rtnl error while print netns list")
Signed-off-by: Jiri Benc <jbenc@redhat.com>

d059de70

xfrm: revise man page and document ip xfrm policy set · 811aca04

Christophe Gouault authored Apr 09, 2015

- document ip xfrm policy set
- update ip xfrm monitor documentation
- in DESCRIPTION section, reorganize grouping of commands
Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>

811aca04

xfrm: add command for configuring SPD hash table · 025fa9dc

Christophe Gouault authored Apr 09, 2015

add a new command to configure the SPD hash table:
   ip xfrm policy set [ hthresh4 LBITS RBITS ] [ hthresh6 LBITS RBITS ]

and code to display the SPD hash configuration:
  ip -s -s xfrm policy count

hthresh4: defines minimum local and remote IPv4 prefix lengths of
selectors to hash a policy. If prefix lengths are greater or equal
to the thresholds, then the policy is hashed, otherwise it falls back
in the policy_inexact chained list.

hthresh6: defines minimum local and remote IPv6 prefix lengths of
selectors to hash a policy, otherwise it falls back
in the policy_inexact chained list.

Example:

% ip -s -s xfrm policy count
         SPD IN  0 OUT 0 FWD 0 (Sock: IN 0 OUT 0 FWD 0)
         SPD buckets: count 7 Max 1048576
         SPD IPv4 thresholds: local 32 remote 32
         SPD IPv6 thresholds: local 128 remote 128

% ip xfrm pol set hthresh4 24 16 hthresh6 64 56

% ip -s -s xfrm policy count
         SPD IN  0 OUT 0 FWD 0 (Sock: IN 0 OUT 0 FWD 0)
         SPD buckets: count 7 Max 1048576
         SPD IPv4 thresholds: local 24 remote 16
         SPD IPv6 thresholds: local 64 remote 56
Signed-off-by: Christophe Gouault <christophe.gouault@6wind.com>

025fa9dc

update kernel headers for net-next · e46efaed
Stephen Hemminger authored Apr 10, 2015
```
Current santized kernel headers from net-next
```
e46efaed

xfrm: fix build with later kernel headers · 93390779

Stephen Hemminger authored Apr 10, 2015

Need to include netinet/in.h to get the correct glibc headers
instead of getting definitions in linux/in6.h

93390779

07 Apr, 2015 6 commits
- Merge branch 'master' into net-next · bd733e40
  Stephen Hemminger authored Apr 07, 2015
```
Conflicts:
	man/man8/ip-route.8.in
```
  bd733e40
- docs: make spacing consistent · a89d5329
  Pavel Šimerda authored Apr 07, 2015
```
Result of the following command:

    sed -ri 's/\.  /. /g' man/*/*
Signed-Off-By: Pavel Šimerda <psimerda@redhat.com>
```
  a89d5329
- man ip-link: Add missing link types - vti,ipvlan,nlmon · b6d6b5a1
  Vadim Kochan authored Apr 04, 2015
```
Signed-off-by: Vadim Kochan <vadim4j@gmail.com>
```
  b6d6b5a1
- ip-link: Align usage at [link-netns ID] line · 21107f52
  Vadim Kochan authored Apr 04, 2015
```
Output of the usage was shifted be cause of missing TAB
Signed-off-by: Vadim Kochan <vadim4j@gmail.com>
```
  21107f52
- man ip-netns: Fix shifted layout at bottom of 'ip netns del' · bbf2f7c6
  Vadim Kochan authored Apr 02, 2015
```
Signed-off-by: Vadim Kochan <vadim4j@gmail.com>
```
  bbf2f7c6
- tc class: Ignore if default class name file does not exist · 8b90a990
  Vadim Kochan authored Mar 25, 2015
```
If '-nm' specified that do not fail if there is no
default class names file in /etc/iproute2.

Changed default class name file cls_names -> tc_cls.
Signed-off-by: Vadim Kochan <vadim4j@gmail.com>
```
  8b90a990
24 Mar, 2015 14 commits

ip: support RFC4191 router preference · 194e9b85

Lubomir Rintel authored Mar 16, 2015

This allows querying and setting the route preference. It's usually set from
the IPv6 Neighbor Discovery Router Advertisement messages.

Introduced in "ipv6: expose RFC4191 route preference via rtnetlink", enqueued
for Linux 4.1.
Signed-off-by: Lubomir Rintel <lkundrak@v3.sk>

194e9b85

add basic mpls support to iproute · dacc5d41

Eric W. Biederman authored Mar 15, 2015

- Pull in the uapi mpls.h
- Update rtnetlink.h to include the mpls rtnetlink notification multicast group.
- Define AF_MPLS in utils.h if it is not defined from elsewhere
  as is done with AF_DECnet

The address syntax for multiple mpls labels is a complete invention.
When I looked there seemed to be no wide spread convention for talking
about an mpls label stack in text for.  Sometimes people did:
"{ Label1, Label2, Label3 }", sometimes people would do:
"[ label3, label2, label1 ]", and most of the time label
stacks were not explicitly shown at all.

The syntax I wound up using, so it would not have spaces and so it
would visually distinct from other kinds of addresses is.

label1/label2/label3 Where label1 is the label at the top of the label
stack and label3 is the label at the bottom on the label stack.

When there is a single label this matches what seems to be convention
with other tools.  Just print out the numeric value of the mpls label.

The netlink protocol for labels uses the on the wire format for a
label stack. The ttl and traffic class are expected to be 0.  Using
the on the wire format is common and what happens with other address
types. BGP when passing label stacks also uses this technique with the
exception that the ttl byte is not included making each label in a BGP
label stack 3 bytes instead of 4.
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

dacc5d41

add support for the RTA_NEWDST attribute. · 6f7a9f4d

Eric W. Biederman authored Mar 15, 2015

This attribute is like RTA_DST except it specifies the destination
address to place on a packet when it leaves the host.  For ip based
protocols this is destination NAT and not a common part of forwarding.
For protocols like MPLS label swapping is something that typically
happens on every hop.

There is likely to be a RTA_NEWSRC at some point so RTA_NEWDST
is printed as "as to"  and can be specified either as "as to"
or just "as"
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

6f7a9f4d

add support for the RTA_VIA attribute · 93ae2835

Eric W. Biederman authored Mar 15, 2015

Add support for the RTA_VIA attribute that specifies an address family
as well as an address for the next hop gateway.

To make it easy to pass this reorder inet_prefix so that it's tail
is a proper RTA_VIA attribute.
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

93ae2835

misc whitespace cleanup · 8e8f8de4
Eric W. Biederman authored Mar 15, 2015

8e8f8de4

add address family to/from string helper functions. · 45c90d19

Eric W. Biederman authored Mar 15, 2015

Add the functions family_name and read_family to convert an address
family to a string and to convernt a string to an address family.
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

45c90d19

add support for printing AF_PACKET addresses · 0b218ab1
Eric W. Biederman authored Mar 15, 2015
```
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
```
0b218ab1

make the addr argument of ll_addr_n2a const · 71b4d59b

Eric W. Biederman authored Mar 15, 2015

This avoids build warnings when AF_PACKET support is added
to rt_addr_n2a.
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

71b4d59b

add a source addres length parameter to rt_addr_n2a · 26dcdf3a

Eric W. Biederman authored Mar 15, 2015

For some address families (like AF_PACKET) it is helpful to have the
length when prenting the address.
Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>

26dcdf3a

tc: add eBPF support to f_bpf · 11c39b5e

Daniel Borkmann authored Mar 16, 2015

This work adds the tc frontend for kernel commit e2e9b6541dd4 ("cls_bpf:
add initial eBPF support for programmable classifiers").

A C-like classifier program (f.e. see e2e9b6541dd4) is being compiled via
LLVM's eBPF backend into an ELF file, that is then being passed to tc. tc
then loads, if any, eBPF maps and eBPF opcodes (with fixed-up eBPF map file
descriptors) out of its dedicated sections, and via bpf(2) into the kernel
and then the resulting fd via netlink down to cls_bpf. cls_bpf allows for
annotations, currently, I've used the file name for that, so that the user
can easily identify his filter when dumping configurations back.

Example usage:

clang -O2 -emit-llvm -c cls.c -o - | llc -march=bpf -filetype=obj -o cls.o
tc filter add dev em1 parent 1: bpf run object-file cls.o classid x:y

tc filter show dev em1 [...]
filter parent 1: protocol all pref 49152 bpf handle 0x1 flowid x:y cls.o

I placed the parser bits derived from Alexei's kernel sample, into tc_bpf.c
as my next step is to also add the same support for BPF action, so we can
have a fully fledged eBPF classifier and action in tc.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@plumgrid.com>

11c39b5e

update kernel headers to net-next 4.0-rc5 · cbdc3ed8
Stephen Hemminger authored Mar 24, 2015
```
Lastest features
```
cbdc3ed8

misc: header rebase, add bpf.h · b54ac87e

Daniel Borkmann authored Mar 16, 2015

Include the bpf.h uapi header file.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>

b54ac87e

ip: enable configuring multicast group autojoin · e31867ac

Madhu Challa authored Mar 04, 2015

Joining multicast group on ethernet level via "ip maddr" command would
not work if we have an Ethernet switch that does igmp snooping since
the switch would not replicate multicast packets on ports that did not
have IGMP reports for the multicast addresses.

Linux vxlan interfaces created via "ip link add vxlan" have the group option
that enables then to do the required join.

By extending ip address command with option "autojoin" we can get similar
functionality for openvswitch vxlan interfaces as well as other tunneling
mechanisms that need to receive multicast traffic.

example:
ip address add 224.1.1.10/24 dev eth5 autojoin
ip address del 224.1.1.10/24 dev eth5

e31867ac

route: label externally offloaded routes · 655444bd

Scott Feldman authored Mar 07, 2015

On ip route print dump, label externally offloaded routes with "external".
Offloaded routes are flagged with RTNH_F_EXTERNAL, a recent additon to
net-next.  For example:

$ ip route
default via 192.168.0.2 dev eth0
11.0.0.0/30 dev swp1  proto kernel  scope link  src 11.0.0.2 external
11.0.0.4/30 via 11.0.0.1 dev swp1  proto zebra  metric 20 external
11.0.0.8/30 dev swp2  proto kernel  scope link  src 11.0.0.10 external
11.0.0.12/30 via 11.0.0.9 dev swp2  proto zebra  metric 20 external
12.0.0.2  proto zebra  metric 30 external
        nexthop via 11.0.0.1  dev swp1 weight 1
        nexthop via 11.0.0.9  dev swp2 weight 1
12.0.0.3 via 11.0.0.1 dev swp1  proto zebra  metric 20 external
12.0.0.4 via 11.0.0.9 dev swp2  proto zebra  metric 20 external
192.168.0.0/24 dev eth0  proto kernel  scope link  src 192.168.0.15
Signed-off-by: Scott Feldman <sfeldma@gmail.com>
Reviewed-by: Jiri Pirko <jiri@resnulli.us>

655444bd