iproute2

mirror of https://git.kernel.org/pub/scm/network/iproute2/iproute2.git synced 2024-11-16 06:25:35 +08:00

Author	SHA1	Message	Date
Nikolay Aleksandrov	720f8613bd	bridge: vlan: add support to show global vlan options Add support for new bridge vlan command grouping called global which operates on global options. The first command it supports is "show". To do that we update print_vlan_rtm to recognize the global vlan options attribute and parse it properly. Man page and help are also updated with the new command. Syntax is: $ bridge vlan global show [ vid VID ] [ dev DEV ] Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-31 21:21:04 -06:00
Nikolay Aleksandrov	d2eecb9d1d	ip: bridge: add support for mcast_vlan_snooping Add support for mcast_vlan_snooping option which controls per-vlan multicast snooping, also update the man page. Syntax: $ ip link set dev bridge type bridge mcast_vlan_snooping 0/1 Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-31 21:20:03 -06:00
Gokul Sivakumar	10ecd12690	man: bridge: fix the typo to change "-c[lor]" into "-c[olor]" in man page Fixes: `3a1ca9a5b` ("bridge: update man page for new color and json changes") Signed-off-by: Gokul Sivakumar <gokulkumar792@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-08-18 14:04:53 -07:00
Parav Pandit	355c49ffa5	devlink: Show port state values in man page and in the help command Port function state can have either of the two values - active or inactive. Update the documentation and help command for these two values to tell user about it. With the introduction of state, hw_addr and state are optional. Hence mark them as optional in man page that also aligns with the help command output. Fixes: `bdfb9f1bd6` ("devlink: Support set of port function state") Signed-off-by: Parav Pandit <parav@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-08-11 15:02:30 -07:00
Peilin Ye	e78411948d	tc/skbmod: Introduce SKBMOD_F_ECN option Recently we added SKBMOD_F_ECN option support to the kernel; support it in the tc-skbmod(8) front end, and update its man page accordingly. The 2 least significant bits of the Traffic Class field in IPv4 and IPv6 headers are used to represent different ECN states [1]: 0b00: "Non ECN-Capable Transport", Non-ECT 0b10: "ECN Capable Transport", ECT(0) 0b01: "ECN Capable Transport", ECT(1) 0b11: "Congestion Encountered", CE This new option, "ecn", marks ECT(0) and ECT(1) IPv{4,6} packets as CE, which is useful for ECN-based rate limiting. For example: $ tc filter add dev eth0 parent 1: protocol ip prio 10 \ u32 match ip protocol 1 0xff flowid 1:2 \ action skbmod \ ecn The updated tc-skbmod SYNOPSIS looks like the following: tc ... action skbmod { set SETTABLE | swap SWAPPABLE | ecn } ... Only one of "set", "swap" or "ecn" shall be used in a single tc-skbmod command. Trying to use more than one of them at a time is considered undefined behavior; pipe multiple tc-skbmod commands together instead. "set" and "swap" only affect Ethernet packets, while "ecn" only affects IP packets. Depends on kernel patch "net/sched: act_skbmod: Add SKBMOD_F_ECN option support", as well as iproute2 patch "tc/skbmod: Remove misinformation about the swap action". [1] https://en.wikipedia.org/wiki/Explicit_Congestion_Notification Reviewed-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Peilin Ye <peilin.ye@bytedance.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-08 11:56:55 -06:00
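The ECN codepoints listed in the commit above live in the two least significant bits of the IPv4 TOS / IPv6 Traffic Class byte. The following is only a rough sketch of the marking the "ecn" option is described as performing; it is not the kernel's act_skbmod code, and the helper name `mark_ce` is made up for this illustration.

```python
# ECN codepoints, as listed in the commit message: the two least
# significant bits of the IPv4 TOS / IPv6 Traffic Class byte.
NOT_ECT = 0b00
ECT_1 = 0b01
ECT_0 = 0b10
CE = 0b11
ECN_MASK = 0b11


def mark_ce(traffic_class):
    """Return the byte with ECT(0)/ECT(1) rewritten to CE.

    Hypothetical helper for illustration: Non-ECT and already-CE bytes are
    returned unchanged, and the upper six (DSCP) bits are never modified.
    """
    ecn = traffic_class & ECN_MASK
    if ecn in (ECT_0, ECT_1):
        return traffic_class | CE
    return traffic_class


if __name__ == "__main__":
    # Walk through all four codepoints plus one byte with DSCP bits set.
    for tc in (0x00, 0x01, 0x02, 0x03, 0xB8 | ECT_0):
        print(f"{tc:#04x} -> {mark_ce(tc):#04x}")
```

A real implementation working on IPv4 packets would also have to fix up the header checksum after changing the byte; that step is left out here.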
David Ahern	09d8ce3db1	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-04 09:24:12 -06:00
Justin Iurman	78832863ef	IOAM man8 This patch provides man8 documentation for IOAM inside ip, ip-ioam and ip-route. Signed-off-by: Justin Iurman <justin.iurman@uliege.be> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-02 11:33:35 -06:00
Gokul Sivakumar	cf866f0a5a	ipneigh: add support to print brief output of neigh cache in tabular format Make use of the already available brief flag and print the basic details of the IPv4 or IPv6 neighbour cache in a tabular format for better readability when the brief output is expected. $ ip -br neigh 172.16.12.100 bridge0 b0:fc:36:2f:07:43 172.16.12.174 bridge0 8c:16:45:2f:bc:1c 172.16.12.250 bridge0 04:d9:f5:c1:0c:74 fe80::267b:9f70:745e:d54d bridge0 b0:fc:36:2f:07:43 fd16:a115:6a62:0:8744:efa1:9933:2c4c bridge0 8c:16:45:2f:bc:1c fe80::6d9:f5ff:fec1:c74 bridge0 04:d9:f5:c1:0c:74 And add "ip neigh show" to the list of ip sub commands mentioned in the man page that support the brief output in tabular format. Signed-off-by: Gokul Sivakumar <gokulkumar792@gmail.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-02 10:14:50 -06:00
Peilin Ye	c06d313d86	tc/skbmod: Remove misinformation about the swap action Currently man 8 tc-skbmod says that "...the swap action will occur after any smac/dmac substitutions are executed, if they are present." This is false. In fact, trying to "set" and "swap" in a single skbmod command causes the "set" part to be completely ignored. As an example: $ tc filter add dev eth0 parent 1: protocol ip prio 10 \ matchall action skbmod \ set dmac AA:AA:AA:AA:AA:AA smac BB:BB:BB:BB:BB:BB \ swap mac The above command simply does a "swap", without setting DMAC or SMAC to AA's or BB's. The root cause of this is in the kernel, see net/sched/act_skbmod.c:tcf_skbmod_init(): parm = nla_data(tb[TCA_SKBMOD_PARMS]); index = parm->index; if (parm->flags & SKBMOD_F_SWAPMAC) lflags = SKBMOD_F_SWAPMAC; ^^^^^^^^^^^^^^^^^^^^^^^^^^ Doing a "=" instead of "|=" clears all other "set" flags when doing a "swap". Discourage using "set" and "swap" in the same command by documenting it as undefined behavior, and update the "SYNOPSIS" section as well as tc -help text accordingly. If one really needs to e.g. "set" DMAC to all AA's then "swap" DMAC and SMAC, one should do two separate commands and "pipe" them together. Reviewed-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Peilin Ye <peilin.ye@bytedance.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-07-22 15:14:29 -07:00
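The effect of the plain assignment quoted in the commit above can be reproduced with ordinary bit flags. The sketch below is an assumption-laden model, not kernel code: the flag values are the ones I believe the kernel UAPI header uses and should be treated as illustrative, and both function names are invented. The OR-ing variant is shown only for contrast; the commit documents the behavior rather than changing it.

```python
# Flag values believed to match the kernel's tc_skbmod.h; treat them as
# illustrative constants for this sketch.
SKBMOD_F_DMAC = 0x1
SKBMOD_F_SMAC = 0x2
SKBMOD_F_ETYPE = 0x4
SKBMOD_F_SWAPMAC = 0x8


def flags_with_assignment(parm_flags, lflags):
    """Model of 'lflags = SKBMOD_F_SWAPMAC': earlier "set" bits are lost."""
    if parm_flags & SKBMOD_F_SWAPMAC:
        lflags = SKBMOD_F_SWAPMAC
    return lflags


def flags_with_or(parm_flags, lflags):
    """Hypothetical '|=' variant, for contrast: earlier bits survive."""
    if parm_flags & SKBMOD_F_SWAPMAC:
        lflags |= SKBMOD_F_SWAPMAC
    return lflags


if __name__ == "__main__":
    # "set dmac ... smac ..." has already recorded these two bits.
    set_bits = SKBMOD_F_DMAC | SKBMOD_F_SMAC
    print(hex(flags_with_assignment(SKBMOD_F_SWAPMAC, set_bits)))
    print(hex(flags_with_or(SKBMOD_F_SWAPMAC, set_bits)))
```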
Christian Schürmann	1f2c908d53	man8/ip-tunnel.8: fix typo, 'encaplim' is not a valid option Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-07-15 09:31:51 -07:00
David Ahern	02c06ffc13	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-07-01 14:29:42 +00:00
Asbjørn Sloth Tønnesen	2ff4761db4	tc: pedit: add decrement operation Implement a decrement operation for ttl and hoplimit. Since this is just syntactic sugar, it goes that: tc filter add ... action pedit ex munge ip ttl dec ... tc filter add ... action pedit ex munge ip6 hoplimit dec ... is just a more readable version of this: tc filter add ... action pedit ex munge ip ttl add 0xff ... tc filter add ... action pedit ex munge ip6 hoplimit add 0xff ... This feature was suggested by some pseudo tc examples in Mellanox's documentation[1], but wasn't present in either their mlnx-iproute2 or iproute2. Tested with skip_sw on Mellanox ConnectX-6 Dx. [1] https://docs.mellanox.com/pages/viewpage.action?pageId=47033989 v3: - Use dedicated flags argument in parse_cmd() (David Ahern) - Minor rewording of the man page v2: - Fix whitespace issue (Stephen Hemminger) - Add to usage info in explain() Signed-off-by: Asbjørn Sloth Tønnesen <asbjorn@asbjorn.st> Acked-by: Jamal Hadi Salim <jhs@mojatatu.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-26 04:45:19 +00:00
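The equivalence between "dec" and "add 0xff" in the commit above follows from 8-bit wraparound: adding 255 modulo 256 is the same as subtracting 1. A small sketch, assuming the add on a one-byte field such as TTL or hop limit wraps modulo 256 (the helper names are invented for this example):

```python
def add_u8(value, addend):
    """Add two values the way a one-byte field would, wrapping modulo 256."""
    return (value + addend) & 0xFF


def ttl_dec(ttl):
    """Decrement expressed as 'add 0xff', as described in the commit."""
    return add_u8(ttl, 0xFF)


if __name__ == "__main__":
    for ttl in (64, 2, 1):
        print(ttl, "->", ttl_dec(ttl))
```

In this model a field that is already 0 wraps around to 255, since nothing in the arithmetic stops at zero.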
Stephen Hemminger	8316825a52	man: fix syntax for ip link property The ip link property add/delete requires a device; but the device argument was not shown on the man page. It is correct in the usage message. Fixes: `3aa0e51be6` ("ip: add support for alternative name addition/deletion/list") Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-06-24 11:54:04 -07:00
Paolo Lungaroni	3e26254f31	seg6: add support for SRv6 End.DT46 Behavior We introduce the new "End.DT46" action for supporting the SRv6 End.DT46 Behavior in iproute2. The SRv6 End.DT46 Behavior, defined in RFC 8986 [1] section 4.8, can be used to implement L3 VPNs based on Segment Routing over IPv6 networks in multi-tenants environments and it is capable of handling both IPv4 and IPv6 tenant traffic at the same time. The SRv6 End.DT46 Behavior decapsulates the received packets and it performs the IPv4 or IPv6 routing lookup in the routing table of the tenant. As for the End.DT4 and for the End.DT6 in VRF mode, the SRv6 End.DT46 Behavior leverages a VRF device in order to force the routing lookup into the associated routing table using the "vrftable" attribute. To make the End.DT46 work properly, it must be guaranteed that the routing table used for routing lookup operations is bound to one and only one VRF during the tunnel creation. Such constraint has to be enforced by enabling the VRF strict_mode sysctl parameter, i.e.: $ sysctl -wq net.vrf.strict_mode=1 Note that the same approach is used for the End.DT4 Behavior and for the End.DT6 Behavior in VRF mode. An SRv6 End.DT46 Behavior instance can be created as follows: $ ip -6 route add 2001:db8::1 encap seg6local action End.DT46 vrftable 100 dev vrf100 Standard Output: $ ip -6 route show 2001:db8::1 2001:db8::1 encap seg6local action End.DT46 vrftable 100 dev vrf100 metric 1024 pref medium JSON Output: $ ip -6 -j -p route show 2001:db8::1 [ { "dst": "2001:db8::1", "encap": "seg6local", "action": "End.DT46", "vrftable": 100, "dev": "vrf100", "metric": 1024, "flags": [ ], "pref": "medium" } ] This patch updates the route.8 man page and the ip route help with the information related to End.DT46. Considering that the same information was missing for the SRv6 End.DT4 and the End.DT6 Behaviors, we have also added it. 
[1] https://www.rfc-editor.org/rfc/rfc8986.html#name-enddt46-decapsulation-and-s Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it> Signed-off-by: Paolo Lungaroni <paolo.lungaroni@uniroma2.it> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-22 15:36:17 +00:00
Dmytro Linkin	dedf895184	devlink: Add ISO/IEC switch Add -i/--iec switch to print rate values using binary prefixes. Update devlink(8) and devlink-rate(8) pages. Signed-off-by: Dmytro Linkin <dlinkin@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-12 04:38:13 +00:00
Dmytro Linkin	6c70aca76e	devlink: Add port func rate support Implement user commands to manage devlink port func rate objects. List all rate commands: $ devlink port func rate help or just $ devlink port func rate To list all OR particular rate object: $ devlink port func rate show pci/0000:03:00.0/some_group: type node pci/0000:03:00.0/0: type leaf pci/0000:03:00.0/1: type leaf $ devlink port func rate show pci/0000:03:00.0/1 pci/0000:03:00.0/0: type leaf $ devlink port func rate show pci/0000:03:00.0/some_group pci/0000:03:00.0/some_group: type node Rate object of type "leaf" is created by its driver, where the name is the name of the corresponding devlink port. Rate object of type "node" represents a rate group created by the user using commands: $ devlink port func rate add pci/0000:03:00.0/some_group or with defining tx rate limits $ devlink port func rate add pci/0000:03:00.0/some_group \ tx_share 10kbit tx_max 100mbit NOTE: node name cannot be a decimal value because it conflicts with devlink port indexes. To delete node object: $ devlink port func rate del pci/0000:03:00.0/some_group Set rate limits of existing rate object: $ devlink port func rate set pci/0000:03:00.0/0 \ tx_share 5MBps tx_max 25GBps $ devlink port func rate set pci/0000:03:00.0/some_group \ tx_share 0 Both SET and ADD commands accept any units of rates defined in IEC 60027-2 standard. NOTE: rate value 0 means that rate is unlimited. Such value is also omitted in show command output. NOTE: In SHOW command output rate values will be printed with suffixes as well, but in JSON output they are always units of Bps. Set or unset parent of existing rate object: $ devlink port func rate set pci/0000:03:00.0/0 parent some_group $ devlink port func rate set pci/0000:03:00.0/0 noparent NOTE: Setting parent to empty ("") name due to kernel logic means unset parent and shouldn't be used to avoid unexpected parent unsets. 
Signed-off-by: Dmytro Linkin <dlinkin@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-12 04:38:06 +00:00
Parav Pandit	fbd4b581cb	devlink: Add optional controller user input A user optionally provides the external controller number when user wants to create devlink port for the external controller. An example on eswitch system: $ devlink dev eswitch set pci/0033:01:00.0 mode switchdev $ devlink port show pci/0033:01:00.0/196607: type eth netdev enP51p1s0f0np0 flavour physical port 0 splittable false pci/0033:01:00.0/131072: type eth netdev eth0 flavour pcipf controller 1 pfnum 0 external true splittable false function: hw_addr 00:00:00:00:00:00 $ devlink port add pci/0033:01:00.0 flavour pcisf pfnum 0 sfnum 77 controller 1 pci/0033:01:00.0/163840: type eth netdev eth1 flavour pcisf controller 1 pfnum 0 sfnum 77 external true splittable false function: hw_addr 00:00:00:00:00:00 state inactive opstate detached Signed-off-by: Parav Pandit <parav@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-11 02:28:49 +00:00
Roman Mashak	9d9b1a84a5	ss: update ss man page '-b' option allows to request BPF filter opcodes, however currently the kernel returns only classic BPF filter, so reflect this in man page. Signed-off-by: Roman Mashak <mrv@mojatatu.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-06-01 15:55:06 -07:00
Ariel Levkovich	7fda6c588a	tc: f_flower: Add option to match on related ct state Add support for matching on ct_state flag related. The related state indicates a packet is associated with an existing connection. Example: $ tc filter add dev ens1f0_0 ingress prio 1 chain 1 proto ip flower \ ct_state -est-rel+trk \ action mirred egress redirect dev ens1f0_1 $ tc filter add dev ens1f0_0 ingress prio 1 chain 1 proto ip flower \ ct_state +rel+trk \ action mirred egress redirect dev ens1f0_1 Signed-off-by: Ariel Levkovich <lariel@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-27 14:39:14 +00:00
Paolo Lungaroni	02ca3aabe9	seg6: add counters support for SRv6 Behaviors We introduce the "count" optional attribute for supporting counters in SRv6 Behaviors as defined in [1], section 6. For each SRv6 Behavior instance, counters defined in [1] are: - the total number of packets that have been correctly processed; - the total amount of traffic in bytes of all packets that have been correctly processed; In addition, we introduce a new counter that counts the number of packets that have NOT been properly processed (i.e. errors) by an SRv6 Behavior instance. Each SRv6 Behavior instance can be configured, at the time of its creation, to make use of counters specifying the "count" attribute as follows: $ ip -6 route add 2001:db8::1 encap seg6local action End count dev eth0 per-behavior counters can be shown by adding "-s" to the iproute2 command line, i.e.: $ ip -s -6 route show 2001:db8::1 2001:db8::1 encap seg6local action End packets 0 bytes 0 errors 0 dev eth0 [1] https://www.rfc-editor.org/rfc/rfc8986.html#name-counters v2: - add help and route.8 man page updates Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it> Signed-off-by: Paolo Lungaroni <paolo.lungaroni@uniroma2.it> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-09 22:20:59 +00:00
Tyson Moore	0d95472a4b	tc-cake: update docs to include LE diffserv Linux kernel commit b8392808eb3fc28e ("sch_cake: add RFC 8622 LE PHB support to CAKE diffserv handling") added packets with LE diffserv to the Bulk priority tin. Update the documentation to reflect this change. Signed-off-by: Tyson Moore <tyson@tyson.me> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-06 14:59:52 +00:00
Jethro Beekman	d56dcd3549	ip: Add nodst option to macvlan type source The default behavior for source MACVLAN is to duplicate packets to appropriate type source devices, and then do the normal destination MACVLAN flow. This patch adds an option to skip destination MACVLAN processing if any matching source MACVLAN device has the option set. This allows setting up a "catch all" device for source MACVLAN: create one or more devices with type source nodst, and one device with e.g. type vepa, and incoming traffic will be received on exactly one device. Signed-off-by: Jethro Beekman <kernel@jbeekman.nl> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-28 15:45:59 +00:00
Neta Ostrovsky	9b272e138d	rdma: Add SRQ resource tracking information Sample output: $ rdma res show srq dev ibp8s0f0 srqn 0 type BASIC pdn 3 comm [ib_ipoib] dev ibp8s0f0 srqn 4 type BASIC lqpn 125-128,130-140 pdn 9 pid 3581 comm ibv_srq_pingpon dev ibp8s0f0 srqn 5 type BASIC lqpn 141-156 pdn 10 pid 3584 comm ibv_srq_pingpon dev ibp8s0f0 srqn 6 type BASIC lqpn 157-172 pdn 11 pid 3590 comm ibv_srq_pingpon dev ibp8s0f1 srqn 0 type BASIC pdn 3 comm [ib_ipoib] dev ibp8s0f1 srqn 1 type BASIC lqpn 329-344 pdn 4 pid 3586 comm ibv_srq_pingpon $ rdma res show srq lqpn 126-141 dev ibp8s0f0 srqn 4 type BASIC lqpn 126-128,130-140 pdn 9 pid 3581 comm ibv_srq_pingpon dev ibp8s0f0 srqn 5 type BASIC lqpn 141 pdn 10 pid 3584 comm ibv_srq_pingpon $ rdma res show srq lqpn 127 dev ibp8s0f0 srqn 4 type BASIC lqpn 127 pdn 9 pid 3581 comm ibv_srq_pingpon Reviewed-by: Ido Kalir <idok@nvidia.com> Reviewed-by: Mark Zhang <markz@mellanox.com> Signed-off-by: Neta Ostrovsky <netao@nvidia.com> Signed-off-by: Leon Romanovsky <leonro@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-28 15:37:16 +00:00
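Judging only from the sample output in the commit above, filtering by `lqpn` appears to print the overlap between each SRQ's QP-number ranges and the requested range. The sketch below reproduces that overlap computation on the strings from the sample; it is an interpretation of the output, not the rdma tool's code, and all function names are invented.

```python
def parse_ranges(spec):
    """Parse a string such as '125-128,130-140' into (low, high) pairs."""
    ranges = []
    for part in spec.split(","):
        low, _, high = part.partition("-")
        ranges.append((int(low), int(high or low)))
    return ranges


def intersect(ranges, wanted):
    """Clip every (low, high) pair to the wanted (low, high) range."""
    want_low, want_high = wanted
    clipped = []
    for low, high in ranges:
        a, b = max(low, want_low), min(high, want_high)
        if a <= b:
            clipped.append((a, b))
    return clipped


def format_ranges(ranges):
    """Render pairs back into the compact 'a-b,c' notation."""
    return ",".join(str(a) if a == b else f"{a}-{b}" for a, b in ranges)


if __name__ == "__main__":
    # Ranges taken from the sample output for srqn 4 and srqn 5.
    for spec in ("125-128,130-140", "141-156"):
        print(format_ranges(intersect(parse_ranges(spec), (126, 141))))
```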
Neta Ostrovsky	4278941285	rdma: Add context resource tracking information Sample output: $ rdma res show ctx dev ibp8s0f0 ctxn 0 pid 980 comm ibv_rc_pingpong dev ibp8s0f0 ctxn 1 pid 981 comm ibv_rc_pingpong dev ibp8s0f0 ctxn 2 pid 992 comm ibv_rc_pingpong dev ibp8s0f1 ctxn 0 pid 984 comm ibv_rc_pingpong dev ibp8s0f1 ctxn 1 pid 987 comm ibv_rc_pingpong $ rdma res show ctx dev ibp8s0f1 dev ibp8s0f1 ctxn 0 pid 984 comm ibv_rc_pingpong dev ibp8s0f1 ctxn 1 pid 987 comm ibv_rc_pingpong Reviewed-by: Mark Zhang <markz@mellanox.com> Reviewed-by: Ido Kalir <idok@nvidia.com> Signed-off-by: Neta Ostrovsky <netao@nvidia.com> Signed-off-by: Leon Romanovsky <leonro@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-28 15:36:59 +00:00
Nikolay Aleksandrov	c311404780	bridge: monitor: add support for vlan monitoring Add support for vlan activity monitoring, we display vlan notifications on vlan add/del/options change. The man page and help are also updated accordingly. Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-22 05:13:39 +00:00
Nikolay Aleksandrov	e5f87c8341	bridge: vlan: add support for the new rtm dump call Use the new bridge vlan rtm dump helper to dump all of the available vlan information when -details (-d) is used with vlan show. It is also capable of dumping vlan stats if -statistics (-s) is added. Currently this is the only interface capable of dumping per-vlan options. The vlan dump format is compatible with current vlan show, it uses the same helpers to dump vlan information. The new addition is one line which will contain the per-vlan options (similar to ip -d link show for ports). Currently only the vlan STP state is printed. The call uses compressed vlan format by default. Example: $ bridge -s -d vlan show port vlan-id virbr1 1 PVID Egress Untagged state forwarding Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-22 05:13:34 +00:00
Nikolay Aleksandrov	04e2783d5e	bridge: vlan: add option set command and state option Add a new per-vlan option set command. It allows to manipulate vlan options, those can be bridge-wide or per-port depending on what device is specified. The first option that can be set is the vlan STP state, it is identical to the bridge port STP state. The man page is also updated accordingly. Example: $ bridge vlan set vid 10 dev br0 state learning or a range: $ bridge vlan set vid 10-20 dev swp1 state blocking Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-22 05:13:24 +00:00
Florian Westphal	ff619e4fd3	mptcp: add support for event monitoring This adds iproute2 support for mptcp event monitoring, e.g. creation, establishment, address announcements from the peer, subflow establishment and so on. While the kernel-generated events are primarily aimed at mptcpd (e.g. for subflow management), this is also useful for debugging. This adds print support for the existing events. Sample output of 'ip mptcp monitor': [ CREATED] token=83f3a692 remid=0 locid=0 saddr4=10.0.1.2 daddr4=10.0.1.1 sport=58710 dport=10011 [ ESTABLISHED] token=83f3a692 remid=0 locid=0 saddr4=10.0.1.2 daddr4=10.0.1.1 sport=58710 dport=10011 [SF_ESTABLISHED] token=83f3a692 remid=0 locid=1 saddr4=10.0.2.2 daddr4=10.0.1.1 sport=40195 dport=10011 backup=0 [ CLOSED] token=83f3a692 Signed-off-by: Florian Westphal <fw@strlen.de>	2021-04-22 05:10:25 +00:00
Baowen Zheng	cf9ae1bd31	police: add support for packet-per-second rate limiting Allow a policer action to enforce a rate-limit based on packets-per-second, configurable using a packet-per-second rate and burst parameters. e.g. # $TC actions add action police pkts_rate 1000 pkts_burst 200 index 1 # $TC actions ls action police total acts 1 action order 0: police 0x1 rate 0bit burst 0b mtu 4096Mb pkts_rate 1000 pkts_burst 200 ref 1 bind 0 Signed-off-by: Baowen Zheng <baowen.zheng@corigine.com> Signed-off-by: Simon Horman <simon.horman@netronome.com> Signed-off-by: Louis Peens <louis.peens@netronome.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-30 03:04:50 +00:00
Sabrina Dubroca	3c75135835	ip: xfrm: add support for tfcpad This patch adds support for setting and displaying the Traffic Flow Confidentiality attribute for an XFRM state, which allows padding ESP packets to a specified length. Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-21 17:15:07 +00:00
Ido Schimmel	2be6d18b30	nexthop: Add support for nexthop buckets Add ability to dump multiple nexthop buckets and get a specific one. Example: # ip nexthop add id 10 group 1/2 type resilient buckets 8 # ip nexthop id 1 via 192.0.2.2 dev dummy10 scope link id 2 via 192.0.2.19 dev dummy20 scope link id 10 group 1/2 type resilient buckets 8 idle_timer 120 unbalanced_timer 0 unbalanced_time 0 # ip nexthop bucket id 10 index 0 idle_time 28.1 nhid 2 id 10 index 1 idle_time 28.1 nhid 2 id 10 index 2 idle_time 28.1 nhid 2 id 10 index 3 idle_time 28.1 nhid 2 id 10 index 4 idle_time 28.1 nhid 1 id 10 index 5 idle_time 28.1 nhid 1 id 10 index 6 idle_time 28.1 nhid 1 id 10 index 7 idle_time 28.1 nhid 1 # ip nexthop bucket show nhid 1 id 10 index 4 idle_time 53.59 nhid 1 id 10 index 5 idle_time 53.59 nhid 1 id 10 index 6 idle_time 53.59 nhid 1 id 10 index 7 idle_time 53.59 nhid 1 # ip nexthop bucket get id 10 index 5 id 10 index 5 idle_time 81 nhid 1 # ip -j -p nexthop bucket get id 10 index 5 [ { "id": 10, "bucket": { "index": 5, "idle_time": 104.89, "nhid": 1 }, "flags": [ ] } ] Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:01:25 +00:00
Ido Schimmel	9167671822	nexthop: Add support for resilient nexthop groups Add ability to configure resilient nexthop groups and show their current configuration. Example: # ip nexthop add id 10 group 1/2 type resilient buckets 8 # ip nexthop show id 10 id 10 group 1/2 type resilient buckets 8 idle_timer 120 unbalanced_timer 0 # ip -j -p nexthop show id 10 [ { "id": 10, "group": [ { "id": 1 },{ "id": 2 } ], "type": "resilient", "resilient_args": { "buckets": 8, "idle_timer": 120, "unbalanced_timer": 0 }, "flags": [ ] } ] Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:01:18 +00:00
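The JSON form shown in the commit above is convenient for scripts. As a minimal sketch, the snippet below parses the sample output copied from that commit message and pulls out the group members and bucket count; the function name `summarize` is hypothetical, and the field names are only those visible in the sample.

```python
import json

# Sample copied from the commit message (output of
# "ip -j -p nexthop show id 10").
SAMPLE = """
[ { "id": 10, "group": [ { "id": 1 },{ "id": 2 } ], "type": "resilient",
    "resilient_args": { "buckets": 8, "idle_timer": 120,
                        "unbalanced_timer": 0 },
    "flags": [ ] } ]
"""


def summarize(text):
    """Return (id, type, member ids, buckets) for each nexthop object."""
    rows = []
    for nh in json.loads(text):
        members = [member["id"] for member in nh.get("group", [])]
        buckets = nh.get("resilient_args", {}).get("buckets")
        rows.append((nh["id"], nh.get("type"), members, buckets))
    return rows


if __name__ == "__main__":
    for row in summarize(SAMPLE):
        print(row)
```

Using `.get()` for the optional keys keeps the sketch from failing on nexthop objects that are not groups or not resilient.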
Ido Schimmel	b82d6b81fa	nexthop: Add ability to specify group type Next patches are going to add a 'resilient' nexthop group type, so allow users to specify the type using the 'type' argument. Currently, only 'mpath' type is supported. These two commands are equivalent: # ip nexthop add id 10 group 1/2/3 # ip nexthop add id 10 group 1/2/3 type mpath Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:00:49 +00:00
Paolo Abeni	42fbca91cd	mptcp: add support for port based endpoint The feature is supported by the kernel since 5.11-net-next, let's allow user-space to use it. Just parse and dump an additional, per endpoint, u16 attribute Signed-off-by: Paolo Abeni <pabeni@redhat.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-01 00:15:10 +00:00
Stephen Hemminger	52c5f3f043	Merge git://git.kernel.org/pub/scm/network/iproute2/iproute2-next	2021-02-23 23:03:42 -08:00
Vladimir Oltean	e1d79d49ed	man8/bridge.8: be explicit that "flood" is an egress setting Talking to various people, it became apparent that there is a certain ambiguity in the description of these flags. They refer to egress flooding, which should perhaps be stated more clearly. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
Vladimir Oltean	14f528a556	man8/bridge.8: explain self vs master for "bridge fdb add" The "usually hardware" and "usually software" distinctions make no sense, try to clarify what these do based on the actual kernel behavior. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
Vladimir Oltean	b64ceb687d	man8/bridge.8: fix which one of self/master is default for "bridge fdb" The bridge program does: fdb_modify: /* Assume self */ if (!(req.ndm.ndm_flags&(NTF_SELF|NTF_MASTER))) req.ndm.ndm_flags |= NTF_SELF; which is clearly against the documented behavior. The only thing we can do, sadly, is update the documentation. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
Vladimir Oltean	10130bfafe	man8/bridge.8: explain what a local FDB entry is Explaining the "local" flag by saying that it is "a local permanent fdb entry" is not very helpful, be more specific. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
Vladimir Oltean	ae3cb3d34d	man8/bridge.8: document that "local" is default for "bridge fdb add" The bridge does this: fdb_modify: /* Assume permanent */ if (!(req.ndm.ndm_state&(NUD_PERMANENT|NUD_REACHABLE))) req.ndm.ndm_state |= NUD_PERMANENT; So let's make the user aware of the fact that if they don't want local entries, they need to specify some other flag like "static". Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
Vladimir Oltean	1261459c64	man8/bridge.8: document the "permanent" flag for "bridge fdb add" The bridge program parses "local" and "permanent" in just the same way, so it makes sense to tell that to users: fdb_modify: } else if (matches(argv, "local") == 0 || matches(argv, "permanent") == 0) { req.ndm.ndm_state |= NUD_PERMANENT; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 11:19:38 -08:00
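The two `fdb_modify` defaults quoted in this group of bridge.8 commits ("assume self" and "assume permanent") follow the same pattern: a default bit is OR-ed in only when none of the relevant bits was requested. The sketch below models just those two quoted checks. The constant values are the ones I believe linux/neighbour.h defines and are labeled as assumptions; the function name is invented, and keyword parsing is deliberately left out.

```python
# Values believed to match linux/neighbour.h; treat as assumptions.
NTF_SELF = 0x02
NTF_MASTER = 0x04
NUD_REACHABLE = 0x02
NUD_PERMANENT = 0x80


def apply_fdb_defaults(ndm_flags, ndm_state):
    """Model of the two default checks quoted from fdb_modify."""
    # "Assume self": neither self nor master was requested.
    if not (ndm_flags & (NTF_SELF | NTF_MASTER)):
        ndm_flags |= NTF_SELF
    # "Assume permanent": neither permanent nor reachable was requested.
    if not (ndm_state & (NUD_PERMANENT | NUD_REACHABLE)):
        ndm_state |= NUD_PERMANENT
    return ndm_flags, ndm_state


if __name__ == "__main__":
    # Nothing requested: both defaults kick in.
    print(apply_fdb_defaults(0, 0))
    # "master" requested with a reachable state: nothing is added.
    print(apply_fdb_defaults(NTF_MASTER, NUD_REACHABLE))
```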
Oleksandr Mazur	c946f5d3e4	devlink: add support for port params get/set Add implementation for the port parameters getting/setting. Add bash completion for port param. Add man description for port param. Signed-off-by: Oleksandr Mazur <oleksandr.mazur@plvision.eu> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-11 09:21:24 -07:00
Parav Pandit	c2ecc82b9d	vdpa: Add vdpa tool vdpa tool is created to create, delete and query vdpa devices. examples: Show vdpa management device that supports creating, deleting vdpa devices. $ vdpa mgmtdev show vdpasim: supported_classes net $ vdpa mgmtdev show -jp { "show": { "vdpasim": { "supported_classes": [ "net" ] } } } Create a vdpa device of type networking named as "foo2" from the management device vdpasim_net: $ vdpa dev add mgmtdev vdpasim_net name foo2 Show the newly created vdpa device by its name: $ vdpa dev show foo2 foo2: type network mgmtdev vdpasim_net vendor_id 0 max_vqs 2 max_vq_size 256 $ vdpa dev show foo2 -jp { "dev": { "foo2": { "type": "network", "mgmtdev": "vdpasim_net", "vendor_id": 0, "max_vqs": 2, "max_vq_size": 256 } } } Delete the vdpa device after its use: $ vdpa dev del foo2 Signed-off-by: Parav Pandit <parav@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-11 09:09:15 -07:00
Paul Blakey	049708a002	tc: flower: Add support for ct_state reply flag Matches on conntrack rpl ct_state. Example: $ tc filter add dev ens1f0_0 ingress prio 1 chain 1 proto ip flower \ ct_state +trk+est+rpl \ action mirred egress redirect dev ens1f0_1 $ tc filter add dev ens1f0_1 ingress prio 1 chain 1 proto ip flower \ ct_state +trk+est-rpl \ action mirred egress redirect dev ens1f0_0 Signed-off-by: Paul Blakey <paulb@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-04 21:54:28 -07:00
Maxim Mikityanskiy	b8b8b6d4c9	tc/htb: Hierarchical QoS hardware offload This commit adds support for configuring HTB in offload mode. HTB offload eliminates the single qdisc lock in the datapath and offloads the algorithm to the NIC. The new 'offload' parameter is added to enable this mode: # tc qdisc replace dev eth0 root handle 1: htb offload Classes are created as usual, but filters should be moved to clsact for lock-free classification (filters attached to HTB itself are not supported in the offload mode): # tc filter add dev eth0 egress protocol ip flower dst_port 80 action skbedit priority 1:10 tc qdisc show and tc class show will indicate whether the offload is enabled. Example output: $ tc qdisc show dev eth1 qdisc htb 1: root offloaded r2q 10 default 0 direct_packets_stat 0 direct_qlen 1000 offload qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p qdisc pfifo 0: parent 1: limit 1000p $ tc class show dev eth1 class htb 1:101 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:1 root rate 100Gbit ceil 100Gbit burst 0b cburst 0b offload class htb 1:103 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:102 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:105 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:104 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:107 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:106 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload class htb 1:108 parent 1:1 prio 0 rate 4Gbit ceil 4Gbit burst 1000b cburst 1000b offload $ tc -j qdisc show dev eth1 
[{"kind":"htb","handle":"1:","root":true,"offloaded":true,"options":{"r2q":10,"default":"0","direct_packets_stat":0,"direct_qlen":1000,"offload":null}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}},{"kind":"pfifo","handle":"0:","parent":"1:","options":{"limit":1000}}] Signed-off-by: Maxim Mikityanskiy <maximmi@mellanox.com> Reviewed-by: Tariq Toukan <tariqt@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-04 21:54:13 -07:00
Thayne McCombs	38957a2f6c	ss: Add clarification about host conditions with multiple families to man In creating documentation for expressions I ran into an interesting case where if you use two different family types in the expression, such as in `ss 'sport inet:ssh or src unix:/run/*'`, then you would only get the results for one address family (in this case unix sockets). The reason is that in parse_hostcond if the family is specified we remove any previously added families from filter->families, and preserve the "states" if any states are set. I tried changing this to not reset the families, but ran into some issues with Invalid Argument errors in inet_show_netlink, I think related to the state. I can dig into that more if supporting this is useful, but I'm not sure if these types of expressions would actually be useful in practice. Or perhaps an error should be given if an expression contains conditions with multiple families (besides inet and inet6)? Anyway, for now, this patch just notes the limitation in the man page. Signed-off-by: Thayne McCombs <astrothayne@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:30:40 -08:00
Thayne McCombs	df361a27c2	Add documentation of ss filter to man page This adds some documentation of the syntax for the FILTER argument to the ss command to the ss (8) man page. Signed-off-by: Thayne McCombs <astrothayne@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:24:03 -08:00
Vladimir Oltean	4712a46174	man: tc-taprio.8: document the full offload feature Since this feature's introduction in commit 9c66d1564676 ("taprio: Add support for hardware offloading") from kernel v5.4, it never got documented in the man pages. Due to this reason, we see customer reports of seemingly contradictory information: the community manpages claim there is no support for full offload, nonetheless many silicon vendors have already implemented it. This patch documents the full offload feature (enabled by specifying "flags 2" to the taprio qdisc) and gives one more example that tries to illustrate some of the finer points related to the usage. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:12:27 -08:00
Guillaume Nault	86d9660dc1	iplink_bareudp: cleanup help message and man page * Fix PROTO description in help message (mpls isn't a valid argument). * Remove SRCPORTMIN description from help message since it doesn't appear in the syntax string. * Use same keywords in help message and in man page. * Use the "ethertype" option name (.B ethertype) rather than the option value (.I ETHERTYPE) in the man page description of [no]multiproto. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:11:32 -08:00
Parav Pandit	bdfb9f1bd6	devlink: Support set of port function state Support set operation of the devlink port function state. Example of a PCI SF port function which supports the state: $ devlink dev eswitch set pci/0000:06:00.0 mode switchdev $ devlink port show pci/0000:06:00.0/65535: type eth netdev ens2f0np0 flavour physical port 0 splittable false $ devlink port add pci/0000:06:00.0 flavour pcisf pfnum 0 sfnum 88 pci/0000:08:00.0/32768: type eth netdev eth6 flavour pcisf controller 0 pfnum 0 sfnum 88 splittable false function: hw_addr 00:00:00:00:00:00 state inactive opstate detached $ devlink port show pci/0000:06:00.0/32768 pci/0000:06:00.0/32768: type eth netdev ens2f0npf0sf88 flavour pcisf controller 0 pfnum 0 sfnum 88 splittable false function: hw_addr 00:00:00:00:00:00 state inactive opstate detached $ devlink port function set pci/0000:06:00.0/32768 hw_addr 00:00:00:00:88:88 state active $ devlink port show pci/0000:06:00.0/32768 -jp { "port": { "pci/0000:06:00.0/32768": { "type": "eth", "netdev": "ens2f0npf0sf88", "flavour": "pcisf", "controller": 0, "pfnum": 0, "sfnum": 88, "splittable": false, "function": { "hw_addr": "00:00:00:00:88:88", "state": "active", "opstate": "attached" } } } } Signed-off-by: Parav Pandit <parav@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-02 02:06:48 +00:00
Parav Pandit	331bf89ad0	devlink: Supporting add and delete of devlink port Enable user to add and delete the devlink port. Examples for adding and deleting one SF port: Examples of add, show and delete commands: $ devlink dev eswitch set pci/0000:06:00.0 mode switchdev $ devlink port show pci/0000:06:00.0/65535: type eth netdev ens2f0np0 flavour physical port 0 splittable false Add devlink port of flavour 'pcipf' for PF number 0 SF number 88: $ devlink port add pci/0000:06:00.0 flavour pcisf pfnum 0 sfnum 88 pci/0000:06:00.0/32768: type eth netdev eth6 flavour pcisf controller 0 pfnum 0 sfnum 88 splittable false function: hw_addr 00:00:00:00:00:00 state inactive opstate detached Delete newly added devlink port $ devlink port del pci/0000:06:00.0/32768 Signed-off-by: Parav Pandit <parav@nvidia.com> Reviewed-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-02 02:06:36 +00:00
wenxu	c94fd71b34	tc: flower: add tc conntrack inv ct_state support Matches on conntrack inv ct_state. Signed-off-by: wenxu <wenxu@ucloud.cn> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-23 18:16:35 +00:00
Petr Machata	89d11ea596	dcb: Add a subtool for the DCBX object The Linux DCBX object is a 1-byte bitfield of flags that configure whether the DCBX protocol is implemented in the device or in the host, and which version of the protocol should be used. Add a tool to access the per-port Linux DCBX object. For example: # dcb dcbx set dev eni1np1 host ieee # dcb dcbx show dev eni1np1 host ieee Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-18 04:09:29 +00:00
Petr Machata	8e9bed1493	dcb: Add a subtool for the DCB APP object DCB APP interfaces are standardized in 802.1q-2018, and allow configuration of traffic prioritization rules based on several possible headers. Add a dcb subtool for maintenance and display of the APP table. For example: # dcb app add dev eni1np1 dscp-prio 0:0 CS3:3 CS6:6 # dcb app show dev eni1np1 dscp-prio 0:0 CS3:3 CS6:6 # dcb app add dev eni1np1 dscp-prio CS3:4 # dcb app show dev eni1np1 dscp-prio 0:0 CS3:3 CS3:4 CS6:6 # dcb app replace dev eni1np1 dscp-prio CS3:5 # dcb app show dev eni1np1 dscp-prio 0:0 CS3:5 CS6:6 Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-18 04:09:29 +00:00
Petr Machata	0aebd32b82	dcb: Support -N to suppress translation to human-readable names Some DSCP values can be translated to symbolic names. That may not be always desirable. Introduce a command-line option similar to other tools, -N or --Numeric, to suppress this translation. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-18 04:09:29 +00:00
David Ahern	fa2881b664	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-18 03:57:29 +00:00
Petr Machata	934919b991	dcb: Change --Netns/-N to --netns/-n This to keep compatible with the major tools, ip and tc. Also document the option in the man page, which was neglected. Fixes: `67033d1c1c` ("Add skeleton of a new tool, dcb") Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-01-16 09:12:15 -08:00
Ido Schimmel	9bd498bfcd	ipmonitor: Mention "nexthop" object in help and man page Before: # ip monitor help Usage: ip monitor [ all \| LISTofOBJECTS ] [ FILE ] [ label ] [all-nsid] [dev DEVICE] LISTofOBJECTS := link \| address \| route \| mroute \| prefix \| neigh \| netconf \| rule \| nsid FILE := file FILENAME After: # ip monitor help Usage: ip monitor [ all \| LISTofOBJECTS ] [ FILE ] [ label ] [all-nsid] [dev DEVICE] LISTofOBJECTS := link \| address \| route \| mroute \| prefix \| neigh \| netconf \| rule \| nsid \| nexthop FILE := file FILENAME Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-10 17:17:32 +00:00
David Ahern	c01dec8475	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-16 04:06:06 +00:00
Thomas Karlsson	42f5642a40	iplink:macvlan: Added bcqueuelen parameter This patch allows the user to set and retrieve the IFLA_MACVLAN_BC_QUEUE_LEN parameter via the bcqueuelen command line argument This parameter controls the requested size of the queue for broadcast and multicast packages in the macvlan driver. If not specified, the driver default (1000) will be used. Note: The request is per macvlan but the actually used queue length per port is the maximum of any request to any macvlan connected to the same port. For this reason, the used queue length IFLA_MACVLAN_BC_QUEUE_LEN_USED is also retrieved and displayed in order to aid in the understanding of the setting. However, it can of course not be directly set. Signed-off-by: Thomas Karlsson <thomas.karlsson@paneda.se> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-16 04:02:07 +00:00
Andrea Claudi	309e6027e5	man: tc-flower: fix manpage Commit `924c43778a` ("man: tc-ct.8: Add manual page for ct tc action") added a man page for tc-ct, but it brought with it a bogus block of text at the beginning of the tc-flower man page. This commit simply removes it. Fixes: `924c43778a` ("man: tc-ct.8: Add manual page for ct tc action") Reported-by: Paolo Valerio <pvalerio@redhat.com> Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-12-14 09:22:53 -08:00
Petr Machata	117939d9bd	dcb: Add a subtool for the DCB maxrate object DCBNL maxrate interfaces are an extension to the 802.1q DCB interfaces and allow configuration of rate with which traffic in a given traffic class is sent. Add a dcb subtool to allow showing and tweaking of this per-TC maximum rate. For example: # dcb maxrate show dev eni1np1 tc-maxrate 0:25Gbit 1:25Gbit 2:25Gbit 3:25Gbit 4:25Gbit 5:25Gbit 6:100Gbit 7:25Gbit Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:42:07 +00:00
Petr Machata	2e36f91000	dcb: Add a subtool for the DCB buffer object DCBNL buffer interfaces are an extension to the 802.1q DCB interfaces and allow configuration of port headroom buffers. Add a dcb subtool to allow showing and tweaking of buffer priority mapping and buffer sizes. For example: # dcb buf show dev eni1np1 prio-buffer 0:0 1:0 2:0 3:3 4:0 5:0 6:6 7:0 buffer-size 0:10000 1:0 2:0 3:70000 4:0 5:0 6:10000 7:0 total-size 221072 Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:42:03 +00:00
Petr Machata	6567cb588b	dcb: Add a subtool for the DCB PFC object PFC, for "Priority-based Flow Control", allows configuration of priority lossiness, and related toggles. Add a dcb subtool to allow showing and tweaking of individual PFC configuration options, and querying statistics. For example: # dcb pfc show dev eni1np1 pfc-cap 8 macsec-bypass on delay 0 pg-pfc 0:off 1:on 2:off 3:off 4:off 5:off 6:off 7:on requests 0:0 1:217 2:0 3:0 4:0 5:0 6:0 7:28 indications 0:0 1:179 2:0 3:0 4:0 5:0 6:0 7:18 Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:41:58 +00:00
Petr Machata	808dd741fc	dcb: Add -i to enable IEC mode Allow switching "dcb" into the ISO/IEC mode of units by passing -i. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:41:54 +00:00
Petr Machata	6e9687db04	dcb: Add -s to enable statistics Allow selective display of statistical counters by passing -s. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:41:50 +00:00
Petr Machata	7e94711c71	man: dcb-ets: Remove an unnecessary empty line Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:41:40 +00:00
Petr Machata	a7c2eaac39	dcb: ets: Change the way show parameters are given in synopsis None, one, or many parameters can be given on the command line, but the current synopsis allows only none or one. Fix it. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-14 16:41:22 +00:00
Moshe Shemesh	f28c910274	devlink: Add devlink reload action and limit options Add reload action and reload limit to devlink reload command to enable the user to select the reload action required and constrains limits on these actions that he may want to ensure. The following reload actions are supported: driver_reinit: driver entities re-initialization, applying devlink-param and devlink-resource values. fw_activate: firmware activate. The uAPI is backward compatible, if the reload action option is omitted from the reload command, the driver reinit action will be used. Note that when required to do firmware activation some drivers may need to reload the driver. On the other hand some drivers may need to reset the firmware to reinitialize the driver entities. Therefore, the devlink reload command returns the actions which were actually performed. By default reload actions are not limited and driver implementation may include reset or downtime as needed to perform the actions. However, if reload limit is selected, the driver should perform only if it can do it while keeping the limit constraints. Reload limit added: no_reset: No reset allowed, no down time allowed, no link flap and no configuration is lost. Command examples: $devlink dev reload pci/0000:82:00.0 action driver_reinit reload_actions_performed: driver_reinit $devlink dev reload pci/0000:82:00.0 action fw_activate reload_actions_performed: driver_reinit fw_activate devlink dev reload pci/0000:82:00.1 action driver_reinit -jp { "reload": { "reload_actions_performed": [ "driver_reinit" ] } } devlink dev reload pci/0000:82:00.0 action fw_activate -jp { "reload": { "reload_actions_performed": [ "driver_reinit","fw_activate" ] } } Signed-off-by: Moshe Shemesh <moshe@mellanox.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-09 02:40:00 +00:00
Vlad Buslov	ea130da81e	tc: implement support for action terse dump Implement support for action terse dump using new TCA_ACT_FLAG_TERSE_DUMP value of TCA_ROOT_FLAGS tlv. Set the flag when user requested it with following example CLI (-br for 'brief'): $ tc -s -br actions ls action tunnel_key total acts 2 action order 0: tunnel_key index 1 Action statistics: Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0) backlog 0b 0p requeues 0 action order 1: tunnel_key index 2 Action statistics: Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0) backlog 0b 0p requeues 0 In terse mode dump only outputs essential data needed to identify the action (kind, index) and stats, if requested by the user. Signed-off-by: Vlad Buslov <vlad@buslov.dev> Suggested-by: Jamal Hadi Salim <jhs@mojatatu.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-03 03:51:06 +00:00
Vladimir Oltean	c29f65db34	bridge: add support for L2 multicast groups Extend the 'bridge mdb' command for the following syntax: bridge mdb add dev br0 port swp0 grp 01:02:03:04:05:06 permanent Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-29 20:54:02 +00:00
David Ahern	ee5d4b24e3	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 22:04:48 -07:00
Petr Machata	ef15b07601	dcb: Add a subtool for the DCB ETS object ETS, for "Enhanced Transmission Selection", is a set of configurations that permit configuration of mapping of priorities to traffic classes, traffic selection algorithm to use per traffic class, bandwidth allocation, etc. Add a dcb subtool to allow showing and tweaking of individual ETS configuration options. For example: # dcb ets show dev eni1np1 willing on ets_cap 8 cbs off tc-bw 0:0 1:0 2:0 3:0 4:100 5:0 6:0 7:0 pg-bw 0:0 1:0 2:0 3:0 4:0 5:0 6:0 7:0 tc-tsa 0:strict 1:strict 2:strict 3:strict 4:ets 5:strict 6:strict 7:strict prio-tc 0:1 1:3 2:5 3:0 4:0 5:0 6:0 7:0 reco-tc-bw 0:0 1:0 2:0 3:0 4:0 5:0 6:0 7:0 reco-tc-tsa 0:strict 1:strict 2:strict 3:strict 4:strict 5:strict 6:strict 7:strict reco-prio-tc 0:0 1:0 2:0 3:0 4:0 5:0 6:0 7:0 Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:19 -07:00
Petr Machata	67033d1c1c	Add skeleton of a new tool, dcb The Linux DCB interface allows configuration of a broad range of hardware-specific attributes, such as TC scheduling, flow control, per-port buffer configuration, TC rate, etc. Add a new tool to show that configuration and tweak it. DCB allows configuration of several objects, and possibly could expand to pre-standard CEE interfaces. Therefore the tool itself is a lean shell that dispatches to subtools each dedicated to one of the objects. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:19 -07:00
Guillaume Nault	8682f588bf	tc-mpls: fix manpage example and help message string Manpage: * Remove the extra "and to ip packets" part from command description to make it more understandable. * Redirect packets to eth1, instead of eth0, as told in the description. Help string: * "mpls pop" can be followed by a CONTROL keyword. * "mpls modify" can also set the MPLS_BOS field. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-11-08 10:49:28 -08:00
Stephen Hemminger	c8424b73e1	man: fix spelling errors Lots of little typo errors on man pages. Found by running codespell Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-11-08 10:40:30 -08:00
Vlad Buslov	477ca0dfb4	tc: implement support for terse dump Implement support for classifier/action terse dump using new TCA_DUMP_FLAGS tlv with only available flag value TCA_DUMP_FLAGS_TERSE. Set the flag when user requested it with following example CLI (-br for 'brief'): $ tc -s -br filter show dev ens1f0 ingress filter protocol ip pref 49151 flower chain 0 filter protocol ip pref 49151 flower chain 0 handle 0x1 not_in_hw action order 1: gact Action statistics: Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0) backlog 0b 0p requeues 0 filter protocol ip pref 49152 flower chain 0 filter protocol ip pref 49152 flower chain 0 handle 0x1 not_in_hw action order 1: gact Action statistics: Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0) backlog 0b 0p requeues 0 In terse mode dump only outputs essential data needed to identify the filter and action (handle, cookie, etc.) and stats, if requested by the user. The intention is to significantly improve rule dump rate by omitting all static data that do not change after rule is created. Signed-off-by: Vlad Buslov <vladbu@mellanox.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-31 09:15:15 -06:00
Guillaume Nault	02a261b5ba	m_mpls: add mac_push action Add support for the new TCA_MPLS_ACT_MAC_PUSH action (kernel commit a45294af9e96 ("net/sched: act_mpls: Add action to push MPLS LSE before Ethernet header")). This action lets TC push an MPLS header before the MAC header of a frame. Example (encapsulate all outgoing frames with label 20, then add an outer Ethernet header): # tc filter add dev ethX matchall \ action mpls mac_push label 20 ttl 64 \ action vlan push_eth dst_mac 0a:00:00:00:00:02 \ src_mac 0a:00:00:00:00:01 This patch also adds an alias for ETH_P_TEB, since it is useful when decapsulating MPLS packets that contain an Ethernet frame. With MAC_PUSH, there's no previous Ethertype to modify. However, the "protocol" option is still needed, because the kernel uses it to set skb->protocol. So rename can_modify_ethtype() to can_set_ethtype(). Also add a test suite for m_mpls, which covers the new action and the pre-existing ones. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-20 08:57:08 -06:00
Guillaume Nault	d61167dd88	m_vlan: add pop_eth and push_eth actions Add support for the new TCA_VLAN_ACT_POP_ETH and TCA_VLAN_ACT_PUSH_ETH actions (kernel commit 19fbcb36a39e ("net/sched: act_vlan: Add {POP,PUSH}_ETH actions")). These actions let TC remove or add the Ethernet header at the head of a frame. Drop an Ethernet header: # tc filter add dev ethX matchall action vlan pop_eth Push an Ethernet header (the original frame must have no MAC header): # tc filter add dev ethX matchall action vlan \ push_eth dst_mac 0a:00:00:00:00:02 src_mac 0a:00:00:00:00:01 Also add a test suite for m_vlan, which covers these new actions and the pre-existing ones. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-20 08:36:38 -06:00
David Ahern	b5a583fb32	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-11 20:11:09 -06:00
Nikolay Aleksandrov	547b319762	bridge: mdb: add support for source address This patch adds the user-space control and dump of mdb entry source address. When setting the new MDBA_SET_ENTRY_ATTRS nested attribute is used and inside is added MDBE_ATTR_SOURCE based on the address family. When dumping we look for MDBA_MDB_EATTR_SOURCE and if present we add the "src x.x.x.x" output. The source address will be always shown as it's needed to match the entry to modify it from user-space. Example: $ bridge mdb add dev bridge port ens13 grp 239.0.0.1 src 1.2.3.4 permanent vid 100 $ bridge mdb show dev bridge port ens13 grp 239.0.0.1 src 1.2.3.4 permanent vid 100 Signed-off-by: Nikolay Aleksandrov <nikolay@nvidia.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-11 20:07:25 -06:00
Antony Antony	4322b13c8d	ip xfrm: support setting XFRMA_SET_MARK_MASK attribute in states The XFRMA_SET_MARK_MASK attribute can be set in states (4.19+) It is optional and the kernel default is 0xffffffff It is the mask of XFRMA_SET_MARK(a.k.a. XFRMA_OUTPUT_MARK in 4.18) e.g. ./ip/ip xfrm state add output-mark 0x6 mask 0xab proto esp \ auth digest_null 0 enc cipher_null '' ip xfrm state src 0.0.0.0 dst 0.0.0.0 proto esp spi 0x00000000 reqid 0 mode transport replay-window 0 output-mark 0x6/0xab auth-trunc digest_null 0x30 0 enc ecb(cipher_null) anti-replay context: seq 0x0, oseq 0x0, bitmap 0x00000000 sel src 0.0.0.0/0 dst 0.0.0.0/0 Signed-off-by: Antony Antony <antony@phenome.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-07 00:10:47 -06:00
Jiri Pirko	8dc1db80e4	devlink: Add health reporter test command support Add health reporter test command and allow user to trigger a test event. Signed-off-by: Jiri Pirko <jiri@nvidia.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-07 00:08:53 -06:00
Roman Mashak	aba44dc2ea	ip: updated ip-link man page Added description of link flags allmulticast, promisc and trailers. Signed-off-by: Roman Mashak <mrv@mojatatu.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-09-14 20:42:04 -07:00
Wei Wang	ad34d5fadb	iproute2: ss: add support to expose various inet sockopts This commit adds support to expose the following inet socket options: -- recverr -- is_icsk -- freebind -- hdrincl -- mc_loop -- transparent -- mc_all -- nodefrag -- bind_address_no_port -- recverr_rfc4884 -- defer_connect with the option --inet-sockopt. The individual option is only shown when set. Signed-off-by: Wei Wang <weiwan@google.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-09-08 20:36:06 -06:00
Roopa Prabhu	6fd53b2a1c	iplink: add support for protodown reason This patch adds support for recently added link IFLA_PROTO_DOWN_REASON attribute. IFLA_PROTO_DOWN_REASON enumerates reasons for the already existing IFLA_PROTO_DOWN link attribute. $ cat /etc/iproute2/protodown_reasons.d/r.conf 0 mlag 1 evpn 2 vrrp 3 psecurity $ ip link set dev vx10 protodown on protodown_reason vrrp on $ip link show dev vx10 14: vx10: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether f2:32:28:b8:35:ff brd ff:ff:ff:ff:ff:ff protodown on protodown_reason <vrrp> $ip -p -j link show dev vx10 [ { <snip> "proto_down": true, "proto_down_reason": [ "vrrp" ] } ] $ip link set dev vx10 protodown_reason mlag on $ip link show dev vx10 14: vx10: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether f2:32:28:b8:35:ff brd ff:ff:ff:ff:ff:ff protodown on protodown_reason <mlag,vrrp> $ip -p -j link show dev vx10 [ { <snip> "proto_down": true, "protodown_reason": [ "mlag","vrrp" ] } ] $ip -p -j link show dev vx10 $ip link set dev vx10 protodown off protodown_reason vrrp off Error: Cannot clear protodown, active reasons. $ip link set dev vx10 protodown off protodown_reason mlag off $ Note: for some reason the json and non-json key for protodown are different (protodown and proto_down). I have kept the same for protodown reason for consistency (protodown_reason and proto_down_reason). Signed-off-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-09-01 19:52:13 -06:00
Murali Karicheri	ea6aeeb90c	ip: iplink: prp: update man page for new parameter PRP support requires a proto parameter which is 0 for hsr and 1 for prp. Default is hsr and is backward compatible. Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-08-22 21:14:12 -07:00
Mark Zhang	e8e8f16ed1	rdma: Document the new "pid" criteria for auto mode Document the new supported criteria of auto mode. Examples: $ rdma statistic qp set link mlx5_2/1 auto pid on $ rdma statistic qp set link mlx5_2/1 auto pid,type on Signed-off-by: Mark Zhang <markz@mellanox.com> Reviewed-by: Ido Kalir <idok@mellanox.com> Signed-off-by: Leon Romanovsky <leonro@mellanox.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2020-08-06 16:26:12 +00:00
David Ahern	e572e3af0d	Merge branch 'main' into next Conflicts: bridge/fdb.c man/man8/bridge.8 Signed-off-by: David Ahern <dsahern@kernel.org>	2020-08-06 16:21:35 +00:00
Petr Vaněk	a7f1974f6e	ip-xfrm: add support for oseq-may-wrap extra flag This flag allows to create SA where sequence number can cycle in outbound packets if set. Signed-off-by: Petr Vaněk <pv@excello.cz> Signed-off-by: David Ahern <dsahern@kernel.org>	2020-08-03 14:57:25 +00:00
Anton Danilov	65c0c4d21b	bridge: fdb: the 'dynamic' option in the show/get commands In most of cases a user wants to see only the dynamic mac addresses in the fdb output. But currently the 'fdb show' displays tons of various self entries, those only waste the output without any useful goal. New option 'dynamic' for 'show' and 'get' commands forces display only relevant records. Signed-off-by: Anton Danilov <littlesmilingcloud@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-27 16:41:39 -07:00
Vladyslav Tarasiuk	1fe8c44bd9	devlink: Update devlink-health and devlink-port manpages Describe support for per-port reporters in devlink-health and devlink-port commands. Signed-off-by: Vladyslav Tarasiuk <vladyslavt@mellanox.com> Reviewed-by: Moshe Shemesh <moshe@mellanox.com> Reviewed-by: Jiri Pirko <jiri@mellanox.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2020-07-23 00:32:37 +00:00
Anton Danilov	8f5a602f7a	misc: make the pattern matching case-insensitive To improve the usability better use case-insensitive pattern-matching in ifstat, nstat and ss tools. Signed-off-by: Anton Danilov <littlesmilingcloud@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-20 13:29:55 -07:00
David Ahern	b78c480532	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2020-07-14 23:52:43 +00:00
Eyal Birger	f33a871b80	ip xfrm: policy: support policies with IF_ID in get/delete/deleteall The XFRMA_IF_ID attribute is set in policies for them to be associated with an XFRM interface (4.19+). Add support for getting/deleting policies with this attribute. For supporting 'deleteall' the XFRMA_IF_ID attribute needs to be explicitly copied. Signed-off-by: Eyal Birger <eyal.birger@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-13 08:51:37 -07:00
Eyal Birger	ee93c1107f	ip xfrm: update man page on setting/printing XFRMA_IF_ID in states/policies In commit `aed63ae1ac` ("ip xfrm: support setting/printing XFRMA_IF_ID attribute in states/policies") I added the ability to set/print the xfrm interface ID without updating the man page. Fixes: `aed63ae1ac` ("ip xfrm: support setting/printing XFRMA_IF_ID attribute in states/policies") Signed-off-by: Eyal Birger <eyal.birger@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-13 08:51:37 -07:00
Ido Schimmel	2d4c3f65e2	devlink: Document zero policer identifier When setting a policer to a trap group, a value of "0" will unbind the currently bound policer from the group. The behavior is intentional and tested in kernel selftests, so document it. Signed-off-by: Ido Schimmel <idosch@mellanox.com> Suggested-by: Alex Kushnarov <alexanderk@mellanox.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:14:24 -07:00
Guillaume Nault	eb09a15c12	tc: flower: support multiple MPLS LSE match Add the new "mpls" keyword that can be used to match MPLS fields in arbitrary Label Stack Entries. LSEs are introduced by the "lse" keyword and followed by LSE options: "depth", "label", "tc", "bos" and "ttl". The depth is mandatory; the other options are optional. For example, the following filter drops MPLS packets having two labels, where the first label is 21 and has TTL 64 and the second label is 22: $ tc filter add dev ethX ingress proto mpls_uc flower mpls \ lse depth 1 label 21 ttl 64 \ lse depth 2 label 22 bos 1 \ action drop Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:12:43 -07:00
Guillaume Nault	a6c5c952ab	ip link: initial support for bareudp devices Bareudp devices provide a generic L3 encapsulation for tunnelling different protocols like MPLS, IP, NSH, etc. inside a UDP tunnel. This patch is based on original work from Martin Varghese: https://lore.kernel.org/netdev/1570532361-15163-1-git-send-email-martinvarghesenokia@gmail.com/ Examples: - ip link add dev bareudp0 type bareudp dstport 6635 ethertype mpls_uc This creates a bareudp tunnel device which tunnels L3 traffic with ethertype 0x8847 (unicast MPLS traffic). The destination port of the UDP header will be set to 6635. The device will listen on UDP port 6635 to receive traffic. - ip link add dev bareudp0 type bareudp dstport 6635 ethertype ipv4 multiproto Same as the MPLS example, but for IPv4. The "multiproto" keyword allows the device to also tunnel IPv6 traffic. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:11:05 -07:00
Anton Danilov	d80a05b795	tc: improve the qdisc show command Previously it was only possible to show all queueing disciplines on an interface. There was no way to get the qdisc info by handle or parent; the only option was a full dump of the disciplines followed by grep/sed. Now new and old options work as expected to filter a qdisc by handle or parent. Full syntax of the qdisc show command: tc qdisc { show \| list } [ dev STRING ] [ QDISC_ID ] [ invisible ] QDISC_ID := { root \| ingress \| handle QHANDLE \| parent CLASSID } This change doesn't require any changes in the kernel. Signed-off-by: Anton Danilov <littlesmilingcloud@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:00:51 -07:00

1052 Commits