linux

korg/linux

mirror of https://mirrors.bfsu.edu.cn/git/linux.git synced 2024-11-16 00:34:20 +08:00

History

Doug Ledford 29f27e8477 IB/cma: Use cached gids The cma_acquire_dev function was changed by commit `3c86aa70bf` ("RDMA/cm: Add RDMA CM support for IBoE devices") to use find_gid_port() because multiport devices might have either IB or IBoE formatted gids. The old function assumed that all ports on the same device used the same GID format. However, when it was changed to use find_gid_port(), we inadvertently lost usage of the GID cache. This turned out to be a very costly change. In our testing, each iteration through each index of the GID table takes roughly 35us. When you have multiple devices in a system, and the GID you are looking for is on one of the later devices, the code loops through all of the GID indexes on all of the early devices before it finally succeeds on the target device. This pathological search behavior combined with 35us per GID table index retrieval results in results such as the following from the cmtime application that's part of the latest librdmacm git repo: ib1: step total ms max ms min us us / conn create id : 29.42 0.04 1.00 2.94 bind addr : 186705.66 19.00 18556.00 18670.57 resolve addr : 41.93 9.68 619.00 4.19 resolve route: 486.93 0.48 101.00 48.69 create qp : 4021.95 6.18 330.00 402.20 connect : 68350.39 68588.17 24632.00 6835.04 disconnect : 1460.43 252.65-1862269.00 146.04 destroy : 41.16 0.04 2.00 4.12 ib0: step total ms max ms min us us / conn create id : 28.61 0.68 1.00 2.86 bind addr : 2178.86 2.95 201.00 217.89 resolve addr : 51.26 16.85 845.00 5.13 resolve route: 620.08 0.43 92.00 62.01 create qp : 3344.40 6.36 273.00 334.44 connect : 6435.99 6368.53 7844.00 643.60 disconnect : 5095.38 321.90 757.00 509.54 destroy : 37.13 0.02 2.00 3.71 Clearly, both the bind address and connect operations suffer a huge penalty for being anything other than the default GID on the first port in the system. After applying this patch, the numbers now look like this: ib1: step total ms max ms min us us / conn create id : 30.15 0.03 1.00 3.01 bind addr : 80.27 0.04 7.00 8.03 resolve addr : 43.02 13.53 589.00 4.30 resolve route: 482.90 0.45 100.00 48.29 create qp : 3986.55 5.80 330.00 398.66 connect : 7141.53 7051.29 5005.00 714.15 disconnect : 5038.85 193.63 918.00 503.88 destroy : 37.02 0.04 2.00 3.70 ib0: step total ms max ms min us us / conn create id : 34.27 0.05 1.00 3.43 bind addr : 26.45 0.04 1.00 2.64 resolve addr : 38.25 10.54 760.00 3.82 resolve route: 604.79 0.43 97.00 60.48 create qp : 3314.95 6.34 273.00 331.49 connect : 12399.26 12351.10 8609.00 1239.93 disconnect : 5096.76 270.72 1015.00 509.68 destroy : 37.10 0.03 2.00 3.71 It's worth noting that we still suffer a bit of a penalty on connect to the wrong device, but the penalty is much less than it used to be. Follow on patches deal with this penalty. Many thanks to Neil Horman for helping to track the source of slow function that allowed us to track down the fact that the original patch I mentioned above backed out cache usage and identify just how much that impacted the system. Signed-off-by: Doug Ledford <dledford@redhat.com> Signed-off-by: Roland Dreier <roland@purestorage.com>		2013-11-08 14:42:24 -08:00
..
accessibility
acpi	ACPI / PM: Drop two functions that are not used any more	2013-10-17 15:44:48 +02:00
amba
ata	Merge branch 'for-3.12-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/libata	2013-10-22 08:21:34 +01:00
atm	atm: nicstar: fix regression made by previous patch	2013-09-15 22:18:55 -04:00
auxdisplay
base	driver core: Release device_hotplug_lock when store_mem_state returns EINVAL	2013-10-16 18:42:41 -07:00
bcma	bcma: make bcma_core_pci_{up,down}() callable from atomic context	2013-09-26 14:02:33 -04:00
block	cciss: fix info leak in cciss_ioctl32_passthru()	2013-09-24 17:00:26 -07:00
bluetooth	Bluetooth: btusb: Add support for Belkin F8065bf	2013-09-23 17:44:25 -03:00
bus	bus: mvebu-mbus: Fix optional pcie-mem/io-aperture properties	2013-10-01 17:04:17 +00:00
cdrom
char	A small fix for Xen on x86_32 and a build fix for xen-tpmfront on arm64.	2013-10-15 16:22:11 -07:00
clk	The common clk framework changes for 3.12 are dominated by clock driver	2013-09-09 15:49:04 -07:00
clocksource	clocksource: em_sti: Set cpu_possible_mask to fix SMP broadcast	2013-09-26 02:31:04 +02:00
connector	connector: use 'size' everywhere in cn_netlink_send()	2013-10-02 16:03:50 -04:00
cpufreq	acpi-cpufreq: Fail initialization if driver cannot be registered	2013-10-25 16:22:47 +02:00
cpuidle	ACPI and power management fixes for 3.12-rc1	2013-09-12 11:22:45 -07:00
crypto	Merge git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux	2013-09-07 12:53:35 +10:00
dca
devfreq
dio
dma	dmaengine: edma: fix another memory leak	2013-10-24 22:17:50 +05:30
edac	Merge git://git.kernel.org/pub/scm/linux/kernel/git/cmetcalf/linux-tile	2013-09-06 11:14:33 -07:00
eisa
extcon	Driver core patches for 3.12-rc1	2013-09-03 11:37:15 -07:00
firewire
firmware	firmware/dmi_scan: drop OOM messages	2013-09-11 15:58:51 -07:00
fmc
gpio	gpiolib: let gpiod_request() return -EPROBE_DEFER	2013-10-11 16:31:54 +02:00
gpu	Merge branch 'drm-fixes-3.12' of git://people.freedesktop.org/~agd5f/linux into drm-fixes	2013-10-22 07:35:17 +01:00
hid	HID: wiimote: add LEGO-wiimote VID	2013-10-21 13:39:37 +02:00
hsi
hv	Revert "Drivers: hv: vmbus: Fix a bug in channel rescind code"	2013-10-17 09:30:10 -07:00
hwmon	hwmon: (applesmc) Always read until end of data	2013-10-09 09:48:55 -07:00
hwspinlock
i2c	i2c: i2c-mux-pinctrl: use deferred probe when adapter not found	2013-10-10 10:22:35 +02:00
ide	ide: sgiioc4: Staticize ioc4_ide_attach_one()	2013-09-05 15:21:30 -04:00
idle
iio	iio:buffer: Free active scan mask in iio_disable_all_buffers()	2013-10-15 19:26:43 +01:00
infiniband	IB/cma: Use cached gids	2013-11-08 14:42:24 -08:00
input	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input	2013-09-15 07:13:39 -04:00
iommu	x86, build, pci: Fix PCI_MSI build on !SMP	2013-10-04 10:43:34 -07:00
ipack
irqchip	ARM: SoC late changes for v3.12	2013-09-09 16:35:29 -07:00
isdn	isdn: hfcpci_softirq: get func return to suppress compiler warning	2013-09-15 22:27:04 -04:00
leds	Merge branch 'for-next' of git://git.kernel.org/pub/scm/linux/kernel/git/cooloney/linux-leds	2013-09-12 11:35:33 -07:00
lguest	lguest: fix guest kernel stack overflow when TF bit set.	2013-09-06 08:09:27 +09:30
macintosh
mailbox	mailbox: remove unnecessary platform_set_drvdata()	2013-09-18 12:02:00 -07:00
md	raid5: avoid finding "discard" stripe	2013-10-24 13:00:24 +11:00
media	[media] saa7134: Fix crash when device is closed before streamoff	2013-10-14 06:37:00 -03:00
memory	ARM: SoC platform changes for 3.12	2013-09-06 13:30:06 -07:00
memstick	memstick: add support for legacy memorysticks	2013-09-11 15:59:35 -07:00
message
mfd	Remove GENERIC_HARDIRQ config option	2013-09-13 15:09:52 +02:00
misc	mei: cancel stall timers in mei_reset	2013-09-26 13:56:53 -07:00
mmc	ARM: shmobile: update SDHI DT compatibility string to the <unit>-<soc> format	2013-09-21 09:55:32 -07:00
mtd	mtd: gpmi: fix ECC regression	2013-10-25 10:09:43 -07:00
net	Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net	2013-10-23 07:47:42 +01:00
nfc
ntb	ntb: clean up unnecessary MSI/MSI-X capability find	2013-09-05 11:08:01 -07:00
nubus
of	Revert "drivers: of: add initialization code for dma reserved memory"	2013-10-15 09:26:07 +01:00
oprofile	oprofile: get rid of pointless forward declarations of struct super_block	2013-09-03 22:52:48 -04:00
parisc	PCI changes for the v3.12 merge window:	2013-09-03 16:24:35 -07:00
parport
pci	ACPI / hotplug / PCI: Drop WARN_ON() from acpiphp_enumerate_slots()	2013-10-12 01:49:48 +02:00
pcmcia
pinctrl	pinctrl: palmas: do not abort pin configuration for BIAS_DEFAULT	2013-09-27 16:22:43 +02:00
platform	platform/x86: fix asus-wmi build error	2013-10-23 07:57:57 +01:00
pnp	pnp: change pnp bus pm_ops to invoke pnp driver dev_pm_ops if specified	2013-09-11 15:58:15 -07:00
power	Remove GENERIC_HARDIRQ config option	2013-09-13 15:09:52 +02:00
pps	Remove GENERIC_HARDIRQ config option	2013-09-13 15:09:52 +02:00
ps3
ptp
pwm	ARM: SoC cleanups for 3.12	2013-09-06 13:21:16 -07:00
rapidio
regulator	Merge remote-tracking branch 'regulator/fix/wm8350' into regulator-linus	2013-09-30 12:04:33 +01:00
remoteproc
reset
rpmsg
rtc	rtc: simplify devm_request_mem_region/devm_ioremap	2013-09-11 15:58:59 -07:00
s390	s390/time: correct use of store clock fast	2013-10-22 09:16:40 +02:00
sbus
scsi	[SCSI] sd: call blk_pm_runtime_init before add_disk	2013-10-23 14:09:18 +01:00
sfi
sh
sn
spi	Merge remote-tracking branch 'spi/fix/s3c64xx' into spi-linus	2013-10-07 14:51:59 +01:00
ssb
staging	Merge branch 'v4l_for_linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media	2013-10-23 07:52:36 +01:00
target	target/pscsi: fix return value check	2013-10-25 10:42:09 -07:00
tc
thermal	Merge branch 'x86_pkg_temp' of .git into for-rc	2013-10-21 11:26:45 +08:00
tty	serial: vt8500: add missing braces	2013-10-16 13:22:16 -07:00
uio	Driver core patches for 3.12-rc1	2013-09-03 11:37:15 -07:00
usb	usb: misc: usb3503: Fix compile error due to incorrect regmap depedency	2013-10-16 13:52:16 -07:00
uwb
vfio	VFIO: vfio_iommu_type1: fix bug caused by break in nested loop	2013-10-11 10:40:46 -06:00
vhost	vhost/scsi: Fix incorrect usage of get_user_pages_fast write parameter	2013-10-25 11:03:34 -07:00
video	video: mxsfb: Add missing break	2013-09-26 15:33:23 +03:00
virt
virtio	virtio_pci: pm: Use CONFIG_PM_SLEEP instead of CONFIG_PM	2013-09-09 10:02:53 +09:30
vlynq
vme
w1	w1 - call request_module with w1 master mutex unlocked	2013-10-07 00:12:14 -07:00
watchdog	watchdog: sunxi: Fix section mismatch	2013-10-13 20:02:03 +02:00
xen	Bug-fixes:	2013-09-25 15:50:53 -07:00
zorro
Kconfig
Makefile