linux/arch/ia64/kernel
Martin Hicks 753ee72896 [PATCH] VM: early zone reclaim
This is the core of the (much simplified) early reclaim.  The goal of this
patch is to reclaim some easily-freed pages from a zone before falling back
onto another zone.

One of the major uses of this is NUMA machines.  With the default allocator
behavior the allocator would look for memory in another zone, which might be
off-node, before trying to reclaim from the current zone.

This adds a zone tuneable to enable early zone reclaim.  It is selected on a
per-zone basis and is turned on/off via syscall.

Adding some extra throttling on the reclaim was also required (patch
4/4).  Without the machine would grind to a crawl when doing a "make -j"
kernel build.  Even with this patch the System Time is higher on
average, but it seems tolerable.  Here are some numbers for kernbench
runs on a 2-node, 4cpu, 8Gig RAM Altix in the "make -j" run:

			wall  user   sys   %cpu  ctx sw.  sleeps
			----  ----   ---   ----   ------  ------
No patch		1009  1384   847   258   298170   504402
w/patch, no reclaim     880   1376   667   288   254064   396745
w/patch & reclaim       1079  1385   926   252   291625   548873

These numbers are the average of 2 runs of 3 "make -j" runs done right
after system boot.  Run-to-run variability for "make -j" is huge, so
these numbers aren't terribly useful except to seee that with reclaim
the benchmark still finishes in a reasonable amount of time.

I also looked at the NUMA hit/miss stats for the "make -j" runs and the
reclaim doesn't make any difference when the machine is thrashing away.

Doing a "make -j8" on a single node that is filled with page cache pages
takes 700 seconds with reclaim turned on and 735 seconds without reclaim
(due to remote memory accesses).

The simple zone_reclaim syscall program is at
http://www.bork.org/~mort/sgi/zone_reclaim.c

Signed-off-by: Martin Hicks <mort@sgi.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2005-06-21 18:46:14 -07:00
..
acpi-ext.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
acpi.c [IA64] use common pxm function 2005-05-03 13:33:18 -07:00
asm-offsets.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
brl_emu.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
cyclone.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
domain.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
efi_stub.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
efi.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
entry.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
entry.S [PATCH] VM: early zone reclaim 2005-06-21 18:46:14 -07:00
fsys.S [IA64] Fix race condition in the rt_sigprocmask fastcall 2005-06-09 13:04:30 -07:00
gate-data.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
gate.lds.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
gate.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
head.S [IA64] Fix build errors for !HOTPLUG case. 2005-04-22 14:46:24 -07:00
ia64_ksyms.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
init_task.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
iosapic.c [IA64] iosapic.c: typo ... s/spin_unlock_irq/spin_unlock/ 2005-04-25 13:27:48 -07:00
irq_ia64.c [IA64] vector sharing (Large I/O system support) 2005-04-25 13:26:23 -07:00
irq_lsapic.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
irq.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
ivt.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
machvec.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
Makefile Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
mca_asm.S [IA64] cpu hotplug: return offlined cpus to SAL 2005-04-22 14:44:40 -07:00
mca_drv_asm.S [IA64] MCA recovery improvements 2005-05-03 13:47:42 -07:00
mca_drv.c [IA64] MCA recovery improvements 2005-05-03 13:47:42 -07:00
mca_drv.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
mca.c [IA64] Cleanup compile warnings for ski config 2005-06-01 15:20:17 -07:00
minstate.h [IA64] Fix stack placement when INIT hits in kernel mode. 2005-05-06 10:16:07 -07:00
module.c [IA64] Module gp must point to valid memory 2005-06-08 11:41:31 -07:00
pal.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
palinfo.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
patch.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
perfmon_default_smpl.c [IA64] perfmon: make pfm_sysctl a global, and other cleanup 2005-04-25 13:08:30 -07:00
perfmon_generic.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
perfmon_itanium.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
perfmon_mckinley.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
perfmon.c [IA64] Use "PER_CPU" form of EXPORT macro 2005-05-31 10:38:32 -07:00
process.c [IA64] Fix two warnings introduced by perfmon patches. 2005-05-03 16:21:45 -07:00
ptrace.c [PATCH] ia64: fix floating-point preemption problem 2005-06-08 16:21:14 -07:00
sal.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
salinfo.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
semaphore.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
setup.c [IA64] Update comment to describe modes set in default control register. 2005-06-08 12:12:48 -07:00
sigframe.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
signal.c [IA64] fix ia64 syscall auditing 2005-05-03 13:45:39 -07:00
smp.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
smpboot.c [IA64] fix "section mismatch" compile-time-error 2005-05-09 10:16:17 -07:00
sys_ia64.c [IA64] sys_mmap doesn't follow posix.1 when parameter len=0 2005-05-26 10:19:07 -07:00
time.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
topology.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
traps.c [PATCH] ia64: fix floating-point preemption problem 2005-06-08 16:21:14 -07:00
unaligned.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
unwind_decoder.c Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
unwind_i.h Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
unwind.c [IA64] Tighten up unw_unwind_to_user check 2005-04-25 11:45:26 -07:00
vmlinux.lds.S Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00