sched/numa: Load balance between remote nodes
authorAlex Shi <alex.shi@intel.com>
Wed, 6 Jun 2012 06:52:51 +0000 (14:52 +0800)
committerIngo Molnar <mingo@kernel.org>
Wed, 6 Jun 2012 14:52:25 +0000 (16:52 +0200)
commit10717dcde10d09f9fcee53a12a4236af1a82b484
treeef7fa575d2bec5631201cd72c5764f042650e382
parentceb1cbac8eda66cf0f889def226b4e82f8ff857b
sched/numa: Load balance between remote nodes

Commit cb83b629b ("sched/numa: Rewrite the CONFIG_NUMA sched
domain support") removed the NODE sched domain and started checking
if the node distance in SLIT table is farther than REMOTE_DISTANCE,
if so, it will lose the load balance chance at exec/fork/wake_affine
points.

But actually, even the node distance is farther than REMOTE_DISTANCE.

Modern CPUs also has QPI like connections, which ensures that memory
access is not too slow between nodes. So the above change in behavior
on NUMA machine causes a performance regression on various benchmarks:
hackbench, tbench, netperf, oltp, etc.

This patch will recover the scheduler behavior to old mode on all my
Intel platforms: NHM EP/EX, WSM EP, SNB EP/EP4S, and thus fixes the
perfromance regressions. (all of them just have 2 kinds distance, 10, 21)

Signed-off-by: Alex Shi <alex.shi@intel.com>
Signed-off-by: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1338965571-9812-1-git-send-email-alex.shi@intel.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
kernel/sched/core.c