perf bench: Fix memcpy benchmark for large sizes
authorAndi Kleen <ak@linux.intel.com>
Thu, 18 Jul 2013 22:33:46 +0000 (15:33 -0700)
committerArnaldo Carvalho de Melo <acme@redhat.com>
Mon, 22 Jul 2013 15:41:56 +0000 (12:41 -0300)
The glibc calloc() function has an optimization to not explicitely
memset() very large calloc allocations that just came from mmap(),
because they are known to be zero.

This could result in the perf memcpy benchmark reading only from
the zero page, which gives unrealistic results.

Always call memset explicitly on the source area to avoid this problem.

Signed-off-by: Andi Kleen <ak@linux.intel.com>
Cc: Hitoshi Mitake <h.mitake@gmail.com>
Cc: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Link: http://lkml.kernel.org/n/tip-pzz2qrdq9eymxda0y8yxdn33@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
tools/perf/bench/mem-memcpy.c

index 25fd3f1..8cdca43 100644 (file)
@@ -117,6 +117,8 @@ static void alloc_mem(void **dst, void **src, size_t length)
        *src = zalloc(length);
        if (!*src)
                die("memory allocation failed - maybe length is too large?\n");
+       /* Make sure to always replace the zero pages even if MMAP_THRESH is crossed */
+       memset(*src, 0, length);
 }
 
 static u64 do_memcpy_cycle(memcpy_t fn, size_t len, bool prefault)