perf: Optimize perf_output_*() by avoiding local_xchg()