x86: Add 1/2/4/8 byte optimization to 64bit __copy_{from,to}_user_inatomic