Caller must ensure the type of Self is appropriate for the hardware of the execution
environment.
sourceimpl<S3: Copy, S4: Copy, NI: Copy> Machine for SseMachine<S3, S4, NI> where
u128x1_sse2<S3, S4, NI>: Swap64,
u64x2_sse2<S3, S4, NI>: BSwap + RotateEachWord32 + MultiLane<[u64; 2]> + Vec2<u64>,
u32x4_sse2<S3, S4, NI>: BSwap + RotateEachWord32 + MultiLane<[u32; 4]> + Vec4<u32>,
x2<u64x2_sse2<S3, S4, NI>, G1>: BSwap + Words4,
u128x1_sse2<S3, S4, NI>: BSwap,
x2<u128x1_sse2<S3, S4, NI>, G0>: Into<x2<u64x2_sse2<S3, S4, NI>, G0>>,
x2<u128x1_sse2<S3, S4, NI>, G0>: Into<x2<u64x2_sse2<S3, S4, NI>, G1>>,
x2<u128x1_sse2<S3, S4, NI>, G0>: Into<x2<u32x4_sse2<S3, S4, NI>, G0>>,
x4<u128x1_sse2<S3, S4, NI>>: Into<x4<u64x2_sse2<S3, S4, NI>>>,
x4<u128x1_sse2<S3, S4, NI>>: Into<x4<u32x4_sse2<S3, S4, NI>>>,