In order to use the functions provided by this module, you need to import this module:
>>> import HardwareIntrinsics
These intrinsic functions are only available if your CPU supports Ssse3
features.
mm_abs_epi16
Compute the absolute value of packed 16-bit integers in "a", and store the unsigned results in "dst".
__m128i _mm_abs_epi16 (__m128i a) PABSW xmm, xmm/m128
mm_abs_epi32
Compute the absolute value of packed 32-bit integers in "a", and store the unsigned results in "dst".
__m128i _mm_abs_epi32 (__m128i a) PABSD xmm, xmm/m128
mm_abs_epi8
Compute the absolute value of packed 8-bit integers in "a", and store the unsigned results in "dst".
__m128i _mm_abs_epi8 (__m128i a) PABSB xmm, xmm/m128
mm_alignr_epi8
Concatenate 16-byte blocks in "a" and "b" into a 32-byte temporary result, shift the result right by "count" bytes, and store the low 16 bytes in "dst".
__m128i _mm_alignr_epi8 (__m128i a, __m128i b, int count) PALIGNR xmm, xmm/m128, imm8
mm_hadd_epi16
Horizontally add adjacent pairs of 16-bit integers in "a" and "b", and pack the signed 16-bit results in "dst".
__m128i _mm_hadd_epi16 (__m128i a, __m128i b) PHADDW xmm, xmm/m128
mm_hadd_epi32
Horizontally add adjacent pairs of 32-bit integers in "a" and "b", and pack the signed 32-bit results in "dst".
__m128i _mm_hadd_epi32 (__m128i a, __m128i b) PHADDD xmm, xmm/m128
mm_hadds_epi16
Horizontally add adjacent pairs of 16-bit integers in "a" and "b" using saturation, and pack the signed 16-bit results in "dst".
__m128i _mm_hadds_epi16 (__m128i a, __m128i b) PHADDSW xmm, xmm/m128
mm_hsub_epi16
Horizontally subtract adjacent pairs of 16-bit integers in "a" and "b", and pack the signed 16-bit results in "dst".
__m128i _mm_hsub_epi16 (__m128i a, __m128i b) PHSUBW xmm, xmm/m128
mm_hsub_epi32
Horizontally subtract adjacent pairs of 32-bit integers in "a" and "b", and pack the signed 32-bit results in "dst".
__m128i _mm_hsub_epi32 (__m128i a, __m128i b) PHSUBD xmm, xmm/m128
mm_hsubs_epi16
Horizontally subtract adjacent pairs of 16-bit integers in "a" and "b" using saturation, and pack the signed 16-bit results in "dst".
__m128i _mm_hsubs_epi16 (__m128i a, __m128i b) PHSUBSW xmm, xmm/m128
mm_maddubs_epi16
Vertically multiply each unsigned 8-bit integer from "a" with the corresponding signed 8-bit integer from "b", producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in "dst".
__m128i _mm_maddubs_epi16 (__m128i a, __m128i b) PMADDUBSW xmm, xmm/m128
mm_mulhrs_epi16
Multiply packed 16-bit integers in "a" and "b", producing intermediate signed 32-bit integers. Truncate each intermediate integer to the 18 most significant bits, round by adding 1, and store bits [16:1] to "dst".
__m128i _mm_mulhrs_epi16 (__m128i a, __m128i b) PMULHRSW xmm, xmm/m128
mm_shuffle_epi8
Shuffle packed 8-bit integers in "a" according to shuffle control mask in the corresponding 8-bit element of "b", and store the results in "dst".
__m128i _mm_shuffle_epi8 (__m128i a, __m128i b) PSHUFB xmm, xmm/m128
mm_sign_epi16
Negate packed 16-bit integers in "a" when the corresponding signed 16-bit integer in "b" is negative, and store the results in "dst". Element in "dst" are zeroed out when the corresponding element in "b" is zero.
__m128i _mm_sign_epi16 (__m128i a, __m128i b) PSIGNW xmm, xmm/m128
mm_sign_epi32
Negate packed 32-bit integers in "a" when the corresponding signed 32-bit integer in "b" is negative, and store the results in "dst". Element in "dst" are zeroed out when the corresponding element in "b" is zero.
__m128i _mm_sign_epi32 (__m128i a, __m128i b) PSIGND xmm, xmm/m128
mm_sign_epi8
Negate packed 8-bit integers in "a" when the corresponding signed 8-bit integer in "b" is negative, and store the results in "dst". Element in "dst" are zeroed out when the corresponding element in "b" is zero.
__m128i _mm_sign_epi8 (__m128i a, __m128i b) PSIGNB xmm, xmm/m128