Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

SSE intrinsic over int16[8] to extract the sign of each element

c x86 sse simd sign

128bit hash comparison with SSE

How to perform uint32/float conversion with SSE?

c x86 sse simd

SSE intrinsics: Convert 32-bit floats to UNSIGNED 8-bit integers

x86 sse mmx

What does UnsignedSaturate in SSE instruction mean?

c++ c sse

SSE2 intrinsics - comparing unsigned integers

c++ x86 sse simd intrinsics

Uses of the monitor/mwait instructions

Minimum and maximum of signed zero

Best way to shuffle 64-bit portions of two __m128i's

intel sse simd intrinsics

Java performance in numerical algorithms

SSE Code runs 30% faster, yet when in use show over 20% CPU increase

c sse

Using ymm registers as a "memory-like" storage location

assembly x86 sse avx

efficient way to convert scatter indices into gather indices?

Permuting bytes inside SSE __m128i register

optimization sse simd

How to merge a scalar into a vector without the compiler wasting an instruction zeroing upper elements? Design limitation in Intel's intrinsics?

c gcc x86 sse intrinsics

Can PTEST be used to test if two registers are both zero or some other condition?

assembly x86 sse intrinsics sse4

libc's system() when the stack pointer is not 16-padded causes segmentation fault

Neon equivalent to SSE intrinsics

c arm sse multiplication neon