New posts in sse

difference between MMX and XMM register?

assembly x86 x86-64 sse mmx

print a __m128i variable

c assembly sse simd intrinsics

Benefits of x87 over SSE

x86 x86-64 sse fpu x87

Newton Raphson with SSE2 - can someone explain me these 3 lines

c++ c math sse newtons-method

How to implement atoi using SIMD?

c++ x86 sse simd atoi

Using SSE instructions

Why is my hand-tuned, SSE-enabled code so slow?

c++ optimization opencv sse

What are the best instruction sequences to generate vector constants on the fly?

assembly x86 sse simd avx

best cross-platform method to get aligned memory

Can one construct a "good" hash function using CRC32C as a base?

hash intel sse crc32

Are different mmx, sse and avx versions complementary or supersets of each other?

x86 sse avx mmx

SSE instructions: which CPUs can do atomic 16B memory operations?

Difference between MOVDQA and MOVAPS x86 instructions?

assembly x86 sse simd mov intel

Intel SSE and AVX Examples and Tutorials [closed]

intel sse vectorization avx

What does ordered / unordered comparison mean?

Why is strcmp not SIMD optimized?

c++ sse simd strcmp sse2

AVX2 what is the most efficient way to pack left based on a mask?

c++ vectorization sse simd avx2

Why does mulss take only 3 cycles on Haswell, different from Agner's instruction tables? (Unrolling FP loops with multiple accumulators)

Using AVX intrinsics instead of SSE does not improve speed -- why?

c++ performance gcc sse avx

How to use Fused Multiply-Add (FMA) instructions with SSE/AVX

c sse cpu-architecture avx fma