New posts in sse

SSE multiplication 16 x uint8_t

x86 sse simd sse4

Why does vectorization behave differently for almost the same code?

Computing Hamming distances to several strings with SSE

c gcc sse simd hamming-distance

SSE register return with SSE disabled

c gcc floating-point sse

Looking for sse 128 bit shift operation for non-immediate shift value

c++ c sse

Which versions of Windows support/require which CPU multimedia extensions? (How to check if SSE or AVX are fully usable?)

windows assembly sse avx avx512

Why are there 128bit load functions for SSE?

c++ x86 sse simd intrinsics

Look-Up Table using SIMD

c++ sse simd

Are older SIMD-versions available when using newer ones?

c++ c sse simd avx

Find index of maximum element in x86 SIMD vector

c++ x86 sse simd avx intel

practical BigNum AVX/SSE possible?

Is SSE floating-point arithmetic reproducible?

SIMD latency throughput

c++ performance x86 sse simd

Speed up float 5x5 matrix * vector multiplication with SSE

Flipping sign on packed SSE floats

Constexpr and SSE intrinsics

An SSE Stdlib-esque Library?

c++ c visual-c++ assembly sse

Best way to load a 64-bit integer to a double precision SSE2 register?

assembly double sse sse2 int64

Get index of first element that is not zero in a __m256 variable

c++ c sse simd avx

Does rewriting memcpy/memcmp/... with SIMD instructions make sense?

performance sse simd