Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in avx

Is NOT missing from SSE, AVX?

How to solve the 32-byte-alignment issue for AVX load/store operations?

Transpose an 8x8 float using AVX/AVX2

simd avx avx2

How to find the horizontal maximum in a 256-bit AVX vector

AVX VMOVDQA slower than two SSE MOVDQA?

How to sum __m256 horizontally?

Loop unrolling to achieve maximum throughput with Ivy Bridge and Haswell

c++ x86 intel sse avx

Does ICC satisfy C99 specs for multiplication of complex numbers?

How to rotate an SSE/AVX vector

c x86 sse intrinsics avx

Disable AVX-optimized functions in glibc (LD_HWCAP_MASK, /etc/ld.so.nohwcap) for valgrind & gdb record

linux linker gdb glibc avx

Choice between aligned vs. unaligned x86 SIMD instructions

x86 sse simd avx avx512

How to use the Intel AVX in Java?

java simd avx

How are the gather instructions in AVX2 implemented?

intel ram simd avx avx2

How to write portable simd code for complex multiplicative reduction

c++ c gcc simd avx

Intel AVX: 256-bits version of dot product for double precision floating point variables

c++ performance simd avx

Is there a version of TensorFlow not compiled for AVX instructions?

python tensorflow avx

What are the best instruction sequences to generate vector constants on the fly?

assembly x86 sse simd avx

Are different mmx, sse and avx versions complementary or supersets of each other?

x86 sse avx mmx

What's missing/sub-optimal in this memcpy implementation?

c optimization x86 simd avx

How to tell if a Linux machine supports AVX/AVX2 instructions?

linux unix avx suse avx2