
New posts in simd

Using Vector<T> for SIMD in Universal Windows Platform

What are the differences between the compress and expand instructions in AVX-512?

assembly x86 simd avx512

Do I get a performance penalty when mixing SIMD instructions and multithreading

Fast byte-wise replace if

c optimization x86 sse simd

How to compare __m128 types?

x86 sse simd

SSE reduction of float vector

c++ sum sse simd reduction

SSE code to set float variable to 0.0f or 1.0f based on comparison

Horizontal XOR in AVX

c++ assembly x86 simd avx

SSE slower than FPU?

How much speed-up from converting 3D maths to SSE or other SIMD?

SIMD/SSE: How to check that all vector elements are non-zero

c++ c gcc vectorization simd

What is the penalty of mixing EVEX and VEX encoded scheme?

assembly x86 simd avx512

Can I use SIMD for speeding up string manipulation?

c++ c string optimization simd

SSE2: Double precision log function

c++ c optimization sse simd

Check XMM register for all zeroes

c++ sse simd intrinsics

SIMD vs Vector architectures

Fastest way to unpack 32 bits to a 32 byte SIMD vector

x86 simd avx bitmask avx2

Do all CPUs which support AVX2 also support SSE4.2 and AVX?

sse simd avx avx2

Storing two x86 32 bit registers into 128 bit xmm register

assembly x86 simd sse

What are the 128-bit to 512-bit registers used for?