Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx

Rotating (by 90°) a bit matrix (up to 8x8 bits) within a 64-bit integer

Jan 01, 2026

c++ bit-manipulation x86-64 avx micro-optimization

How to find the index of an element in the AVX vector?

Dec 29, 2025

x86 intrinsics avx

Why do bit manipulation intrinsics like _bextr_u64 often perform worse than simple shift and mask operations?

Dec 24, 2025

gcc bit-manipulation x86-64 intrinsics avx

How to sum all 32-bit or 64-bit sub-registers in an SSE XMM, or AVX YMM, and ZMM register?

Dec 20, 2025

sse simd avx

Using sse and avx intrinsics to add a set of packed singles into one value

Dec 16, 2025

c++ c++11 sse avx

Optimal uint8_t bitmap into a 8 x 32bit SIMD "bool" vector

Dec 13, 2025

c++11 simd avx avx2

Websocket data unmasking / multi byte xor

Dec 08, 2025

c x86 sse simd avx

Does VS2010 SP1 support only part of the AVX instruction set?

Dec 07, 2025

c++ visual-studio-2010 sse avx fma

Difference between _mm256_xor_si256() and _mm256_xor_ps()

Dec 07, 2025

intrinsics avx avx2

C++ AVX2 Instrinsic function Non-Standard Size

Dec 05, 2025

c++ simd intrinsics avx avx2

Different semantic of comparison intrinsic instructions in avx512?

Dec 05, 2025

c++ sse intrinsics avx avx512

Integer dot product using SSE/AVX?

Dec 03, 2025

c++ vectorization sse simd avx

Unpack 12-bit data quickly (where the nibbles aren't contiguous; how to shuffle nibbles?)

Nov 30, 2025

c# c++ avx avx2 pixelformat

Intel vector instruction to zero-extend 8 4-bit values packed in a 32-bit int to a __m256i?

Nov 25, 2025

sse avx avx2

How to implement 16 and 32 bit integer insert and extract operations with AVX-512?

Nov 22, 2025

intrinsics avx avx512

how abundant is hardware support for FMA instruction set

Nov 20, 2025

x86 hardware sse simd avx

AVX equivalent for _mm_movelh_ps

Nov 19, 2025

c++ sse intrinsics avx

Add saturate 32-bit signed ints intrinsics?

Nov 17, 2025

x86 sse intrinsics avx saturation-arithmetic

Mixing SSE with AVX128 for shorter instructions?

Nov 06, 2025

assembly x86 sse avx micro-optimization

Is there a more efficient way to broadcast 4 contiguous doubles into 4 YMM registers?

Nov 05, 2025

gcc intel simd intrinsics avx

Older Entries »