Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in sse
How to get the number of unique elements of a simd vector in C
Oct 22, 2025
c
simd
sse
First use of AVX 256-bit vectors slows down 128-bit vector and AVX scalar ops
Oct 23, 2025
assembly
x86-64
sse
simd
avx
Aligning memory on 16-byte and 32-byte boundaries
Oct 22, 2025
memory
alignment
sse
simd
avx
Why is masking needed before using a pshufb shuffle as a lookup table for nibbles?
Oct 22, 2025
c++
simd
sse
avx
avx2
performance of SSE and AVX when both Memory-band width limited
Oct 22, 2025
performance
caching
sse
avx
Set an XMM register to a repeating byte pattern (broadcast a constant byte)
Oct 21, 2025
assembly
sse
micro-optimization
sse2
How to best emulate the logical meaning of _mm_slli_si128 (128-bit bit-shift), not _mm_bslli_si128
Oct 22, 2025
c
sse
simd
intrinsics
sse2
Aliasing of NEON vector data types
Oct 21, 2025
c++
c
sse
simd
neon
Meaning of XMM register values shown in Visual Studio debugger's register window
Oct 19, 2025
visual-studio
sse
visual-studio-debugging
cpu-registers
How to convert int 64 to int 32 with avx (but without avx-512)
Oct 19, 2025
simd
sse
avx
Why does __m128 cause alignment issues in a union with float x/y/z?
Oct 18, 2025
c
simd
sse
unions
memory-alignment
Out-of-range floating point to integer conversion breaks in VS2022 executable when linking VS2017 or VS2019 libraries
Oct 17, 2025
c
visual-c++
floating-point
sse
floating-point-conversion
optimising column-wise maximum with SIMD
Oct 17, 2025
c++
sse
simd
intrinsics
avx
Should you pass __m128 (and other register types) by reference or by copy?
Sep 23, 2025
c++
simd
sse
intrinsics
average operation ARM NEON
Sep 22, 2025
arm
sse
simd
neon
intrinsics
How to compile a project which requires SSE2 on MacBook with M1 chip?
Sep 18, 2025
sse
apple-m1
vector-class-library
Why is SIMD slower than scalar counterpart
Sep 16, 2025
assembly
x86
sse
simd
CVTTSD2SI - a truncating instruction - uses rounding with "inexact" results?
Sep 16, 2025
assembly
x86
sse
floating-point-conversion
How to store 4 32 bit floats into one 128 bit xmm register?
Sep 14, 2025
assembly
x86
x86-64
sse
simd
gcc vector extensions don't work as stated in docs
Sep 13, 2025
gcc
sse
vectorization
« Newer Entries
Older Entries »