Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in sse

How to get the number of unique elements of a simd vector in C

Oct 22, 2025

c simd sse

First use of AVX 256-bit vectors slows down 128-bit vector and AVX scalar ops

Oct 23, 2025

assembly x86-64 sse simd avx

Aligning memory on 16-byte and 32-byte boundaries

Oct 22, 2025

memory alignment sse simd avx

Why is masking needed before using a pshufb shuffle as a lookup table for nibbles?

Oct 22, 2025

c++ simd sse avx avx2

performance of SSE and AVX when both Memory-band width limited

Oct 22, 2025

performance caching sse avx

Set an XMM register to a repeating byte pattern (broadcast a constant byte)

Oct 21, 2025

assembly sse micro-optimization sse2

How to best emulate the logical meaning of _mm_slli_si128 (128-bit bit-shift), not _mm_bslli_si128

Oct 22, 2025

c sse simd intrinsics sse2

Aliasing of NEON vector data types

Oct 21, 2025

c++ c sse simd neon

Meaning of XMM register values shown in Visual Studio debugger's register window

Oct 19, 2025

visual-studio sse visual-studio-debugging cpu-registers

How to convert int 64 to int 32 with avx (but without avx-512)

Oct 19, 2025

simd sse avx

Why does __m128 cause alignment issues in a union with float x/y/z?

Oct 18, 2025

c simd sse unions memory-alignment

Out-of-range floating point to integer conversion breaks in VS2022 executable when linking VS2017 or VS2019 libraries

Oct 17, 2025

c visual-c++ floating-point sse floating-point-conversion

optimising column-wise maximum with SIMD

Oct 17, 2025

c++ sse simd intrinsics avx

Should you pass __m128 (and other register types) by reference or by copy?

Sep 23, 2025

c++ simd sse intrinsics

average operation ARM NEON

Sep 22, 2025

arm sse simd neon intrinsics

How to compile a project which requires SSE2 on MacBook with M1 chip?

Sep 18, 2025

sse apple-m1 vector-class-library

Why is SIMD slower than scalar counterpart

Sep 16, 2025

assembly x86 sse simd

CVTTSD2SI - a truncating instruction - uses rounding with "inexact" results?

Sep 16, 2025

assembly x86 sse floating-point-conversion

How to store 4 32 bit floats into one 128 bit xmm register?

Sep 14, 2025

assembly x86 x86-64 sse simd

gcc vector extensions don't work as stated in docs

Sep 13, 2025

gcc sse vectorization

« Newer Entries Older Entries »