New posts in simd
Writing a piece of C code such that the compiler uses SSE4.1 instructions when generating assembly code
Mar 22, 2026
c
optimization
gcc
sse
simd
xtensor and xsimd: improve performance on reduction
Mar 22, 2026
python
c++
numpy
simd
xtensor
Emulating shifts on 64 bytes with AVX-512
Mar 22, 2026
simd
avx512
Euclidean distance using intrinsic instruction
Mar 19, 2026
c++
sse
simd
euclidean-distance
Broadcast one arbitrary element of __m128 vector
Mar 18, 2026
c++
x86
sse
simd
sse2
Seeded Random Uniform float generator using SIMD? [duplicate]
Mar 18, 2026
c++
random
floating-point
simd
avx
SSE2 8x8 byte-matrix transpose code twice as slow on Haswell+ than on Ivy Bridge
Mar 17, 2026
performance
assembly
x86
sse
simd
Loop is not vectorized when variable extent is used
Mar 16, 2026
c++
vectorization
sse
compiler-optimization
simd
SIMD transpose when row size is greater than vector width
Mar 15, 2026
matrix
transpose
simd
avx
avx2
Does using SIMD have an initialisation cost?
Mar 10, 2026
x86-64
simd
arm64
Sign of the maximum absolute value in an __m128, SSE4
Mar 09, 2026
c++
sse
simd
C++ load and store optimizations and heap objects
Mar 04, 2026
c++
sse
simd
AVX vs. SSE: expect to see a larger speedup
Mar 02, 2026
performance
sse
simd
avx
Is there a way to mask one end of a __m128i register based on mask length that is not known at compile time?
Mar 01, 2026
sse
simd
avx
What does the colon mean in this ARM NEON code?
Feb 26, 2026
assembly
arm
simd
neon
What are the differences between Vector256.Create and Avx2.BroadcastScalarToVector functions?
Feb 27, 2026
c#
.net
simd
avx2
Vectorize a loop that accesses non-consecutive memory locations
Feb 26, 2026
arrays
parallel-processing
vectorization
simd
fortran90