New posts in simd
Flipping sign on packed SSE floats
Jun 18, 2018
c
performance
optimization
sse
simd
Beating or meeting OS X memset (and memset_pattern4)
Jan 18, 2022
c
performance
optimization
assembly
simd
Constexpr and SSE intrinsics
Dec 27, 2021
c++
sse
simd
constexpr
intrinsics
Get index of first element that is not zero in a __m256 variable
Nov 09, 2022
c++
c
sse
simd
avx
What is the difference between SPMD and SIMD?
Oct 22, 2022
terminology
parallel-processing
simd
Does rewriting memcpy/memcmp/... with SIMD instructions make sense?
Sep 21, 2022
performance
sse
simd
SIMD instructions for floating point equality comparison (with NaN == NaN)
Dec 24, 2019
assembly
floating-point
x86
x86-64
simd
intel
Sum reduction of unsigned bytes without overflow, using SSE2 on Intel
Apr 12, 2018
x86
sse
simd
sse2
sse3
Fast vectorized rsqrt and reciprocal with SSE/AVX depending on precision
Oct 24, 2021
performance
sse
simd
avx
Using __m256d registers
Oct 18, 2022
c++
x86
intel
simd
avx
Load address calculation when using AVX2 gather instructions
Feb 15, 2022
x86
sse
simd
avx2
Branch and predicated instructions
Oct 27, 2022
cuda
simd
SIMD the following code
Sep 21, 2022
c
x86
sse
simd
Why does the FMA _mm256_fmadd_pd() intrinsic have 3 asm mnemonics, "vfmadd132pd", "231" and "213"?
Jan 08, 2022
assembly
x86
simd
instruction-set
fma
Can I use the AVX FMA units to do bit-exact 52 bit integer multiplications?
Jun 21, 2022
floating-point
x86
simd
avx2
fma
How can I disable vectorization while using GCC?
Sep 20, 2022
gcc
vectorization
sse
simd
auto-vectorization
Fastest way to compute distance squared
Oct 16, 2022
c
optimization
simd
How to transpose a 16x16 matrix using SIMD instructions?
Sep 20, 2022
assembly
matrix
intel
simd
avx512
How to quickly count bits into separate bins in a series of ints on Sandy Bridge? [duplicate]
Oct 31, 2022
c++
assembly
x86
simd
avx
Fast 24-bit array -> 32-bit array conversion?
Mar 04, 2022
c
bitmap
bit-manipulation
sse
simd