New posts in simd
Can AVX2 be used to implement faster processing of LZCNT on a word array?
Oct 05, 2020
x86
simd
avx
micro-optimization
avx2
How to make premultiplied alpha function faster using SIMD instructions?
Jun 04, 2022
c++
x86
sse
simd
avx
SIMD (AVX) compare
Nov 07, 2022
c
gcc
sse
simd
Minimum of 4 SP values in __m128
May 31, 2022
c
sse
simd
Compiling SSE intrinsics in GCC gives an error
Oct 31, 2018
gcc
x86
intel
sse
simd
Why use SIMD if we have GPGPU? [closed]
Nov 02, 2022
cuda
gpgpu
simd
computer-architecture
cpu-architecture
AVX2, How to Efficiently Load Four Integers to Even Indices of a 256 Bit Register and Copy to Odd Indices?
Oct 07, 2018
x86
sse
simd
avx
avx2
Why are SIMD instructions not used in kernel?
Sep 10, 2022
linux-kernel
operating-system
linux-device-driver
simd
ispc
How to convert 32-bit float to 8-bit signed char? (4:1 packing of int32 to int8 __m256i)
Jan 24, 2022
c
x86
simd
intrinsics
avx2
Summing 3 lanes in a NEON float32x4_t
Dec 01, 2020
ios
arm
simd
neon
intrinsics
What is the difference between MOVDQA and MOVNTDQA, and VMOVDQA and VMOVNTDQ for WB/WC marked region?
Jun 09, 2020
assembly
x86
sse
simd
avx
AVX2 VPSHUFB emulation in AVX
Oct 01, 2018
x86
simd
intrinsics
avx
_mm_alignr_epi8 (PALIGNR) equivalent in AVX2
Sep 01, 2020
x86
simd
intrinsics
avx
avx2
How do you move 128-bit values between XMM registers?
Feb 17, 2020
assembly
simd
sse
Setting __m256i to the value of two __m128i values
Mar 30, 2019
c
sse
simd
avx
Loading 8 chars from memory into an __m256 variable as packed single precision floats
Jun 17, 2021
c++
sse
simd
avx
avx2
Shuffling by mask with Intel AVX
Mar 08, 2022
c++
sse
simd
intrinsics
avx
Control flow divergence in SIMT and SIMD
May 11, 2022
cuda
sse
simd
Are there SIMD (SSE / AVX) instructions in the x86-compatible Intel Xeon Phi accelerators?
Nov 02, 2022
intel
sse
simd
avx
intel-mic
Faster lookup tables using AVX2
May 07, 2022
algorithm
performance
optimization
sse
simd