New posts in avx
SIMD string to unsigned int parsing in C# performance improvement
Mar 18, 2022
c#
sse
simd
avx
system.numerics
are static / static local SSE / AVX variables blocking a xmm / ymm register?
Aug 17, 2022
c++
sse
avx
vectorized sum in Fortran
Mar 11, 2022
fortran
sse
gfortran
simd
avx
Ensuring that Eigen uses AVX vectorization for a certain operation
Sep 05, 2022
c++
vectorization
eigen
simd
avx
How are AVX registers handled by the common calling conventions?
Aug 16, 2022
windows
linux
calling-convention
avx
Potential bug in Visual Studio C compiler or in Intel Intrinsics' AVX2 "_mm256_set_epi64x" function
Oct 10, 2020
c++
visual-studio
intrinsics
avx
compiler-bug
Copying 64 bytes of memory with NT stores to one full cache line vs. 2 consecutive partial cache lines
Apr 05, 2022
c
performance
assembly
x86
avx
Why two bitwise or AVX instructions? [duplicate]
Jan 20, 2020
x86
bit-manipulation
avx
instructions
bitwise-or
Can I generate AVX vectorized code using LLVM jit?
Feb 15, 2020
x86
llvm
jit
avx
find nan in array of doubles using simd
Jun 13, 2022
c
nan
sse
simd
avx
How to store lower or higher values from AVX/AVX2(YMM) register to memory like the SSE movlps/movhps does?
Feb 21, 2017
x86
sse
simd
avx
avx2
Small branches in modern CPUs
May 23, 2022
performance
x86-64
cpu-architecture
avx
branch-prediction
SIMD minmag and maxmag
Feb 12, 2022
assembly
floating-point
x86
sse
avx
The indices of non-zero bytes of an SSE/AVX register
Feb 06, 2022
c++
c
sse
simd
avx
perf report shows this function "__memset_avx2_unaligned_erms" has overhead. does this mean memory is unaligned?
Oct 17, 2020
c++
profiling
avx
perf
avx2
Can AVX2 be used to implement faster LZCNT processing on a word array?
Oct 05, 2020
x86
simd
avx
micro-optimization
avx2
How to make premultiplied alpha function faster using SIMD instructions?
Jun 04, 2022
c++
x86
sse
simd
avx
128-bit SSE counter?
Jun 30, 2017
optimization
sse
intrinsics
avx
AVX2, How to Efficiently Load Four Integers to Even Indices of a 256 Bit Register and Copy to Odd Indices?
Oct 07, 2018
x86
sse
simd
avx
avx2