Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in sse
For XMM/YMM FP operation on Intel Haswell, can FMA be used in place of ADD?
Jul 12, 2018
sse
avx
throughput
flops
fma
What is the difference between these 128bit SIMD xor operations
Aug 30, 2022
simd
sse
intrinsics
sse2
Determine cause of segfault when using -O3?
Mar 20, 2022
c++
gdb
sse
gcc4.9
access violation _mm_store_si128 SSE Intrinsics
Feb 28, 2019
intel
c++
x86
simd
sse
intrinsics
AVX scalar operations are much faster
Aug 06, 2022
intel
c
memory
x86
sse
avx
Most efficient way to convert vector of uint32 to vector of float?
Feb 27, 2022
intel
assembly
floating-point
x86
sse
SSE2 instruction to typecast an integer register to short register and vice-versa
Jul 14, 2022
x86
sse
simd
sse2
Is there a way to utilize all XMM registers?
Oct 26, 2022
c++
c
compiler-construction
sse
Implement a near real-time CPU capability like glAlphaFunc(GL_GREATER) with RGB source and RGBA overlay
Jun 21, 2022
c++
opengl
assembly
sse
rgba
Setting last or first n bits in SSE register
Nov 15, 2021
c++
x86
sse
simd
intrinsics
Translating SSE to Neon: How to pack and then extract 32bit result
Jan 15, 2022
c++
arm
sse
neon
intrinsics
AVX/SSE round floats down and return vector of ints?
Jul 23, 2022
c++
intel
sse
intrinsics
avx
Shuffle AVX 256 Vector elements by 1 position left/right - C intrinsics
Apr 16, 2018
c
sse
hpc
intrinsics
avx
Why does AES in SSE not provide full function?
Mar 15, 2019
assembly
x86
aes
sse
instruction-set
glibc and SSE functionality
Jun 03, 2022
c
performance
sse
Storing individual doubles from a packed double vector using Intel AVX
Sep 07, 2022
x86
x86-64
sse
avx
bool judgement is so slow? [closed]
Aug 09, 2017
c++
c
optimization
sse
Why movlps and movhps SSE instructions are faster than movups for transferring misaligned data?
Feb 08, 2018
optimization
assembly
sse
how invert __m128 into ints
Dec 06, 2020
c++
sse
AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code
Jun 06, 2022
c++
performance
sse
avx2
« Newer Entries
Older Entries »