New posts in sse

For XMM/YMM FP operation on Intel Haswell, can FMA be used in place of ADD?

Jul 12, 2018

sse avx throughput flops fma

What is the difference between these 128bit SIMD xor operations

Aug 30, 2022

simd sse intrinsics sse2

Determine cause of segfault when using -O3?

Mar 20, 2022

c++ gdb sse gcc4.9

access violation _mm_store_si128 SSE Intrinsics

Feb 28, 2019

intel c++ x86 simd sse intrinsics

AVX scalar operations are much faster

Aug 06, 2022

intel c memory x86 sse avx

Most efficient way to convert vector of uint32 to vector of float?

Feb 27, 2022

intel assembly floating-point x86 sse

SSE2 instruction to typecast an integer register to short register and vice-versa

Jul 14, 2022

x86 sse simd sse2

Is there a way to utilize all XMM registers?

Oct 26, 2022

c++ c compiler-construction sse

Implement a near real-time CPU capability like glAlphaFunc(GL_GREATER) with RGB source and RGBA overlay

Jun 21, 2022

c++ opengl assembly sse rgba

Setting last or first n bits in SSE register

Nov 15, 2021

c++ x86 sse simd intrinsics

Translating SSE to Neon: How to pack and then extract 32bit result

Jan 15, 2022

c++ arm sse neon intrinsics

AVX/SSE round floats down and return vector of ints?

Jul 23, 2022

c++ intel sse intrinsics avx

Shuffle AVX 256 Vector elements by 1 position left/right - C intrinsics

Apr 16, 2018

c sse hpc intrinsics avx

Why does AES in SSE not provide full function?

Mar 15, 2019

assembly x86 aes sse instruction-set

glibc and SSE functionality

Jun 03, 2022

c performance sse

Storing individual doubles from a packed double vector using Intel AVX

Sep 07, 2022

x86 x86-64 sse avx

bool judgement is so slow? [closed]

Aug 09, 2017

c++ c optimization sse

Why movlps and movhps SSE instructions are faster than movups for transferring misaligned data?

Feb 08, 2018

optimization assembly sse

how invert __m128 into ints

Dec 06, 2020

c++ sse

AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code

Jun 06, 2022

c++ performance sse avx2
