Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

eigen vectorization with arrays

sse eigen avx eigen3

Why this SSE2 program (integers) generate movaps (float)?

gcc assembly x86 sse simd

_declspec(align(16)) does not align the pointer to 16 bytes

c++ sse

SSE/AVX: Choose from two __m256 float vectors based on per-element min and max absolute value

sse intrinsics avx avx512

SSE integer 2^n powers of 2 for 32-bit integers without AVX2

c++ x86 sse simd intrinsics

Why this unnecessary MOVAPD copy in gcc 9.1, in a tiny function

developing for new instruction sets

x86 sse avx

How to perform element-wise left shift with __m128i?

c sse avx

How to find the max member in a __m128(F32vec4)

c sse simd

Add a constant value to a xmm register in x86

assembly x86 sse x87

linear search through uint64[] with SSE

c++ c search sse linear-search

Using SSE to round in Delphi

Horizontal minimum and position in SSE for unsigned 32-bit integers

Comparing quadwords in xmm

assembly x86 nasm sse

What is the purpose of a movss instruction with [this] as destination?

Tiny SSE addpd loop slightly slower than scalar on AMD Phenom II?

c++ c gcc assembly sse

Array Error - Access violation reading location 0xffffffff

c++ sse simd

Xcode debugging: how to view xmm registers in readable form

xcode debugging sse

Intel C Compiler uses unaligned SIMD moves with aligned memory

SSE inline assembly and possible g++ optimization bug

c++ matrix vector sse