Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

Loop unrolling to achieve maximum throughput with Ivy Bridge and Haswell

c++ x86 intel sse avx

Is it fair to compare SSE/AVX units to GPU cores?

cuda hardware opencl gpu sse

Fastest way to compute absolute value using SSE

Can't get over 50% max. theoretical performance on matrix multiply

c optimization matrix openmp sse

SSE 4 instructions generated by Visual Studio 2013 Update 2 and Update 3

How to rotate an SSE/AVX vector

c x86 sse intrinsics avx

Why do some SSE "mov" instructions specify that they move floating-point values?

assembly x86 sse

How to implement "_mm_storeu_epi64" without aliasing problems?

Should I use SIMD or vector extensions or something else?

c++ gcc sse simd

Choice between aligned vs. unaligned x86 SIMD instructions

x86 sse simd avx avx512

SSE multiplication of 4 32-bit integers

x86 sse simd multiplication sse2

SSE: Difference between _mm_load/store vs. using direct pointer access

x86 sse simd

inlining failed in call to always_inline ‘_mm_mullo_epi32’: target specific option mismatch

c cmake x86 sse simd

Fast dot product of a bit vector and a floating point vector

Get member of __m128 by index?

c++ clang sse simd intrinsics

difference between MMX and XMM register?

assembly x86 x86-64 sse mmx

print a __m128i variable

c assembly sse simd intrinsics

Benefits of x87 over SSE

x86 x86-64 sse fpu x87

Newton Raphson with SSE2 - can someone explain me these 3 lines

c++ c math sse newtons-method

How to implement atoi using SIMD?

c++ x86 sse simd atoi