Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in sse

Faster assembly optimized way to convert between RGB8 and RGB32 image

Is there still any development on SIMD in Mono?

c# mono sse simd

Matrix-vector-multiplication in AVX not proportionately faster than in SSE

Print value of __m128 datatype in gdb debugger

c++ gdb sse simd intrinsics

How to convert 'long long' (or __int64) to __m64

Bypass delays when switching execution unit domains

assembly intel sse

Optimal SSE unsigned 8 bit compare

c x86 sse simd sse4

Questions regarding operations on NaN

SSE intrinsics - comparison if/else optimization

c++ sse intrinsics

Fastest way to compare one byte array with many others?

c algorithm assembly x86-64 sse

Fast transposition of an image and Sobel Filter optimization in C (SIMD)

c optimization sse simd

SSE: unaligned load and store that crosses page boundary

"Safe" SIMD arithmetic on aligned vectors of odd size?

Loading non contiguous values with Intel SIMD SSE

assembly x86 intel sse simd

SSE with doubles, not worth it?

Shifting SSE/AVX registers 32 bits left and right while shifting in zeros

x86 sse simd avx avx2

Efficient way of rotating a byte inside an AVX register

c sse simd avx avx2

How compilers treat SSE (or any) intrinsic functions?

SSE: reciprocal if not zero

c normalization sse

_mm_shuffle_ps() equivalent for integer vectors (__m128i)?

c sse