Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in simd

AVX 256-bit equivalent for _mm_load1_ps

Mar 14, 2018

simd intrinsics avx

Loading non contiguous values with Intel SIMD SSE

May 28, 2021

assembly x86 intel sse simd

AVX-512 and Branching

Apr 08, 2022

x86 fortran vectorization simd avx512

Which assemblers currently support the AVX instruction set?

Aug 12, 2022

x86 assembly simd avx intel

Shifting SSE/AVX registers 32 bits left and right while shifting in zeros

Nov 27, 2018

x86 sse simd avx avx2

Efficient way of rotating a byte inside an AVX register

Mar 06, 2022

c sse simd avx avx2

Count leading zero bits for each element in AVX2 vector, emulate _mm256_lzcnt_epi32

Mar 17, 2022

bit-manipulation simd avx avx2 avx512

How to optimize C-code with SSE-intrinsics for packed 32x32 => 64-bit multiplies, and unpacking the halves of those results for (Galois Fields)

Mar 23, 2022

c optimization x86 sse simd

SSE multiplication of 2 64-bit integers

Jan 29, 2021

x86 sse simd multiplication sse2

Does Haskell perfom SIMD optimizations automatically?

Apr 28, 2021

haskell simd

Profiling SIMD Code

Apr 09, 2022

c++ c sse simd

Optimal SIMD algorithm to rotate or transpose an array

Nov 26, 2020

assembly intel simd transpose avx2

How can I set __m128i without using of any SSE instruction?

Aug 22, 2022

c++ constants sse simd sse2

SSE2 code optimization

Nov 04, 2018

c++ sse simd intrinsics sse2

How to square two complex doubles with 256-bit AVX vectors?

Oct 14, 2022

c simd complex-numbers intrinsics avx

What do you do without fast gather and scatter in AVX2 instructions?

Sep 05, 2022

algorithm performance optimization simd avx2

C++ Adding 2 arrays together quickly

Oct 26, 2022

c++ performance arrays micro-optimization simd

SSE instructions to add all elements of an array [duplicate]

Aug 21, 2022

c++ arrays sse simd sse2

Can counting byte matches between two strings be optimized using SIMD?

Dec 17, 2016

c++ optimization x86-64 sse simd

« Newer Entries Older Entries »