Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx2

Fast modulo-12 algorithm for 4 uint16_t's packed in a uint64_t

May 15, 2022

c algorithm vectorization modulo avx2

What do you do without fast gather and scatter in AVX2 instructions?

Sep 05, 2022

algorithm performance optimization simd avx2

How to implement an efficient _mm256_madd_epi8?

Nov 09, 2021

c++ x86 simd intrinsics avx2

Efficient implementation of log2(__m256d) in AVX2

Sep 16, 2022

c++ algorithm floating-point logarithm avx2

Parallel programming using Haswell architecture [closed]

Apr 12, 2015

sse cpu-architecture avx avx2

How can I add together two SSE registers

Oct 04, 2022

c++ c intel sse avx2

Efficient way to set first N or last N bits of __m256i to 1, the rest to 0

Oct 31, 2020

c++ bit-manipulation vectorization x86-64 avx2

Fastest way to unpack 32 bits to a 32 byte SIMD vector

Jan 01, 2017

x86 simd avx bitmask avx2

Do all CPUs which support AVX2 also support SSE4.2 and AVX?

Nov 13, 2022

sse simd avx avx2

AVX2 slower than SSE on Haswell

May 18, 2017

c++ x86 sse simd avx2

Is this incorrect code generation with arrays of __m256 values a clang bug?

Mar 12, 2019

c++ clang compiler-optimization avx2

Packing and de-interleaving two __m256 registers

Apr 18, 2022

c++ x86 simd avx avx2

Fallback implementation for conflict detection in AVX2

Apr 20, 2022

c++ x86 intrinsics avx2 avx512

Why both? vperm2f128 (avx) vs vperm2i128 (avx2)

Nov 15, 2022

intel simd avx avx2

Where is VPERMB in AVX2?

Nov 15, 2022

assembly x86 intel sse avx2

Is it possible to use SIMD instructions in Rust?

Feb 07, 2022

rust simd avx avx2

is there an inverse instruction to the movemask instruction in intel avx2?

Dec 05, 2021

x86 intrinsics avx avx2 icc

Fastest Implementation of Exponential Function Using AVX

Sep 14, 2019

x86 simd avx exponential avx2

« Newer Entries Older Entries »