Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx2

What's the difference between vextracti128 and vextractf128?

Aug 24, 2022

x86 simd avx avx2

Why does storing to and loading from an AVX2 256bit vector have different results in debug and release mode? [duplicate]

Jan 09, 2022

rust compiler-optimization simd avx2

Aligned and unaligned memory access with AVX/AVX2 intrinsics

Dec 14, 2018

gcc avx avx2

What's the fastest stride-3 gather instruction sequence?

Mar 29, 2021

c++ x86 vectorization avx2

How to clear the upper 128 bits of __m256 value?

May 07, 2022

c x86 simd avx avx2

Load address calculation when using AVX2 gather instructions

Feb 15, 2022

x86 sse simd avx2

Can I use the AVX FMA units to do bit-exact 52 bit integer multiplications?

Jun 21, 2022

floating-point x86 simd avx2 fma

Scatter intrinsics in AVX

Feb 20, 2022

intrinsics avx avx2

AVX2: Computing dot product of 512 float arrays

Apr 07, 2022

c++ simd avx2 dot-product fma

Transpose an 8x8 float using AVX/AVX2

Feb 24, 2022

simd avx avx2

How to find the horizontal maximum in a 256-bit AVX vector

Mar 28, 2014

x86 simd avx vector-processing avx2

Haswell memory access

Sep 12, 2022

performance x86 cpu-architecture avx2 intel-pmu

How are the gather instructions in AVX2 implemented?

Dec 26, 2021

intel ram simd avx avx2

In what situation would the AVX2 gather instructions be faster than individually loading the data?

Sep 14, 2021

assembly optimization x86 vectorization avx2

How to tell if a Linux machine supports AVX/AVX2 instructions?

Aug 31, 2022

linux unix avx suse avx2

Why is Intel Haswell XEON CPU sporadically miscomputing FFTs and ART?

Sep 25, 2022

intel cpu-architecture processor avx2

AVX2 what is the most efficient way to pack left based on a mask?

Sep 05, 2022

c++ vectorization sse simd avx2

« Newer Entries