New posts in avx

Using ymm registers as a "memory-like" storage location

Dec 07, 2020

assembly x86 sse avx

Matrix-vector-multiplication in AVX not proportionately faster than in SSE

Dec 07, 2021

c++ vectorization sse matrix-multiplication avx

How to concatenate two vectors efficiently using AVX2? (a lane-crossing version of VPALIGNR)

Mar 08, 2022

c simd intrinsics avx avx2

AVX 256-bit equivalent for _mm_load1_ps

Mar 14, 2018

simd intrinsics avx

Which assemblers currently support the AVX instruction set?

Aug 12, 2022

x86 assembly simd avx intel

Difference between Intel E7 and E5 Xeon models? [closed]

Dec 03, 2021

cpu intel avx

Shifting SSE/AVX registers 32 bits left and right while shifting in zeros

Nov 27, 2018

x86 sse simd avx avx2

Efficient way of rotating a byte inside an AVX register

Mar 06, 2022

c sse simd avx avx2

Count leading zero bits for each element in AVX2 vector, emulate _mm256_lzcnt_epi32

Mar 17, 2022

bit-manipulation simd avx avx2 avx512

Have different optimizations (plain, SSE, AVX) in the same executable with C/C++

Mar 21, 2022

c++ c compiler-construction sse avx

Sorting 64-bit structs using AVX?

Sep 14, 2022

c++ intrinsics avx

How to square two complex doubles with 256-bit AVX vectors?

Oct 14, 2022

c simd complex-numbers intrinsics avx

Is _mm_broadcast_ss faster than _mm_set1_ps?

Feb 04, 2022

vectorization avx

Avoiding AVX-SSE (VEX) Transition Penalties

Sep 26, 2022

assembly x86 sse avx micro-optimization

Why is tan slower in context than when isolated?

Sep 28, 2022

c performance x86 clang avx

Select unique/deduplication in SSE/AVX

Mar 30, 2021

algorithm assembly sse simd avx

(Vec4 x Mat4x4) product using SIMD and improvements

Jan 12, 2022

c++ matrix simd avx sse3

Why not use the AVX registers as an ultra-fast cache?

Sep 13, 2022

performance assembly sse cpu-registers avx
