Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx2

AVX 512 vs AVX2 performance for simple array processing loops [closed]

Dec 04, 2022

performance x86 micro-optimization avx2 avx512

Update Tensorflow binary in virtual environment in PyCharm to use AVX2

Nov 07, 2022

python tensorflow pycharm avx2

Unpacking 8 to 16-bit using SIMD: AVX2 version mixes up the order

Nov 08, 2022

c++ simd sse avx2

optimize unaligned SSE2/AVX2 XOR

Nov 05, 2022

c optimization memory-alignment sse2 avx2

Is it possible to popcount __m256i and store result in 8 32-bit words instead of the 4 64-bit using Wojciech Mula algorithm's?

Oct 18, 2022

c++ intel sse avx avx2

Converting to and from __m256i and std::vector<uint32_t>

Oct 17, 2022

c++ intel simd intrinsics avx2

Is there any data on the latency of an AVX2 gather instruction?

Oct 05, 2022

performance x86 latency micro-optimization avx2

Extract bits with SIMD

Dec 24, 2020

intel x86 bit-manipulation simd intrinsics avx2

What is packed and unpacked and extended packed data

Feb 10, 2022

cpu-architecture sse simd avx avx2

AVX2 code slower then without AVX2

Nov 19, 2021

intel c++ performance x86 avx2

Error: suffix or operands invalid for `vbroadcastss'

Dec 11, 2019

python compiler-errors avx avx2

How can I convert a vector of float to short int using avx instructions?

Oct 18, 2022

c++ c gcc avx avx2

Using values from `__m256i` to access an array efficiently - SIMD [closed]

May 26, 2022

c++ arrays simd avx2

What is the inverse of "_mm256_cvtepi16_epi32"

Feb 14, 2022

x86 g++ intrinsics avx avx2

Why does Tensorflow warn about AVX2 while I am using MKL?

Jun 06, 2022

tensorflow keras anaconda intel-mkl avx2

Optimize extraction of 64 bit value from AVX2 register

Oct 31, 2015

c sse avx avx2

Emulating shifts on 32 bytes with AVX

Sep 24, 2020

c++ simd intrinsics sse2 avx2

Fastest way to multiply an array of int64_t?

Nov 27, 2016

c vectorization multiplication avx avx2

AVX 256-bit code performing slightly worse than equivalent 128-bit SSSE3 code

Jun 06, 2022

c++ performance sse avx2

« Newer Entries Older Entries »