Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in avx

Automatically generate FMA instructions in MSVC

Aug 20, 2022

c++ visual-c++ x86 avx fma

Computing 8 horizontal sums of eight AVX single-precision floating-point vectors

Aug 03, 2021

optimization intrinsics avx low-level

Efficiently gather individual bytes, separated by a byte-stride of 4

Feb 27, 2022

c intrinsics avx

Need for fast data demuxing in C# by using multi-threading, AVX, GPU or whatever

Apr 20, 2022

c# multithreading algorithm performance avx

Preventing GCC from automatically using AVX and FMA instructions when compiled with -mavx and -mfma

Jun 21, 2020

c++ gcc vectorization avx fma

Large (0,1) matrix multiplication using bitwise AND and popcount instead of actual int or float multiplies?

Mar 28, 2021

c++ sse matrix-multiplication avx bitset

How to align stack at 32 byte boundary in GCC?

Oct 23, 2022

gcc stack sse avx

How to force gcc to use all SSE (or AVX) registers?

Nov 10, 2022

gcc 64-bit sse register-allocation avx

Horizontal XOR in AVX

Apr 25, 2022

c++ assembly x86 simd avx

Do 128bit cross lane operations in AVX512 give better performance?

Mar 23, 2019

performance x86 intel avx avx512

Parallel programming using Haswell architecture [closed]

Apr 12, 2015

sse cpu-architecture avx avx2

Does vzeroall zero registers ymm16 to ymm31?

Nov 20, 2022

assembly x86 intel avx avx512

Is L2 HW prefetcher really helpful?

Apr 25, 2022

c performance assembly x86-64 avx

AVX log intrinsics (_mm256_log_ps) missing in g++-4.8?

Nov 10, 2022

c++ g++ intrinsics avx

How to efficiently combine comparisons in SSE?

May 06, 2021

c optimization assembly sse avx

Fastest way to unpack 32 bits to a 32 byte SIMD vector

Jan 01, 2017

x86 simd avx bitmask avx2

Do all CPUs which support AVX2 also support SSE4.2 and AVX?

Nov 13, 2022

sse simd avx avx2

SSE runs slow after using AVX

Apr 26, 2021

c++ gcc x86 avx sse2

Does Clang have something like #pragma GCC target?

Sep 27, 2022

clang intrinsics avx pragma

« Newer Entries Older Entries »