New posts in simd

Constexpr and SSE intrinsics

Dec 27, 2021

c++ sse simd constexpr intrinsics

Get index of first element that is not zero in a __m256 variable

Nov 09, 2022

c++ c sse simd avx

What is the difference between SPMD and SIMD?

Oct 22, 2022

terminology parallel-processing simd

Does rewriting memcpy/memcmp/... with SIMD instructions make sense?

Sep 21, 2022

performance sse simd

SIMD instructions for floating point equality comparison (with NaN == NaN)

Dec 24, 2019

assembly floating-point x86 x86-64 simd intel

Sum reduction of unsigned bytes without overflow, using SSE2 on Intel

Apr 12, 2018

x86 sse simd sse2 sse3

Fast vectorized rsqrt and reciprocal with SSE/AVX depending on precision

Oct 24, 2021

performance sse simd avx

Using __m256d registers

Oct 18, 2022

c++ x86 intel simd avx

Load address calculation when using AVX2 gather instructions

Feb 15, 2022

x86 sse simd avx2

Branch and predicated instructions

Oct 27, 2022

cuda simd

SIMD the following code

Sep 21, 2022

c x86 sse simd

Why does the FMA _mm256_fmadd_pd() intrinsic have 3 asm mnemonics, "vfmadd132pd", "231" and "213"?

Jan 08, 2022

assembly x86 simd instruction-set fma

Can I use the AVX FMA units to do bit-exact 52 bit integer multiplications?

Jun 21, 2022

floating-point x86 simd avx2 fma

How can I disable vectorization while using GCC?

Sep 20, 2022

gcc vectorization sse simd auto-vectorization

Fastest way to compute distance squared

Oct 16, 2022

c optimization simd

How to transpose a 16x16 matrix using SIMD instructions?

Sep 20, 2022

assembly matrix intel simd avx512

How to quickly count bits into separate bins in a series of ints on Sandy Bridge? [duplicate]

Oct 31, 2022

c++ assembly x86 simd avx

Fast 24-bit array -> 32-bit array conversion?

Mar 04, 2022

c bitmap bit-manipulation sse simd

Count each bit-position separately over many 64-bit bitmasks, with AVX but not AVX2

Oct 31, 2022

c optimization x86 x86-64 simd

GCC C vector extension: How to check if result of ANY element-wise comparison is true, and which?

May 01, 2021

c gcc comparison vectorization simd