New posts in simd

SIMD code runs slower than scalar code

Apr 28, 2022

c optimization sse simd sse2

Free/open source C/C++ library of vectorized math functions? [closed]

Mar 06, 2022

c++ c simd numerical

Usage of _mm_shuffle_epi8 intrinsic

Jun 25, 2022

performance optimization x86 sse simd

Which is the reason for avx floating point bitwise logical operations?

Nov 16, 2022

c++ simd avx avx2

Computing the inner product of vectors with allowed scalar values 0, 1 and 2 using AVX intrinsics

Jun 14, 2022

c++ simd avx

Fastest 64-bit population count (Hamming weight)

Dec 25, 2018

performance optimization assembly simd avx

SIMD vector memory load in LLVM

Feb 20, 2021

c++ llvm simd llvm-ir avx

How can I get the compiler to output faster code for a string search loop, using SIMD vectorization and/or parallelization?

Sep 05, 2022

c assembly vectorization compiler-optimization simd

How can I exchange the middle two 64 bits in a 256 bit AVX(YMM) register

Dec 19, 2017

x86 simd avx

How to do _mm256_maskstore_epi8() in C/C++?

Oct 23, 2021

c++ simd intrinsics avx avx2

no speedup using openmp + SIMD

Mar 30, 2019

c++ multithreading performance openmp simd

Loop versioning with GCC

Apr 27, 2022

gcc alignment simd vectorization

Does stb_image simd support exist?

Nov 04, 2021

c++ c jpeg simd

Comparing two vector<bool> with SSE

Mar 21, 2022

c++ x86 sse simd

Fast SSE low precision exponential using double precision operations

Apr 18, 2019

c++ precision sse simd exponential

256-bit vectorization via OpenMP SIMD prevents compiler's optimization (say function inlining)?

Apr 22, 2022

c gcc openmp simd auto-vectorization

Is it possible to combine Rayon and Faster?

Oct 22, 2022

parallel-processing rust simd rayon
