Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in micro-optimization

Fast search of some nibbles in two ints at same offset (C, microoptimisation)

Why is `-1 * x` faster than `-x` and why?

ruby micro-optimization

In x86-64 asm: is there a way of optimising two adjacent 32-bit stores / writes to memory if the source operands are two immediate values?

Branch on ?: operator?

Which integer operations have higher performance alternate methods in Rust?

C++ Adding 2 arrays together quickly

Fast Euclidean division in C

Avoiding AVX-SSE (VEX) Transition Penalties

Why is using structure Vector3I instead of three ints much slower in C#?

For loop performance: counters with same value vs. different values

Are there any performance test results for usage of likely/unlikely hints?

Using bools in calculations to avoid branches

Two's complement of long integer

Why are these 8 byte-writes not optimized into a MOV?

How to force NASM to encode [1 + rax*2] as disp32 + index*2 instead of disp8 + base + index?

Most efficient popcount on `__uint128_t`?

What's the easiest way to determine if a register's value is equal to zero or not?

Difference between "or eax,eax" and "test eax,eax" [duplicate]

Which Intel microarchitecture introduced the ADC reg,0 single-uop special case?