Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in micro-optimization

x > -1 vs x >= 0, is there a performance difference

Why does mulss take only 3 cycles on Haswell, different from Agner's instruction tables? (Unrolling FP loops with multiple accumulators)

Avoiding the overhead of C# virtual calls

fastest way to negate a number

Passing null pointer to placement new

Does calculating Sqrt(x) as x * InvSqrt(x) make any sense in the Doom 3 BFG code?

How exactly do partial registers on Haswell/Skylake perform? Writing AL seems to have a false dependency on RAX, and AH is inconsistent

Why does Intel's compiler prefer NEG+ADD over SUB?

Comparing two values in the form (a + sqrt(b)) as fast as possible?

INC instruction vs ADD 1: Does it matter?

Do java finals help the compiler create more efficient bytecode? [duplicate]

What does `rep ret` mean?

' ... != null' or 'null != ....' best performance?

"enter" vs "push ebp; mov ebp, esp; sub esp, imm" and "leave" vs "mov esp, ebp; pop ebp"

Divide by 10 using bit shifts?

Does using xor reg, reg give advantage over mov reg, 0? [duplicate]

Weird use of `?:` in `typeid` code

How to force GCC to assume that a floating-point expression is non-negative?

Fast method to copy memory with translation - ARGB to BGR

Is reading the `length` property of an array really that expensive an operation in JavaScript?