Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in micro-optimization

Micro Optimization of a 4-bucket histogram of a large array or list

Fast BCD addition

What is the optimal way for reading the contents of a webpage into a string in Java?

Efficient extraction of first/only key in a dictionary

Is calling `add` on a memory location faster than calling it on a register and then moving the value?

Cost of a 64bits jump, always 10-22 cycles the first time?

Rotating (by 90°) a bit matrix (up to 8x8 bits) within a 64-bit integer

Saturate short (int16) in C++

What are the pros and cons of int, unsigned int, uint_fastN_t, and int_fastN_t?

Efficient AVX2 implementation of a 17x17-bit squaring operation with result truncation

Adding a byte from memory to the AX register

How much faster are SSE4.2 string instructions than SSE2 for memcmp?

How to get gcc to generate decent code that checks if a buffer is full of NUL bytes?

c gcc x86 micro-optimization

Efficient division operator for 3D vectors

Can accessing uninitialized values result in a performance hit?

Is there any point in creating a second column optimized for FULLTEXT searches?

Does the order of class members affect access speed?

Is CMOVcc considered a branching instruction?