Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in x86

How to write to screen with video memory address 0xb8000 from real mode?

Can x86 reorder a narrow store with a wider load that fully contains it?

Debugging disassembled libraries with gdb

assembly x86 gdb disassembly

fastest way to write a bitstream on modern x86 hardware

What is a Partial Flag Stall?

When using a mask register with AVX-512 load and stores, is a fault raised for invalid accesses to masked out elements?

x86 avx avx512

x86 Program Counter abstracted from microarchitecture?

Is it possible to vectorize myNum += a[b[i]] * c[i]; on x86_64?

About setjmp/longjmp

c linux x86 setjmp

Porting compiler from x86 Assembly to LLVM

What's the difference between __popcnt() and _mm_popcnt_u32()?

x86 sse intrinsics sse4

What's inside the stack?

c x86 stack low-level

What is the overhead of using Intel Last Branch Record?

Do the x86 virtualization instruction sets (VT-x, AMD-V) have alternate uses?

virtualization x86

Is vxorps-zeroing on AMD Jaguar/Bulldozer/Zen faster with xmm registers than ymm?

what's the difference between _mm256_lddqu_si256 and _mm256_loadu_si256

Two very similar functions involving sin() exhibit vastly different performance -- why?

L1 memory bandwidth: 50% drop in efficiency using addresses which differ by 4096+64 bytes

c caching memory x86 avx

is there an inverse instruction to the movemask instruction in intel avx2?

x86 intrinsics avx avx2 icc

Is integer overflow undefined in inline x86 assembly?

c gcc x86 undefined-behavior