FP4, a 4-bit floating-point format with only 16 possible bit patterns, is now the default inference format for billion-parameter models on NVIDIA Blackwell hardware.
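
To make the size of that value set concrete, here is a minimal sketch that decodes every 4-bit code. It assumes the E2M1 layout (1 sign bit, 2 exponent bits, 1 mantissa bit, exponent bias 1, no infinities or NaNs) that is commonly used for FP4; the story itself does not name the layout, so treat that as an assumption. The name `decode_fp4_e2m1` is hypothetical and not taken from any library.

```python
def decode_fp4_e2m1(code: int) -> float:
    """Decode one 4-bit code, assuming the layout: sign | exp(2 bits) | mantissa(1 bit)."""
    if not 0 <= code <= 0xF:
        raise ValueError("code must fit in 4 bits")
    sign = -1.0 if code & 0b1000 else 1.0
    exponent = (code >> 1) & 0b11
    mantissa = code & 0b1
    if exponent == 0:
        # Subnormal range: no implicit leading 1, so only 0 and 0.5 are possible.
        magnitude = mantissa * 0.5
    else:
        # Normal range: implicit leading 1, exponent bias assumed to be 1.
        magnitude = (1.0 + mantissa * 0.5) * 2.0 ** (exponent - 1)
    return sign * magnitude


# All 16 codes, in code order (0b0000 .. 0b1111).
FP4_VALUES = [decode_fp4_e2m1(code) for code in range(16)]

if __name__ == "__main__":
    for code, value in enumerate(FP4_VALUES):
        print(f"{code:04b} -> {value}")
```

Under this assumed layout the non-negative codes decode to 0, 0.5, 1, 1.5, 2, 3, 4 and 6, and the other eight codes are their negatives. Because +0 and -0 compare equal, the 16 bit patterns cover 15 distinct numeric values. Real deployments pair such codes with per-block scale factors, which this sketch leaves out.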