Archive
Writing
Everything I've published — on gaming, displays, AI, hardware, programming, and agents. The hype, examined. Filter by tag, or just scroll.
2 pieces tagged #unified-memory
2026
2
Stop Guessing GGUF Quants: A VRAM-to-Precision Lookup Table for Local LLMs
Consumer GPUs are bandwidth-bound, not precision-bound. Here’s the exact VRAM-to-quant lookup table that maximizes tokens/sec without crossing the perceptible quality threshold.
RTX Spark: 128GB Unified Memory Won't Fix the Bandwidth Bottleneck
NVIDIA's RTX Spark packs 128GB of unified memory, but ~300 GB/s bandwidth caps inference throughput—here's the math on what you can actually run locally versus the cloud.