Oh, S. (CSE) – Efficient Instruction Supply for Datacenter Processors
Modern datacenter CPUs lose 25–66% of execution cycles to instruction-delivery stalls. This bottleneck persists, despite the recent trend towards accelerators and GPUs, as there is continuing demand by applications that only execute on CPUs. Two workload classes dominate today’s datacenter execution cycles: hyperscale server software (databases, build systems, and content stores), whose large instruction footprints […]