
Modern AI systems demand low-latency, high-quality retrieval and serving over billion-scale keys and vectors. This proposal studies learned hashing and overlay networks to co-locate semantically related items and steer queries with minimal coordination. We first present LEAD, to our knowledge the first use of order-preserving learned hash functions in distributed key-value overlays, enabling efficient range […]