Speculative decoding
Red Hat AI Inference 3.4
Speculative decoding with Red Hat AI Inference
Abstract
Learn about speculative decoding and how to deploy and train custom speculator models using the Speculators library with Red Hat AI Inference.
Abstract
We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.
Red Hat is committed to replacing problematic language in our code, documentation, and web properties. For more details, see the Red Hat Blog.