Meta's REFRAG Unlocks 16× Longer Contexts and Up to 31× Faster RAG Decoding
'Meta Superintelligence Labs released REFRAG, a decoding framework that compresses retrieved passages to enable 16× longer contexts and up to 30.85× faster time-to-first-token while preserving accuracy.'