thunlp / Ouroboros

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
60Updated 6 months ago

Related projects: