mtkresearch / generative-fusion-decoding

Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency by enabling seamless fusion without requiring re-training.
67Updated 2 months ago

Related projects

Alternatives and complementary repositories for generative-fusion-decoding