mtkresearch / generative-fusion-decoding

Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency by enabling seamless fusion without requiring re-training.
76 · Updated 5 months ago

Alternatives and similar repositories for generative-fusion-decoding:

Users who are interested in generative-fusion-decoding compare it to the libraries listed below.