Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency by enabling seamless fusion without requiring re-training.
☆87Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for generative-fusion-decoding
Users that are interested in generative-fusion-decoding are comparing it to the libraries listed below
Sorting:
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- ⚙️Tool for NLP - handle file and text☆15Feb 16, 2025Updated last year
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆36Feb 10, 2024Updated 2 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- ☆13Sep 25, 2024Updated last year
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM.☆53Jun 14, 2025Updated 8 months ago
- LLM CLI Interface - Extremely Convenient and Fast☆12Sep 22, 2025Updated 5 months ago
- Technical Analysis on Cryptocurrency☆25Oct 14, 2025Updated 4 months ago
- ☆234Aug 25, 2025Updated 6 months ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 5 months ago
- ☆11Jun 28, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- ☆16Feb 10, 2026Updated 3 weeks ago
- 偵測使用者的平台、系統以及瀏覽器☆13Feb 21, 2020Updated 6 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆13Jul 1, 2025Updated 8 months ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- ☆31Jul 13, 2023Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Aug 10, 2023Updated 2 years ago
- 一小時 No-Code 自製客服機器人 GPT☆17May 28, 2024Updated last year
- ☆17May 5, 2024Updated last year
- ☆15Sep 9, 2021Updated 4 years ago
- sharing and learning python skills☆15Jun 19, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- 3000.gov.tw☆16Jun 17, 2020Updated 5 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- 應用Google Colaboratory免費GPU資源來完成深度學習卷積神經網路執行影像二元分類☆15Jun 17, 2018Updated 7 years ago
- pure go for rwkv☆19Dec 31, 2023Updated 2 years ago