Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 2026].
☆32Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for Omni-AVSR
Users that are interested in Omni-AVSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆60Jan 18, 2026Updated 2 months ago
- ☆13Oct 25, 2024Updated last year
- ☆63Jul 1, 2025Updated 8 months ago
- A simple gitlab/github web hooks daemon☆16Feb 6, 2026Updated last month
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tistory Readme Stat Card☆11Mar 27, 2024Updated 2 years ago
- ☆12Dec 16, 2025Updated 3 months ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- ☆10Mar 3, 2026Updated 3 weeks ago
- PyTorch unoffical implementation of "PoE-GAN : Multimodal Conditional Image Synthesis with Product-of-Experts GANs"☆14Mar 29, 2023Updated 3 years ago
- Mad Square's Brawl is the 2D Android Platformer PVP game.☆17Feb 15, 2023Updated 3 years ago
- ☆99Feb 4, 2026Updated last month
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated 2 months ago
- DO with Terraform and Ansible☆11Jun 5, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Nov 17, 2018Updated 7 years ago
- ☆10Oct 24, 2024Updated last year
- A Wordle game written in Rust, refined. Play in browser with the power of WebAssembly! Course project of Programming Training, Tsinghua U…☆17Jul 10, 2024Updated last year
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated last year
- Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100☆14Dec 1, 2020Updated 5 years ago
- A minimal java desktop app with awesome UI based on Swing to drag and drop files programmatically.☆24Jan 19, 2018Updated 8 years ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆21Oct 31, 2025Updated 4 months ago
- A framework for building speech-enabled websites.☆10Jul 10, 2015Updated 10 years ago