emova-ollm / EMOVAView external linksLinks
Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)
☆76Mar 16, 2025Updated 10 months ago
Alternatives and similar repositories for EMOVA
Users that are interested in EMOVA are comparing it to the libraries listed below
Sorting:
- A project for tri-modal LLM benchmarking and instruction tuning.☆56Mar 27, 2025Updated 10 months ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆19Oct 20, 2025Updated 3 months ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆127Dec 9, 2024Updated last year
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆57Jun 7, 2024Updated last year
- BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing☆59Mar 11, 2024Updated last year
- ☆113Sep 18, 2025Updated 4 months ago
- The official code for “Dance-to-Music Generation with Encoder-based Textual Inversion“☆22Jun 17, 2025Updated 7 months ago
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆124Nov 8, 2025Updated 3 months ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆37Apr 7, 2025Updated 10 months ago
- Actually released!☆10Feb 24, 2021Updated 4 years ago
- llama-omni训练代码复现☆73Jan 23, 2025Updated last year
- Streamlit YOLOv5 deployment template☆27Jul 18, 2025Updated 6 months ago
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆31Dec 6, 2023Updated 2 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated 11 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- cbReader - A simple web-based comic book reader (CBZ/CBR)☆10May 21, 2018Updated 7 years ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆132Sep 19, 2025Updated 4 months ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Desktop client for Walltaker powered by golang☆12Sep 13, 2022Updated 3 years ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Jan 6, 2025Updated last year
- Official implementation of "MoST: Motion Style Transformer between Diverse Action Contents"☆37Jun 26, 2024Updated last year
- A pytorch implementation of D3Net.☆11Aug 8, 2021Updated 4 years ago
- Cannabis strain information☆10Feb 20, 2016Updated 9 years ago
- ☆14Dec 5, 2025Updated 2 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆81Jun 7, 2024Updated last year
- A rewrite of Open Hexagon☆12Updated this week
- A daemon exposing the OpenZWave API via Apache Thrift (and some useful tools)☆27Nov 16, 2018Updated 7 years ago
- A hubot script to perform Service Now API record lookups.☆10Apr 17, 2023Updated 2 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Mar 12, 2023Updated 2 years ago
- MOVED TO☆10Oct 29, 2018Updated 7 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 4 months ago
- Vim environment for authoring, compiling, and debugging Inform7 based interactive fiction works.☆11Aug 22, 2020Updated 5 years ago
- Pytorch directly integrated to the cloud all through Bench AI!☆10Dec 10, 2023Updated 2 years ago
- ☆13Aug 28, 2024Updated last year
- Official repo for ICCV 2025 paper "Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation"☆17Sep 3, 2025Updated 5 months ago
- Bad Dragon 3D Model Downloader is a command-line utility that facilitates the downloading of 3D models, along with their respective textu…☆11May 18, 2023Updated 2 years ago
- Player bots for Garry's Mod TTT☆15Aug 25, 2025Updated 5 months ago
- gl hf☆14Oct 18, 2021Updated 4 years ago