Gaiejj/align-anything

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gaiejj/align-anything)

Gaiejj / align-anything

☆16

Alternatives and similar repositories for align-anything

Users that are interested in align-anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

swarupbehera / awesome-audio-visual-question-answering
View on GitHub
A curated list of resources in audio visual question answering and related area. :-)
☆17Jun 29, 2025Updated last year
KomeijiForce / Active_Passive_Constraint_Koishiday_2024
View on GitHub
Koishi's Day 2024 Paper (NeurIPS 2024): An advanced persona-driven role-playing system with global faithfulness quantification and optimi…
☆13Oct 19, 2025Updated 9 months ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
ictnlp / FastLongSpeech
View on GitHub
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech process…
☆16Jul 22, 2025Updated last year
HumanMLLM / Omni-Emotion
View on GitHub
☆22Jan 17, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vua / LabelTool
View on GitHub
A label tool of Target Detection (Label Template can be customized)
☆10Mar 12, 2021Updated 5 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
193746 / VHASR
View on GitHub
☆11Oct 31, 2024Updated last year
rawbeen248 / audio_classification_finetuning
View on GitHub
This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques …
☆10Dec 3, 2024Updated last year
AQ-MedAI / LiveClin
View on GitHub
LiveClin is a live benchmark designed for the faithful replication of clinical practice
☆16Feb 27, 2026Updated 4 months ago
Yaoyaolingbro / notebook
View on GitHub
☆20Mar 4, 2025Updated last year
YapengTian / CCOL-CVPR21
View on GitHub
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆26Nov 24, 2021Updated 4 years ago
schowdhury671 / meerkat
View on GitHub
☆35Jul 9, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mathllm / VoiceAssistant-Eval
View on GitHub
A rigorous framework for evaluating and guiding the development of next-generation AI assistants.
☆19Jan 26, 2026Updated 5 months ago
FreedomIntelligence / MTalk-Bench
View on GitHub
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
☆20Nov 19, 2025Updated 8 months ago
pmachapman / GoTo.Bible
View on GitHub
View and compare Bible Translations in an innovative interlinear format. Run on Windows or Web.
☆15Updated this week
tarun-bisht / wav2vec2-asr
View on GitHub
wav2vec2 asr with transformers
☆16Oct 26, 2021Updated 4 years ago
fun-audio-llm / fun-audio-llm.github.io
View on GitHub
FunAudioLLM homepage
☆17Dec 11, 2024Updated last year
liumy2010 / UFT
View on GitHub
UFT: Unifying Supervised and Reinforcement Fine-Tuning
☆31Jun 30, 2025Updated last year
liaolea / EmoVerse
View on GitHub
[Neurocomputing] EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning
☆19Jul 6, 2025Updated last year
chutaklee / CantoASR
View on GitHub
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16May 8, 2022Updated 4 years ago
xinghaixu / mwget
View on GitHub
多线程的下载，速度是wget的10倍，解决在shell下运行没有问题，但是在crontab后台运行会报Segmentation fault的错误
☆12Apr 19, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhoucz97 / ECPE-MM-R
View on GitHub
[COLING2022] A Multi-turn Machine Reading Comprehension Framework with Rethink Mechanism for Emotion-Cause Pair Extraction
☆18Oct 13, 2022Updated 3 years ago
soumik12345 / clip-lightning
View on GitHub
☆19Aug 3, 2022Updated 3 years ago
ml-inory / melotts.axera
View on GitHub
MeloTTS demo on Axera
☆14Jul 1, 2026Updated 3 weeks ago
Deep-unlearning / Llasa-GRPO
View on GitHub
☆18Nov 19, 2025Updated 8 months ago
X-Gen-Lab / knowledge-os
View on GitHub
一个集成化的个人知识管理平台
☆15Apr 6, 2026Updated 3 months ago
AIBigTruth / 0-9-speech-recognition-system-based-on-GMM
View on GitHub
基于GMM的0-9孤立词语音识别系统
☆10Sep 29, 2020Updated 5 years ago
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
MateoCamara / sota.ai
View on GitHub
Create a State of the art excel around any topic, assisted by an LLM!
☆18May 25, 2026Updated last month
weAreMusicAI / dmx-diffusion
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
SenseTime-FVG / InteractiveOmni
View on GitHub
☆24Dec 3, 2025Updated 7 months ago
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆17Jun 16, 2026Updated last month
du-ud / kaldi-cslt
View on GitHub
☆15Aug 30, 2022Updated 3 years ago
FreedomIntelligence / MyPhoneBench
View on GitHub
MyPhoneBench: Do Phone-Use Agents Respect Your Privacy?
☆24Apr 3, 2026Updated 3 months ago
percent4 / yi_vl_experiment
View on GitHub
本项目是关于Yi的多模态系列模型，如Yi-VL-6B/34B等的实验与应用。
☆14Jan 25, 2024Updated 2 years ago
LeapLabTHU / AdaGen
View on GitHub
Official code for "AdaGen: Learning Adaptive Policy for Image Synthesis"
☆15Mar 18, 2026Updated 4 months ago
streichgeorg / autosing
View on GitHub
☆18Jan 20, 2025Updated last year