A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation
☆227Mar 9, 2026Updated last week
Alternatives and similar repositories for Hive
Users who are interested in Hive are comparing it to the libraries listed below
- Official Repository for "Music Source Restoration"☆32Jun 1, 2025Updated 9 months ago
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆138Mar 8, 2026Updated last week
- Escape room game with quiz-solving and smart AI navigation. Unity + NavMesh + C#.☆41Jun 18, 2025Updated 9 months ago
- ☆98Mar 8, 2025Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 3 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- ☆17Jun 24, 2025Updated 8 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Align Anything: Training All-modality Model with Feedback☆4,634Nov 27, 2025Updated 3 months ago
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆86Sep 18, 2025Updated 6 months ago
- Demo for testing dynamic loading of the libos module.☆10Nov 8, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- This is a complete online exam system☆10Dec 27, 2019Updated 6 years ago
- ☆11Dec 17, 2025Updated 3 months ago
- A service that automatically fetches the latest arXiv articles.☆11Apr 29, 2020Updated 5 years ago
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- Audio-FLAN☆159Sep 23, 2025Updated 5 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆61Jul 1, 2025Updated 8 months ago
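- Digital Base (数字底座) is a unified, secure management support platform for the digital transformation of large government agencies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and related functions. Based on the three-administrator management model, it offers microservices, multi-tenancy, containerization, and support for domestic Chinese technology stacks, lets users quickly build their own business applications with a code generator, and can also link to various…☆2,578Updated this week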
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆188Mar 9, 2026Updated last week
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆227Feb 26, 2026Updated 2 weeks ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Pytorch implementation of subband decomposition☆92Jul 26, 2022Updated 3 years ago
- A simple implementation for improving CosyVoice2 by GRPO method☆34Oct 17, 2025Updated 5 months ago
- ☆16Feb 19, 2026Updated 3 weeks ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 3 months ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- ☆12Mar 11, 2025Updated last year
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Nov 8, 2025Updated 4 months ago
- 💰The only genuine version💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy mining pool fee skimming, mining pool proxy, mining pool relay, mining pool fee…☆3,881Mar 4, 2026Updated 2 weeks ago
- GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling☆168Feb 28, 2025Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 6 months ago
- ☆36Sep 6, 2025Updated 6 months ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year