NiniAndy / HpyformerLinks
Hpyformer base FunASR
☆30Updated last year
Alternatives and similar repositories for Hpyformer
Users that are interested in Hpyformer are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of VersBand(EMNLP 2025): Versatile Framework for Song Generation with Prompt-based Control☆223Updated 4 months ago
- ☆158Updated 3 months ago
- Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"☆241Updated last year
- Code for "Filling MIDI Velocity using U-Net Image Colorizer" (CMMR2025) PyTorch implementation for filling MIDI velocities from given MID…☆39Updated last month
- [DCASE 2023] Official Implementation for "Low-Complexity Acoustic Scene Classification Using Deep Space Separable Distillation And Mutil-…☆25Updated last year
- LLaQo, a Large Language Query-based Coach in the domain of expressive performance☆111Updated last month
- MTLA: Multi-head Temporal Latent Attention☆761Updated 3 months ago
- DExter: Learning and Controlling Performance Expression through Diffusion models☆114Updated last year
- From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano☆76Updated last year
- A large chinese freelanguage chain tools,you can get free API from:open.bigmodel.cn☆80Updated last year
- UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models☆110Updated 2 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- ☆164Updated last year
- A reading list for trustworthy audio large language models.☆112Updated this week
- Fast and free zeroshot lipsync MCP server☆90Updated 7 months ago
- Symbolic Representation☆83Updated 9 months ago
- Fat-Cat: A document-centric context management Agent. Making context as simple as reading chat history.☆281Updated last week
- [AAAI 2026] Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback☆292Updated last month
- AI Database for unified, scalable SQL + vector management, search and analytics☆205Updated last week
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆557Updated this week
- [NeurIPS2025 spotlight★] Official implementation for "RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Eff…☆192Updated last week
- A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.☆434Updated 2 weeks ago
- ☆241Updated last month
- Tokenize The Virtual Agents Onchain☆243Updated 7 months ago
- LLM Rag Intelligent Q&A Robot☆84Updated 4 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,103Updated last month
- ☆50Updated 9 months ago
- RKAN: Residual Kolmogorov-Arnold Network is designed to enhance the performance of deep learning models.☆274Updated 2 months ago
- This is the code for Visual Reasoning Sequential Attack, which is a method to jailbreak Multimodal Large Language Models Based on their v…☆64Updated last month
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆312Updated last month