GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling
☆168Feb 28, 2025Updated last year
Alternatives and similar repositories for gen-se
Users that are interested in gen-se are comparing it to the libraries listed below.
- ☆98Mar 8, 2025Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆57Apr 14, 2025Updated 11 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆101Apr 1, 2025Updated 11 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆83May 21, 2025Updated 10 months ago
- Tianjin University "Design and Construction I" Course Project☆75Dec 24, 2024Updated last year
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated last year
- ☆297Sep 14, 2025Updated 6 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆38Aug 7, 2024Updated last year
- ☆17Jun 3, 2020Updated 5 years ago
- SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios☆268Jan 22, 2025Updated last year
- ☆176Feb 21, 2025Updated last year
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.☆143Mar 23, 2023Updated 3 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- ☆23Jul 16, 2025Updated 8 months ago
- A new AI Game Paradigm in Autonomous world. It includes configurations for agents, functional buildings, and equipment, as well as the lo…☆89Jan 12, 2025Updated last year
- C++ codes for FDTD Maxwell's equation.☆161Jun 11, 2023Updated 2 years ago
- ☆279Apr 29, 2025Updated 11 months ago
- ☆248Apr 10, 2025Updated 11 months ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆147Jun 2, 2023Updated 2 years ago
- ☆371Sep 6, 2025Updated 6 months ago
- The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models☆717Updated this week
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance☆148Feb 1, 2025Updated last year
- ☆247Nov 24, 2024Updated last year
- Visualization, simulation, and manipulation of intrinsically disordered proteins with Gibbs sampling☆288Oct 24, 2024Updated last year
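- 即迅 speech recognition service, supporting automatic speech recognition (ASR), text-to-speech (TTS), voiceprint recognition (VPR), and other functions; adapted to domestic Chinese ARM operating systems, with fast CPU-based speech recognition☆74Jul 15, 2024Updated last year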
- Inscriptions on CoreDao, powered by Insdexer.☆148Mar 20, 2024Updated 2 years ago
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆207Dec 15, 2025Updated 3 months ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆33Nov 12, 2025Updated 4 months ago
- ☆249Jul 19, 2023Updated 2 years ago
- ☆237Apr 26, 2024Updated last year
- ☆135Sep 24, 2024Updated last year
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,108Nov 26, 2025Updated 4 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆46Mar 10, 2025Updated last year
- ☆75Feb 17, 2025Updated last year
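- Network Drive is a tool for managing documents, files, images, audio, video, and other materials through storage, classification, search, sharing, collaboration, distribution, recycling, and display. It is well suited to controlling document permissions, storage allocation, security encryption, and link sharing in domestic Chinese private-deployment environments, and it also supports lightweight sending and receiving of file tasks. It depends on an open-source digital foundation platform for managing personnel and positions.☆353Mar 19, 2026Updated last week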
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆155Dec 19, 2024Updated last year
- [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling☆1,280Mar 2, 2025Updated last year