推文工具: 图片音频批量合成视频
☆18May 23, 2024Updated last year
Alternatives and similar repositories for auto-video
Users that are interested in auto-video are comparing it to the libraries listed below
Sorting:
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- ☆15Sep 16, 2024Updated last year
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- ☆14Apr 18, 2023Updated 2 years ago
- ☆11Sep 12, 2023Updated 2 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆23Feb 11, 2026Updated 3 weeks ago
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- ☆11Jun 6, 2022Updated 3 years ago
- ☆10Jun 24, 2021Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆41Feb 25, 2026Updated last week
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- modules for the evaluation of acoustic echo cancellation systems☆17Nov 2, 2021Updated 4 years ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- ☆20Aug 25, 2025Updated 6 months ago
- Sound field reconstruction using neural processes with dynamic kernels☆16Mar 25, 2025Updated 11 months ago
- 批量小管家-微博运营管理工具,高效的微博定时工具 ✅ 支持多账号批量管理,提升运营效率10倍以上 ✅ 100+功能覆盖微博全部维度管理 ✅ 24小时自动化运营,解放双手持续增粉☆14Aug 10, 2025Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆33Oct 15, 2025Updated 4 months ago
- 📝 A lightweight tool for auto-posting markdown content to X (Twitter). Schedule and manage your social media content with ease.☆11Nov 20, 2024Updated last year
- AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧☆10Aug 30, 2024Updated last year
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17Feb 25, 2026Updated last week
- ☆15Oct 31, 2022Updated 3 years ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"☆10May 8, 2023Updated 2 years ago
- An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…☆14Dec 27, 2022Updated 3 years ago
- Automatic gain control library☆15Jul 13, 2024Updated last year
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆38Oct 10, 2025Updated 4 months ago
- This is the unofficial implementation of MFNet, from paper''a Mask Free Neural Network for Monaural Speech Enhancement''☆13Dec 20, 2024Updated last year
- Convert a mono channel recording into binaural playback with headphones and loudspeakers☆13Dec 6, 2023Updated 2 years ago
- This repository is webrtc agc module demo.☆12Jan 23, 2019Updated 7 years ago
- Frequency-Dependent Adaptive Filtering Double Talk Detector.☆12Mar 26, 2020Updated 5 years ago
- An open-source AI agent that brings the power of Gemini directly into your terminal.☆25Jul 3, 2025Updated 8 months ago
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆21Nov 25, 2024Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago