844704781 / auto-videoView external linksLinks
推文工具: 图片音频批量合成视频
☆18May 23, 2024Updated last year
Alternatives and similar repositories for auto-video
Users that are interested in auto-video are comparing it to the libraries listed below
Sorting:
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆30Jan 13, 2026Updated last month
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆15Sep 16, 2024Updated last year
- A news based stock scalper using LLM and quant approach☆14Jan 16, 2025Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- ☆10Jun 24, 2021Updated 4 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆40Nov 4, 2025Updated 3 months ago
- ☆14Apr 18, 2023Updated 2 years ago
- ☆19Aug 25, 2025Updated 5 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆31Oct 15, 2025Updated 4 months ago
- ☆11Jun 6, 2022Updated 3 years ago
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- ☆11Sep 12, 2023Updated 2 years ago
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆37Oct 10, 2025Updated 4 months ago
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17May 25, 2025Updated 8 months ago
- modules for the evaluation of acoustic echo cancellation systems☆17Nov 2, 2021Updated 4 years ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…☆14Dec 27, 2022Updated 3 years ago
- Frequency-Dependent Adaptive Filtering Double Talk Detector.☆12Mar 26, 2020Updated 5 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 4 months ago
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 10 months ago
- Automatic gain control library☆15Jul 13, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"☆10May 8, 2023Updated 2 years ago
- AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧☆10Aug 30, 2024Updated last year
- ☆15Oct 31, 2022Updated 3 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆13Jan 14, 2022Updated 4 years ago
- Convert a mono channel recording into binaural playback with headphones and loudspeakers☆12Dec 6, 2023Updated 2 years ago
- Codebase of the submitted work in ICASSP 2023☆14Nov 30, 2022Updated 3 years ago
- This repository is webrtc agc module demo.☆12Jan 23, 2019Updated 7 years ago
- Speaker adaptive forced alignment (phonetic segmentation) using Wav2Vec2☆22Jan 13, 2026Updated last month
- Generating non-stationary multi-sensor signals under a spatial coherence constraint (Python)☆22Jan 14, 2025Updated last year
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆33Jan 28, 2026Updated 2 weeks ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Dec 8, 2021Updated 4 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year