844704781/auto-video

844704781 / auto-video

Promotional post tool: batch-compose videos from images and audio

☆18

Alternatives and similar repositories for auto-video

Users that are interested in auto-video are comparing it to the libraries listed below.

parallel101 / hw08
☆11 · Sep 12, 2023 · Updated 2 years ago
IsraelCohenLab / ConstantBeamwidthUCCA
☆11 · Jun 6, 2022 · Updated 4 years ago
yuguochencuc / CinCGAN-SE
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10 · Jan 24, 2022 · Updated 4 years ago
Xiaobin-Rong / lite-rtse
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14 · Nov 19, 2023 · Updated 2 years ago
ReiherGroup / CoRe_optimizer
Continual Resilient (CoRe) Optimizer for PyTorch
☆12 · Jun 10, 2024 · Updated 2 years ago
audiolabs / anechoic-noise
Generator for anechoic, non-stationary noise signals
☆12 · Aug 12, 2022 · Updated 3 years ago
SoulProficiency / speechseparation-Sandglasset
☆13 · Jun 24, 2021 · Updated 5 years ago
jyhan03 / dpccn
This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.
☆13 · Dec 8, 2021 · Updated 4 years ago
Yuan-ManX / audio-ai-agent
Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.
☆16 · Dec 8, 2023 · Updated 2 years ago
claytonotey / paukn
The pitched audio knife - VST effect with various midi note controlled filters (lowpass, hipass, comb, decimator, granulator, digital wav…
☆14 · Mar 22, 2022 · Updated 4 years ago
ddxsg24 / Personalized-Speech-Enhancement
ASLP Summer Inter@NPU
☆13 · Jul 30, 2024 · Updated last year
XinleiRen / MTFAA-Net
An unofficial non-causal Tensorflow implementation of "Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Spee…
☆14 · Dec 27, 2022 · Updated 3 years ago
tencentmusic / TME-Audio-Super-Resolution-Samples
Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'
☆14 · May 15, 2020 · Updated 6 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
salgado / music-search
Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'
☆16 · Nov 16, 2023 · Updated 2 years ago
xf739645524 / omlsa_imcra_new_version
Complete implementation based on omlsa.m
☆14 · Nov 26, 2021 · Updated 4 years ago
ishine / Project_sp_ehance_matlab
☆12 · Jun 17, 2019 · Updated 7 years ago
0x07dc / declicker
Audio plugin to remove clicks from audio
☆12 · Oct 29, 2021 · Updated 4 years ago
line / WaveTrainerFit
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16 · Feb 6, 2026 · Updated 5 months ago
xmos / lib_agc
Automatic gain control library
☆15 · Jul 13, 2024 · Updated 2 years ago
BUTSpeechFIT / cgmm_mvdr_online
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14 · Jan 14, 2022 · Updated 4 years ago
HuPER29 / HuPER
☆16 · Mar 19, 2026 · Updated 4 months ago
prerak23 / Dir_SrcMic_DOA
Codebase of the submitted work in ICASSP 2023
☆14 · Nov 30, 2022 · Updated 3 years ago
akarsh-prabhakara / spatial-audio
Convert a mono channel recording into binaural playback with headphones and loudspeakers
☆13 · Dec 6, 2023 · Updated 2 years ago
jonysugianto / vad_lsfm
Efficient voice activity detection algorithm using long-term spectral flatness measurement
☆15 · Feb 21, 2017 · Updated 9 years ago
dr-pato / SSGD
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15 · Dec 22, 2022 · Updated 3 years ago
HaoFengyuan / EEND-IAAE
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11 · Aug 27, 2023 · Updated 2 years ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
ml-for-speech / speechtoolkit
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆21 · Jan 10, 2025 · Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
audiolabs / MonteCarloRIRSimulation
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18 · Feb 25, 2026 · Updated 5 months ago
Aratako / CALM-DACVAE
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆18 · Feb 20, 2026 · Updated 5 months ago
JanWilczek / fdaf-double-talk-detector
Frequency-Dependent Adaptive Filtering Double Talk Detector.
☆13 · Mar 26, 2020 · Updated 6 years ago
prerak23 / RoomParamEstim
This is the code for the WASPAA 2021 paper "Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings"
☆17 · Nov 9, 2022 · Updated 3 years ago
ssprl / Real-time-Blind-source-separation-using-IVA
☆16 · Apr 24, 2021 · Updated 5 years ago
TuZehai / Sheffield_Clarity_CEC1_Entry
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18 · Apr 19, 2022 · Updated 4 years ago
ina-foss / InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
☆16 · Jan 20, 2025 · Updated last year
lunixcode / trading_system
A news based stock scalper using LLM and quant approach
☆15 · Jan 16, 2025 · Updated last year
jagger2048 / WebRtc_AGC1
This repository is webrtc agc module demo.
☆12 · Jan 23, 2019 · Updated 7 years ago