Exquisite video generation
☆14Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for OpenSora
Users who are interested in OpenSora are comparing it to the libraries listed below
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- Train Tacotron on a Mandarin dataset☆18May 6, 2019Updated 6 years ago
- ☆56Jul 17, 2023Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆12Mar 26, 2020Updated 5 years ago
- Charisma.ai SDK for Javascript (browser)☆12Aug 13, 2025Updated 6 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- ☆11May 9, 2023Updated 2 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Lab assignments for 6.826☆11Nov 8, 2019Updated 6 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- EUR/USD forex algotrading on QuantConnect with 938% return on 10 years of backtesting data☆12Dec 27, 2018Updated 7 years ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆26Apr 27, 2025Updated 10 months ago
- Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡☆11Jan 23, 2025Updated last year
- JSGF Deducer based on JSGF grammar and WFST☆11Jan 11, 2018Updated 8 years ago
- ☆13Dec 1, 2025Updated 3 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- ☆14Jan 5, 2022Updated 4 years ago
- We systematically studied the influencing factors when LLMs generate benchmarks. By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 9 months ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- Multiple Constrained Minimum Variance (MCMV) beamformer☆13Apr 30, 2020Updated 5 years ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- ☆10Oct 24, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Bilibili video audio track downloader (supports multi-part videos). Rebuilt from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 in Python☆11Jul 31, 2025Updated 7 months ago