Fast parallel CTC.
☆31Aug 31, 2018Updated 7 years ago
Alternatives and similar repositories for warp-ctc
Users that are interested in warp-ctc are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of Adaptive Input in PyTorch☆12Feb 22, 2019Updated 7 years ago
- Sparse unsupervised capsules☆23Jan 22, 2019Updated 7 years ago
- ☆22Feb 25, 2020Updated 6 years ago
- Pytorch Bindings for warp-ctc☆761Jul 2, 2023Updated 2 years ago
- Chainer implementation of deep-INFOMAX☆34Aug 29, 2018Updated 7 years ago
- ☆32Aug 1, 2018Updated 7 years ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- Javascript-powered Swype interface☆16Apr 15, 2013Updated 12 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features.☆12Feb 26, 2024Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- ☆35Dec 9, 2020Updated 5 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 3 weeks ago
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- Visual question answering for CVPR16 VQA Challenge.☆41Nov 5, 2016Updated 9 years ago
- A lightweight muji-moe chatbot created by Reecho.ai.☆13Oct 1, 2024Updated last year
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 10 months ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- ☆12Oct 24, 2024Updated last year
- Template for PhD thesis using Tufte's style book☆11Mar 13, 2020Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- MDLText☆12Jul 13, 2017Updated 8 years ago
- Code for the paper☆11May 24, 2024Updated last year
- Fully connected neural nets for supervised learning DQMC data☆12Jul 13, 2016Updated 9 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.☆12Sep 25, 2024Updated last year
- Listing my favorite research papers 📝 from different fields as I read them.☆10Oct 17, 2019Updated 6 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆10Oct 29, 2022Updated 3 years ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Datasets for machine translation☆10Jul 5, 2019Updated 6 years ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- ☆11Nov 7, 2024Updated last year