awni / warp-ctcView external linksLinks
Fast parallel CTC.
☆31Aug 31, 2018Updated 7 years ago
Alternatives and similar repositories for warp-ctc
Users that are interested in warp-ctc are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce (RecSys'19)☆14Aug 23, 2021Updated 4 years ago
- Unofficial implementation of Adaptive Input in PyTorch☆12Feb 22, 2019Updated 6 years ago
- Sparse unsupervised capsules☆23Jan 22, 2019Updated 7 years ago
- ☆22Feb 25, 2020Updated 5 years ago
- Pytorch Bindings for warp-ctc☆761Jul 2, 2023Updated 2 years ago
- Chainer implementation of deep-INFOMAX☆34Aug 29, 2018Updated 7 years ago
- ☆32Aug 1, 2018Updated 7 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆27Oct 22, 2025Updated 3 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Jan 6, 2025Updated last year
- Javascript-powered Swype interface☆16Apr 15, 2013Updated 12 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- ☆35Dec 9, 2020Updated 5 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Theano+Keras implementation of style transfer algorithms.☆38Aug 11, 2023Updated 2 years ago
- Visual question answering for CVPR16 VQA Challenge.☆41Nov 5, 2016Updated 9 years ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 2 months ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Mirror of https://gerrit.wikimedia.org/g/wikimedia/textcat See https://www.mediawiki.org/wiki/Developer_access for contributing☆11Jan 27, 2026Updated 2 weeks ago
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆10Oct 29, 2022Updated 3 years ago
- Standalone Repository of AWS Lambda Client from soto-project/soto☆11Sep 14, 2020Updated 5 years ago
- ☆17Sep 17, 2025Updated 4 months ago
- Sound2Synth Plug-Ins☆13Jul 28, 2022Updated 3 years ago
- Observe the dataset of images and targets in few shots☆11Sep 27, 2022Updated 3 years ago
- Simple example for learning and serving 'MNIST' in kubernetes cluster☆10Mar 27, 2019Updated 6 years ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.☆12Sep 25, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- System Design and Product Architecture Diagrams☆11Sep 22, 2024Updated last year
- MDLText☆12Jul 13, 2017Updated 8 years ago
- PyPI package to calculate comprehensive confidence intervals for classification positive rate, precision, NPV, and recall using a labeled…☆10Jul 6, 2023Updated 2 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- Template for PhD thesis using Tufte's style book☆11Mar 13, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/hunpos☆12Apr 9, 2018Updated 7 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- Listing my favorite research papers 📝 from different fields as I read them.☆10Oct 17, 2019Updated 6 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago