Pure-PyTorch Parakeet TDT inference
☆34Mar 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for nano-parakeet
Users that are interested in nano-parakeet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆50Feb 17, 2026Updated last month
- [NeurIPS 2024] Generalizable Person Re-identification via Balancing Alignment and Uniformity☆32Dec 6, 2024Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- Speaker diarization benchmark framework☆40Jan 8, 2026Updated 3 months ago
- A JAX library for building lattice-based speech transducer models☆48Mar 2, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The open-source CapCut alternative for Linux☆86Apr 1, 2026Updated last week
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆30Feb 6, 2026Updated 2 months ago
- The algorithm which plans my day☆10May 23, 2022Updated 3 years ago
- ☆13Sep 3, 2024Updated last year
- ☆53Oct 17, 2023Updated 2 years ago
- ☆37Updated this week
- ☆14Jun 1, 2015Updated 10 years ago
- RxKotlin OkHttp WebSocket Wrapper for Kotlin, Java, and Android☆15Jul 12, 2019Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5☆17Aug 24, 2025Updated 7 months ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- The Smallest English TTS Model with only 1M parameters☆68Mar 3, 2026Updated last month
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆64Apr 11, 2023Updated 2 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆51Mar 2, 2026Updated last month
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆110Aug 16, 2024Updated last year
- Fuzzy search and open Chrome history via the terminal☆18Feb 3, 2021Updated 5 years ago
- ☆15Apr 5, 2023Updated 3 years ago
- [COLING 2025 Industry] LoRA Soups☆19Nov 29, 2024Updated last year
- ☆10Sep 19, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆38Dec 24, 2025Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Coroutines based Kotlin library which bypasses the Cloudflare IUAM page☆11Aug 10, 2020Updated 5 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- A simple and powerful Rust remote method invocation library based on trait objects☆13Jan 6, 2021Updated 5 years ago
- ☆18Apr 15, 2024Updated last year
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Oct 11, 2024Updated last year
- Databases that CodeChain uses☆12Aug 12, 2020Updated 5 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago