Shadowfita/parakeet-tdt-0.6b-v2-fastapi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shadowfita/parakeet-tdt-0.6b-v2-fastapi)

Shadowfita / parakeet-tdt-0.6b-v2-fastapi

A FastAPI wrapper for NVIDIA's new parakeet 0.6b v2 TTS 600-million-parameter model designed for high-quality English speech recognition

☆186

Alternatives and similar repositories for parakeet-tdt-0.6b-v2-fastapi

Users that are interested in parakeet-tdt-0.6b-v2-fastapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

groxaxo / parakeet-tdt-0.6b-v3-fastapi-openai
View on GitHub
A FastAPI wrapper for NVIDIA's new parakeet 0.6b v3 TTS 600m model designed for high-quality multilingual speech recognition, beating Whi…
☆203Jul 10, 2026Updated last week
jfgonsalves / parakeet-diarized
View on GitHub
Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
☆78Feb 21, 2026Updated 4 months ago
altunenes / parakeet-rs
View on GitHub
very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust
☆367Updated this week
istupakov / onnx-asr
View on GitHub
A lightweight Python package for Automatic Speech Recognition using ONNX models
☆346Updated this week
senstella / parakeet-mlx
View on GitHub
An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.
☆960Jun 5, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
justinlime / Fatterbox
View on GitHub
Open API and Wyoming wrapper around Chatterbox
☆27Jan 2, 2026Updated 6 months ago
Deep-unlearning / Finetune-Voxtral-ASR
View on GitHub
☆38Oct 10, 2025Updated 9 months ago
davidbrowne17 / csm-streaming-tf
View on GitHub
A transformers implementation of csm-streaming
☆30May 16, 2025Updated last year
Deep-unlearning / Finetune-Parakeet
View on GitHub
☆25Oct 22, 2025Updated 8 months ago
wwang1110 / kokoro_batch
View on GitHub
☆19Feb 23, 2026Updated 4 months ago
alby13 / NVIDIA-Nemo-Parakeet-TDT-0-6B-V2-Audio-to-Text
View on GitHub
NVIDIA Nemo Parakeet TDT 0.6B V2 Audio to Text Python Script
☆20May 8, 2025Updated last year
davidbrowne17 / chatterbox-streaming
View on GitHub
Streaming and Fine-tuning for Chatterbox TTS
☆291Jun 15, 2025Updated last year
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,979Jan 26, 2026Updated 5 months ago
lucky-bai / wasm-speech-streaming
View on GitHub
Offline streaming speech-to-text in the browser
☆25Aug 28, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rsxdalv / chatterbox
View on GitHub
SoTA open-source TTS
☆165Dec 16, 2025Updated 7 months ago
groxaxo / Qwen3-TTS-Openai-Fastapi
View on GitHub
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆214Jul 10, 2026Updated last week
speaches-ai / speaches
View on GitHub
☆3,522Updated this week
marhensa / vibevoice-realtime-openai-api
View on GitHub
OpenAI API-compatible text-to-speech server using Microsoft VibeVoice-Realtime-0.5B. Docker or Python venv support, multiple voices with …
☆83Dec 9, 2025Updated 7 months ago
mawwalker / stt-server
View on GitHub
stt websockect server using sherpa-onnx
☆56Feb 28, 2026Updated 4 months ago
k-koehler / gguf-tensor-overrider
View on GitHub
☆57Oct 10, 2025Updated 9 months ago
Deveraux-Parker / Nvidia_parakeet-tdt-0.6b-v2-FAST-BATCHING-API-1200x-RTFx
View on GitHub
☆43Oct 9, 2025Updated 9 months ago
randombk / chatterbox-vllm
View on GitHub
VLLM Port of the Chatterbox TTS model
☆379Oct 18, 2025Updated 9 months ago
abeiro / saig-gwserver
View on GitHub
Simple AI Gateway Skyrim Mod Gateway Server
☆14Sep 12, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zserge / utensil
View on GitHub
A tiny single-header tensor library in C
☆21Jul 13, 2026Updated last week
kroko-ai / kroko-onnx
View on GitHub
Kroko ASR - Speech-to-text
☆155May 28, 2026Updated last month
ncoder-ai / VibeVoice-FastAPI
View on GitHub
FastAPI wrapper around original Vibevoice 1.5B and 7B models, with support for AWQ4 quant
☆33Jun 22, 2026Updated 3 weeks ago
Deep-unlearning / Finetune-Dia-TTS
View on GitHub
☆22Aug 21, 2025Updated 10 months ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
pipecat-ai / smart-turn
View on GitHub
☆1,475Jan 29, 2026Updated 5 months ago
matatonic / openedai-speech
View on GitHub
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆860Feb 2, 2025Updated last year
The-Data-Dilemma / MediBeng-Whisper-Tiny
View on GitHub
MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech…
☆29Jul 24, 2025Updated 11 months ago
CrispStrobe / CrisperWeaver
View on GitHub
On-device speech-to-text Flutter app powered by CrispASR (ggml / Whisper) — offline, multi-platform, AGPL-3.0.
☆38Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
travisvn / chatterbox-tts-api
View on GitHub
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP…
☆627Dec 23, 2025Updated 6 months ago
huggingface / open_asr_leaderboard
View on GitHub
☆228Updated this week
ysharma3501 / FastNeuTTS
View on GitHub
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
☆118Nov 24, 2025Updated 7 months ago
tleyden / arty
View on GitHub
iOS realtime voice assistant w/ translation + connectors
☆20Jul 12, 2026Updated last week
nitotm / efficient-language-detector-py
View on GitHub
Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.
☆22Jul 9, 2026Updated last week
taresh18 / orpheus-streaming
View on GitHub
Orpheus TTS Server with streaming support (TTFB ~160ms)
☆26Sep 21, 2025Updated 9 months ago
Valtora / Nojoin
View on GitHub
A self-hosted meeting transcription app that doesn't need to join your meetings as a bot.
☆53Updated this week