proof of concept conversation orchestrator with a speech-language model
☆20Oct 19, 2024Updated last year
Alternatives and similar repositories for gazelle-inference
Users that are interested in gazelle-inference are comparing it to the libraries listed below
Sorting:
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- ☆13Oct 3, 2025Updated 5 months ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 4 months ago
- Repository for Knowledge Platform - 2.0☆17Updated this week
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- Make your landing page look good in minutes. Animated ready to use sections for your landing page.☆18Apr 6, 2025Updated 11 months ago
- Survey on speech generation work.☆21Nov 26, 2023Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆372Jul 1, 2024Updated last year
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆56Jun 25, 2024Updated last year
- ☆21Sep 24, 2018Updated 7 years ago
- ☆29Feb 4, 2025Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Convert English text from written expressions into spoken forms☆28Jun 22, 2022Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- ☆32Feb 3, 2026Updated last month
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- A universal messaging library for cross-platform applications (Chrome extension, Web, Mobile, Iframe,...)☆15Oct 10, 2025Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆33Jul 28, 2024Updated last year
- Joint speech-language model - respond directly to audio!☆30May 13, 2024Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- Java Lint Library☆12Oct 17, 2023Updated 2 years ago
- ☆16Jan 13, 2022Updated 4 years ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆254Mar 26, 2025Updated 11 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Ask AI to test your website with a specific goal☆15Dec 22, 2023Updated 2 years ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆24Updated this week
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Connect with the Lightspeed Retail API☆12Oct 22, 2024Updated last year
- Local LLM Testing & Benchmarking for Apple Silicon☆56Feb 26, 2026Updated last week
- openEHR Clinical modelling tooling setup☆10Jun 24, 2018Updated 7 years ago
- BlockCAT token sale smart contracts.☆11Oct 19, 2017Updated 8 years ago
- Media asset for BTCPayServer☆11Aug 19, 2024Updated last year
- An ERC721 implementation of event tickets.☆10Jan 15, 2022Updated 4 years ago
- ☆13Oct 9, 2025Updated 5 months ago