tincans-ai/gazelle-inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tincans-ai/gazelle-inference)

tincans-ai / gazelle-inference

proof of concept conversation orchestrator with a speech-language model

☆20

Alternatives and similar repositories for gazelle-inference

Users that are interested in gazelle-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

speechnovateur / languagecodec_tmp
View on GitHub
Temporary anonymous version
☆22Mar 20, 2024Updated 2 years ago
aiola-lab / drax
View on GitHub
Drax: Speech Recognition with Discrete Flow Matching
☆75Oct 15, 2025Updated 9 months ago
tincans-ai / gazelle
View on GitHub
Joint speech-language model - respond directly to audio!
☆374Jul 1, 2024Updated 2 years ago
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
kuan2jiu99 / Awesome-Speech-Generation
View on GitHub
Survey on speech generation work.
☆21Nov 26, 2023Updated 2 years ago
BUTSpeechFIT / DeCRED
View on GitHub
☆18Aug 13, 2025Updated 11 months ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
Sho-N / BabyWatcher
View on GitHub
☆15Dec 13, 2021Updated 4 years ago
Wataru-Nakata / ssl-vocoders
View on GitHub
Implementation of vocoders empowered with pytorch lightning
☆18Jan 27, 2024Updated 2 years ago
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆35Nov 19, 2024Updated last year
zhangqi-here / UnifiedEAE
View on GitHub
A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck
☆10Sep 9, 2022Updated 3 years ago
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago
MicahZoltu / vultr-raid0
View on GitHub
Some scripts to create a Vultr instance with multiple physical drives RAID0 together.
☆14Jul 11, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mesolitica / vllm-whisper
View on GitHub
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆35Jul 28, 2024Updated 2 years ago
ruvnet / open-space
View on GitHub
An open source code of the GitHub Copilot Workspace
☆14Jun 8, 2024Updated 2 years ago
tomaarsen / TTSTextNormalization
View on GitHub
Convert English text from written expressions into spoken forms
☆32Jun 22, 2022Updated 4 years ago
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
gladiaio / normalization
View on GitHub
A lightweight library for normalizing speech transcripts before computing WER
☆28Jul 14, 2026Updated 2 weeks ago
interactiveaudiolab / emphases
View on GitHub
Crowdsourced and Automatic Speech Prominence Estimation
☆27Apr 12, 2024Updated 2 years ago
thevoicecompany / gazelle-train
View on GitHub
Joint speech-language model - respond directly to audio!
☆30May 13, 2024Updated 2 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
ShovalMessica / NAST
View on GitHub
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆46Jul 2, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
frankyoujian / Edge-Punct-Casing
View on GitHub
☆33Feb 4, 2025Updated last year
joaoantoniocn / AM-MobileNet1D
View on GitHub
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31Oct 3, 2023Updated 2 years ago
simplc / WCN-BERT
View on GitHub
Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).
☆12Jun 12, 2023Updated 3 years ago
0nutation / SLMTokBench
View on GitHub
SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
☆37Aug 29, 2023Updated 2 years ago
mllm-ie / mllm-ie.github.io
View on GitHub
☆11Feb 5, 2024Updated 2 years ago
yangdongchao / RSTnet
View on GitHub
Real-time Speech-Text Foundation Model Toolkit (wip)
☆255Mar 26, 2025Updated last year
swift-cloud / vercel-starter-kit
View on GitHub
A starter kit for deploying Swift applications to Vercel
☆10Apr 6, 2024Updated 2 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
View on GitHub
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
Pythagora-io / gpt-pilot-db-analysis-tool
View on GitHub
☆21Feb 2, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ayushpai / RAG-SingleStore-ChatBot
View on GitHub
☆12Jan 8, 2024Updated 2 years ago
cpdu / vallt
View on GitHub
☆36Mar 14, 2025Updated last year
liuhuadai / ViT-TTS
View on GitHub
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11Oct 20, 2023Updated 2 years ago
Stack-Box / stackbox
View on GitHub
Create app stacks loaded with all your favourite clients, services and infra along with code boilerplates in under 5 mins.
☆13Jan 19, 2023Updated 3 years ago
1140251 / Ethsential
View on GitHub
EthSential is a security analysis framework for Ethereum smart contracts. It bundles other tools to find vulnerabilities in smart contrac…
☆23Oct 6, 2020Updated 5 years ago
BUTSpeechFIT / DiCoW
View on GitHub
☆100Jan 28, 2026Updated 6 months ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago