pipecat-ai / voice-ai-primer-web
☆39 Updated 2 months ago
Alternatives and similar repositories for voice-ai-primer-web
Users who are interested in voice-ai-primer-web are comparing it to the libraries listed below.
- Testing and evaluation framework for voice agents ☆148 Updated 3 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆136 Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆150 Updated 7 months ago
- Routing on Random Forest (RoRF) ☆200 Updated 11 months ago
- ☆66 Updated this week
- Kyutai with an "eye" ☆217 Updated 5 months ago
- Collection of Open Source Speech Data ☆160 Updated 9 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆278 Updated 3 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- ☆102 Updated last year
- A simple client and utils for interacting with OpenAI's Realtime API in Python ☆240 Updated 3 months ago
- ☆181 Updated 6 months ago
- A real-time Pipecat debugger ☆57 Updated last week
- GPT-4 Level Conversational QA Trained In a Few Hours ☆64 Updated last year
- Website with current metrics on the fastest AI models. ☆43 Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 7 months ago
- Scripts to create your own moe models using mlx ☆90 Updated last year
- A basic voice agent built with Python agents framework ☆48 Updated last month
- Open-source reproducible benchmarks from Argmax ☆57 Updated this week
- ☆34 Updated last year
- ☆47 Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆54 Updated 4 months ago
- ☆210 Updated last week
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆116 Updated last month
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆196 Updated 6 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat ☆58 Updated 11 months ago
- The official Cartesia client for Python. ☆101 Updated last month
- Arxflix turns your boring Arxiv research paper into a captivating video. ☆52 Updated this week
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆45 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 Updated last year