NVIDIA / voice-agent-examplesLinks
Pipecat framework based orchestrator for building real-time, voice-enabled, and multimodal conversational AI agents
☆34Updated last month
Alternatives and similar repositories for voice-agent-examples
Users that are interested in voice-agent-examples are comparing it to the libraries listed below
Sorting:
- ☆69Updated 9 months ago
- Collection of reference workflows for building intelligent agents with NIMs☆184Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆163Updated 5 months ago
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆127Updated last year
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆97Updated last year
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC…☆181Updated last month
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆96Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- an optimized, production-ready implementation of active speaker detection☆78Updated last year
- Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A☆373Updated last month
- NVIDIA ACE samples, workflows, and resources☆299Updated 6 months ago
- A Gradio component that can be used to annotate images with bounding boxes.☆66Updated 2 months ago
- ☆206Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model☆62Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆18Updated 2 years ago
- ☆14Updated 2 years ago
- This repo contains codes covered in the youtube tutorials.☆86Updated 7 months ago
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.☆261Updated last week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆356Updated last week
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆19Updated 3 months ago
- A service to convert audio to facial blendshapes for lipsyncing and facial performances.☆203Updated 7 months ago
- ☆127Updated 9 months ago
- ☆199Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆13Updated 4 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆231Updated last month
- unsloth-5090-multiple☆60Updated 7 months ago
- Kyutai with an "eye"☆233Updated 9 months ago