augmentedstartups / Roomey_AI_Voice_AgentLinks
Roomey is a multi-purpose Voice Agent designed to run your personal and business life.
☆27Updated last week
Alternatives and similar repositories for Roomey_AI_Voice_Agent
Users that are interested in Roomey_AI_Voice_Agent are comparing it to the libraries listed below
Sorting:
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 8 months ago
- The purpose of this repository is to discuss on Audio transformers☆12Updated last week
- Voice Agent Framework for Conversational AI☆53Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction☆96Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧☆9Updated 9 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆29Updated last year
- ☆127Updated 3 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated last year
- Self-hosted AI voice agent☆109Updated 10 months ago
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
- YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smooth…☆56Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆10Updated last year
- ☆20Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆18Updated 3 months ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆15Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 3 weeks ago
- ☆18Updated last year
- A minimalistic streamlit chatbot UI to combine and customize tools for langchain llm agents☆13Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated 2 weeks ago
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆74Updated 11 months ago
- llmware RAG Demo App.☆17Updated last year
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆19Updated last year
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆49Updated 6 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 10 months ago
- Diagnose the performance of your RAG🩺☆36Updated 2 months ago
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R…☆56Updated 9 months ago