andysingal / Audio-LLMLinks
The purpose of this repository is to discuss on Audio transformers
☆12Updated last week
Alternatives and similar repositories for Audio-LLM
Users that are interested in Audio-LLM are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Updated last year
- Developer showcase of projects built on Cartesia☆17Updated 9 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated 7 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- ☆12Updated last year
- Your Python AI Coder!☆34Updated last month
- a version of baby agi using dspy and typed predictors☆17Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated 2 weeks ago
- Apps that run on modal.com☆12Updated last year
- GraphRag vs Embeddings☆14Updated 11 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆35Updated 2 weeks ago
- ☆11Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- In this course you'll learn to use Gradio to create user-friendly apps with minimal code: Summarize text using a large language model, ge…☆14Updated last year
- ☆9Updated 3 months ago
- ☆20Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated last month
- ☆1Updated 11 months ago
- A python library to find differences between audio and transcriptions☆20Updated last year
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆13Updated last year
- Structured outputs from DSPy and Jinja2☆23Updated last month
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆22Updated 7 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆27Updated last week
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Updated last month
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated last year
- Widest collection of generative ai usecases in enterprise & startups☆19Updated last year
- Use Gemma3:4b model on Ollama to make a fully functional streamlit OCR App using Vibe Coding with Cursor Code Editor☆14Updated 3 months ago
- 🧠 Retrieval Augmented Generation (RAG) example☆17Updated 11 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 9 months ago