huggingface / speech-to-speech
Speech To Speech: an effort toward an open-source, modular GPT-4o
☆2,989 Updated last week
Related projects:
- Open Source framework for voice and multimodal conversational AI ☆3,044 Updated this week
- Inference and training library for high-quality TTS models. ☆4,193 Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,100 Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,531 Updated last month
- The easiest way to use Agentic RAG in any enterprise ☆3,132 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆2,884 Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,155 Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆891 Updated last week
- tiny vision language model ☆4,893 Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,739 Updated last week
- Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale. ☆2,055 Updated this week
- Parse files for optimal RAG ☆2,450 Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆4,578 Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆2,489 Updated this week
- Agent Zero AI framework ☆3,676 Updated this week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming… ☆2,425 Updated this week
- A framework for Claude Opus to intelligently orchestrate subagents. ☆4,120 Updated 2 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆6,008 Updated this week
- ☆1,705 Updated this week
- 🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper ☆2,763 Updated last week
- MARS5 speech model (TTS) from CAMB.AI ☆2,440 Updated last month
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,542 Updated 2 weeks ago
- Foundational model for human-like, expressive TTS ☆3,721 Updated last month
- Whisper with Medusa heads ☆774 Updated last week
- Mixture of Agents using Groq ☆899 Updated last month
- ☆2,652 Updated this week
- Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. ☆1,374 Updated this week
- Large Action Model framework to develop AI Web Agents ☆5,289 Updated this week
- PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LL… ☆2,112 Updated 2 weeks ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ☆1,509 Updated last month