SesameAILabs / csm
A Conversational Speech Generation Model
☆11,931Updated this week
Alternatives and similar repositories for csm:
Users that are interested in csm are comparing it to the libraries listed below
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆6,261Updated 3 weeks ago
- TTS Towards Human-Sounding Speech☆3,162Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆14,647Updated this week
- Open Source framework for voice and multimodal conversational AI☆5,360Updated this week
- A lightweight, powerful framework for multi-agent workflows☆7,667Updated this week
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,636Updated this week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co…☆3,466Updated last month
- Run AI Agent in your browser.☆10,134Updated this week
- Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, kn…☆21,898Updated this week
- https://hf.co/hexgrad/Kokoro-82M☆1,911Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆2,165Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆10,821Updated this week
- ☆8,748Updated this week
- Fully local web research and report writing assistant☆6,669Updated last week
- A fast multimodal LLM for real-time voice☆3,771Updated last month
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆19,839Updated this week
- Agent Zero AI framework☆6,406Updated last week
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl☆5,208Updated last month
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆32,762Updated this week
- Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.☆9,228Updated this week
- an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM☆10,959Updated this week
- Agent Framework / shim to use Pydantic with LLMs☆7,819Updated this week
- The Memory layer for AI Agents☆26,839Updated this week
- The python library for real-time communication☆3,355Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆7,929Updated this week
- ☆2,454Updated last month
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆3,264Updated this week
- Python scraper based on AI☆18,840Updated this week
- 🤗 smolagents: a barebones library for agents that think in python code.☆15,909Updated this week
- This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a…☆10,011Updated last week