liydaco / sesame-csmLinks
sesame
☆10Updated 2 months ago
Alternatives and similar repositories for sesame-csm
Users that are interested in sesame-csm are comparing it to the libraries listed below
Sorting:
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆19Updated 7 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 10 months ago
- What Would Portland Do? Generative agent experience☆13Updated last year
- PegasusX: The Future of Multimodal Embeddings 🦄 🦄☆15Updated 7 months ago
- A chat UI for Llama.cpp☆13Updated last week
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated last week
- ☆17Updated last month
- ☆28Updated 9 months ago
- Thin wrapper around GGML to make life easier☆34Updated last week
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- An extension to use Kokoro TTS in text generation webui☆20Updated last month
- OpenPipe Reinforcement Learning Experiments☆25Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆34Updated 10 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- Simple CogVLM client script☆14Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 11 months ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- ☆17Updated 2 months ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆41Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 10 months ago
- A random walk voice style cloning application for Kokoro text to speech☆85Updated 2 weeks ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- Gradio UI for RWKV LLM☆29Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆21Updated last year
- LLM backed Fantasy Tribe Game☆18Updated 6 months ago