genai-nantes-meetup / shift-hackathon-nantes-2024
🏴‍☠️ Awesome projects built during the Shift Hackathon (Nantes / 2024)
☆10Updated last year
Alternatives and similar repositories for shift-hackathon-nantes-2024
Users who are interested in shift-hackathon-nantes-2024 are comparing it to the repositories listed below
- A pattern for an always on AI Assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering☆928Updated 6 months ago
- https://hf.co/hexgrad/Kokoro-82M☆3,577Updated last week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆3,254Updated 2 weeks ago
- A Conversational Speech Generation Model☆13,751Updated last month
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…☆8,132Updated last week
- ☆11Updated 2 months ago
- ✅2-in-1 AI Developer and Project Manager. AI agents plan an entire project in Todoist and code it task by task.☆520Updated last week
- Converts text to speech in realtime☆3,286Updated last week
- A fast multimodal LLM for real-time voice☆4,099Updated last week
- The Python library for real-time communication☆4,128Updated last week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆6,752Updated this week
- Sharing early versions of Ada, a personal AI Assistant built on OpenAI's Realtime API☆700Updated 8 months ago
- Control Any Computer Using LLMs.☆2,291Updated 4 months ago
- ☆29Updated 7 months ago
- Neura Spark Listener is a modern, customizable AI chatbot interface that supports multiple language models and visual templates.☆20Updated last month
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆1,884Updated last week
- A Python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas…☆2,862Updated 7 months ago
- Open Source framework for voice and multimodal conversational AI☆6,805Updated last week
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆369Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆8,657Updated this week
- This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.☆6,023Updated last month
- Airweave lets agents search any app☆2,748Updated this week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co…☆4,032Updated 4 months ago
- Command Your World with Voice☆723Updated last month
- A framework to enable multimodal models to operate a computer.☆9,781Updated 2 months ago
- React app for inspecting, building and debugging with the Realtime API☆3,335Updated 3 weeks ago
- A React-based starter app for using the Live API over websockets with Gemini☆2,243Updated last month
- Towards Human-Sounding Speech☆5,229Updated 2 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,228Updated 3 months ago
- SoTA open-source TTS☆9,357Updated last month