Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆79Oct 14, 2024Updated last year
Alternatives and similar repositories for speech-to-speech
Users that are interested in speech-to-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Real-time Img2img translation! (TouchDesigner+T2Iadapter\_canny+SDXL+turbo\_LoRA)☆20Jan 5, 2024Updated 2 years ago
- Local SRT/LLM/TTS Voicechat☆774Oct 12, 2024Updated last year
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated last year
- Jameel Noori Nastaleeq, Noto Nastaliq Urdu and Mehr Nastaliq font for Rooted and Non-Rooted Android device.☆18Dec 16, 2025Updated 5 months ago
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 4th place solution of ESA Kelvin Mars Explorer Power Challenge☆11Aug 6, 2016Updated 9 years ago
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Jun 18, 2025Updated 11 months ago
- Multimodal AI App using Llava 7B and Gradio.☆39Apr 30, 2024Updated 2 years ago
- Browser extension to quickly access Perplexity searchbar from any page with a shortcut☆13Oct 20, 2024Updated last year
- ☆11Mar 30, 2022Updated 4 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆36Dec 31, 2023Updated 2 years ago
- A TouchDesigner extension for integrating Slamtec LIDAR A1/A2 devices, providing real-time point cloud data visualization and OSC data st…☆25Dec 27, 2024Updated last year
- Build local voice agents with open-source models☆4,874Updated this week
- ANE accelerated embedding models!☆19Dec 11, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Collaborative AI Model☆11Nov 27, 2024Updated last year
- A TTS that fits in your CPU (and pocket)☆109Feb 13, 2026Updated 4 months ago
- Automatically generate and overlay subtitles for any video using OpenAi Whisper☆20Oct 1, 2022Updated 3 years ago
- ☆11Jan 6, 2024Updated 2 years ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 3 years ago
- ☆13Jan 14, 2025Updated last year
- complete full-stack application built with Next.js 13.3 that automatically generates blog post content using ChatGPT.☆12Aug 29, 2023Updated 2 years ago
- Framework for Self-Organizing Python Agents☆29Feb 4, 2024Updated 2 years ago
- A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654☆13Jun 5, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Asynchronous pipeline parallel optimization☆22Feb 2, 2026Updated 4 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆144Jun 18, 2024Updated last year
- PicWish T2I, Photo Enhancer and Background Remover for Python☆25Jul 6, 2025Updated 11 months ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- using multion to find all the commenters under a given reddit post, and DMing a message to them.☆16Jul 21, 2024Updated last year
- Notebooks to demonstrate TimmWrapper☆16Jan 16, 2025Updated last year
- Furnace is a high-performance quantitative trading library that provides features similar to CCXT, allowing developers to connect and int…☆16Jan 16, 2025Updated last year
- Workflow used in this video:☆22Feb 28, 2024Updated 2 years ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆250Jan 20, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual☆85Jun 7, 2026Updated last week
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 11 months ago
- A research project exploring fine-tuning BERT-style models for text generation☆41Nov 30, 2025Updated 6 months ago
- Immortal Flappy Bird - train a Flappy that never dies☆25Sep 4, 2017Updated 8 years ago
- E-book for AIS1003 (Object-oriented Programming), which covers an introduction to modern C++ & software engineering fundamentals.☆18May 28, 2026Updated 2 weeks ago
- Identifying tumor affected scans using Fast.ai and detecting them using openCV☆13Jan 18, 2021Updated 5 years ago
- Deploy and scale Pipecat apps to production with Pipecat Cloud☆14Updated this week