Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆79Oct 14, 2024Updated last year
Alternatives and similar repositories for speech-to-speech
Users that are interested in speech-to-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Real-time Img2img translation! (TouchDesigner+T2Iadapter\_canny+SDXL+turbo\_LoRA)☆20Jan 5, 2024Updated 2 years ago
- Local SRT/LLM/TTS Voicechat☆778Oct 12, 2024Updated last year
- Self-host LLMs with LMDeploy and BentoML☆22Dec 26, 2025Updated 4 months ago
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated last year
- web application, powered by Python Flask and OpenAI GPT-3, designed to generate exceptional AI-generated content for a wide range of appl…☆13Feb 7, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Lossless compression of BF16 MLP weights for LLM inference on NVIDIA Hopper GPUs☆48Apr 17, 2026Updated last month
- This Flutter app leverages Firebase for authentication, storage, and real-time database. It provides a seamless chat experience with supp…☆18Nov 29, 2023Updated 2 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆35Dec 31, 2023Updated 2 years ago
- TensorRT implementation of the waifu2x super-resolution model for faster image and video upscaling.☆18Nov 24, 2024Updated last year
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector) 服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12May 24, 2022Updated 4 years ago
- Code implementation for 《Large AI Model Empowered Multimodal Semantic communication》☆26Jul 4, 2024Updated last year
- some papers about Kalman Filter☆16Sep 4, 2019Updated 6 years ago
- ☆13Jan 14, 2025Updated last year
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A massively multilingual modern encoder language model☆140Jan 20, 2026Updated 4 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆144Jun 18, 2024Updated last year
- PicWish T2I, Photo Enhancer and Background Remover for Python☆25Jul 6, 2025Updated 10 months ago
- Estimate probability of failure using reframed Bayesian optimization☆10Aug 14, 2025Updated 9 months ago
- Workflow used in this video:☆22Feb 28, 2024Updated 2 years ago
- Notebooks to demonstrate TimmWrapper☆16Jan 16, 2025Updated last year
- Furnace is a high-performance quantitative trading library that provides features similar to CCXT, allowing developers to connect and int…☆16Jan 16, 2025Updated last year
- A competition about search results CTR prediction in real-time search scenario ! Rank : 26 / 2888 !☆19Feb 4, 2019Updated 7 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆40Nov 30, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Identifying tumor affected scans using Fast.ai and detecting them using openCV☆13Jan 18, 2021Updated 5 years ago
- Run OpenDevin inside Docker☆28Jul 22, 2025Updated 10 months ago
- ☆13Mar 7, 2025Updated last year
- ☆14Apr 8, 2026Updated last month
- ☆19Aug 19, 2025Updated 9 months ago
- AscTec quadrotor drivers☆17Aug 22, 2019Updated 6 years ago
- Explore how Gemini's built-in vision and multimodality approach enable it to interpret images and generate text based on visual input.☆11Jan 3, 2024Updated 2 years ago
- C++ inference engine for running GLiNER (Generalist and Lightweight Named Entity Recognition) models☆48Apr 20, 2026Updated last month
- High performance, DPDK-based, user space firewall☆13Dec 9, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Feb 17, 2026Updated 3 months ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆16Sep 10, 2025Updated 8 months ago
- FastRTC voice agent☆22Mar 18, 2025Updated last year
- Simple WebRTC audio/video conferencing demo using SkylinkJS and React.☆13Jun 16, 2015Updated 10 years ago
- Android app that uses the Yahoo Finance api to display the latest stock prices.☆19Apr 7, 2016Updated 10 years ago
- Tutorial about noisy labels for SIBGRAPI 2020☆11Nov 6, 2020Updated 5 years ago
- Gradio UI to load crewAI configuration from excel xls and generate the python code. The source of the crews is in the xls. It allows for …☆10Oct 17, 2025Updated 7 months ago