Abhi-vish / SeamlessM4t-Translator
SeamlessM4t-Translator: Utilizing the powerful Seamless M4t Facebook model in the backend, this project facilitates seamless translation functionalities including S2ST, S2TT, T2ST, and T2TT queries.
☆9Updated last year
Alternatives and similar repositories for SeamlessM4t-Translator:
Users that are interested in SeamlessM4t-Translator are comparing it to the libraries listed below
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Updated last year
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆64Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆37Updated 2 weeks ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆53Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆39Updated 5 months ago
- AudioLDM text to audio colab☆19Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- AI voice assistant made with Streamlit python and powered by Gemini, Mistral and PHI-3☆12Updated 5 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆30Updated 2 weeks ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆12Updated 11 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆18Updated 3 months ago
- ☆36Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆33Updated 2 years ago
- Generative-X (twitter) augments your twitter timeline with AI using image filters, text-to-speech, auto replies, and dynamic UI component…☆20Updated 10 months ago
- 🧠 Mem4AI: A LLM Friendly memory management library.☆17Updated 2 months ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated last year
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆13Updated last year
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.☆54Updated last year
- ☆14Updated last year
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- ☆19Updated 4 months ago
- ☆12Updated last year
- myChat is an open-source template for developing a React UI interfacing with OpenAI's GPT API. Please note: myChat is not directly linked…☆37Updated last year
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆26Updated 5 months ago
- Voice cloning using coqui-TTS☆10Updated last year
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆27Updated last month