maxmelichov / Text-To-speech
Roboshaul
☆18Updated 3 months ago
Alternatives and similar repositories for Text-To-speech
Users that are interested in Text-To-speech are comparing it to the libraries listed below
- ivrit.ai codebase☆40Updated this week
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆151Updated 5 months ago
- Google Colab Notebooks for Transcription with Whisper☆24Updated 5 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆21Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆60Updated 5 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- This repository contains a user-friendly Graphical User Interface (GUI) for interacting with the Hebrew-Mistral-7B language model.☆15Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆102Updated 4 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆37Updated 9 months ago
- https://narrateit.streamlit.app/☆39Updated 9 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- ☆29Updated last month
- web based editor for subtitles and transcripts☆142Updated last year
- Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built with Next.js and Tailwind CSS.…☆15Updated last year
- ☆15Updated 3 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆57Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆30Updated last month
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆24Updated 10 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆44Updated 11 months ago
- ☆27Updated 2 years ago
- ☆35Updated last year
- Speaker diarization model☆28Updated 2 years ago
- Voice data <= 10 mins can also be used to train a good VC model!☆12Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆23Updated last month
- On-device speaker recognition engine powered by deep learning☆37Updated 2 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆299Updated 3 months ago
- PyToon is a Python based animation library for automatically animating a cartoon character's mouth movements and bodily expressions to sy…☆51Updated 10 months ago
- Web Interface for Vision Language Models Including InternVLM2☆23Updated last year
- A library to convert Pydantic models to TypedDict☆36Updated last year
- A voice to text keyboard based on OpenAI Whisper Model.☆50Updated 2 years ago