kanodiaayush / make-doc-listenableLinks
Turn a doc into plaintext which you can listen to using TTS
☆20Updated 2 years ago
Alternatives and similar repositories for make-doc-listenable
Users that are interested in make-doc-listenable are comparing it to the libraries listed below
Sorting:
- LLM plugin for embeddings using sentence-transformers☆74Updated 9 months ago
- web based editor for subtitles and transcripts☆143Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆24Updated 5 months ago
- Python notebook to run OpenAI's Whisper model with speaker identification☆80Updated 3 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆35Updated 5 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- ☆19Updated last year
- Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.☆117Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆162Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- Convert epub file to txt☆43Updated 2 years ago
- LLM plugin for clustering embeddings☆82Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆45Updated 2 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Updated last year
- OpenAI Whisper + davinci for podcast summarization☆70Updated 2 years ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆16Updated 2 years ago
- Embedding models from Jina AI☆65Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆102Updated 2 years ago
- Generate captions for images with Salesforce BLIP☆125Updated last year
- An easy-to-use library and command-line tool for TTS☆15Updated 9 months ago
- Very fast, accurate speaker diarization☆228Updated this week
- Perform OCR upon entire videos to look for credentials or similar.☆44Updated 3 years ago
- Zoom Audio Transcription offline☆32Updated 5 years ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆18Updated 2 years ago
- This is an optimized implementation of OpenAI's Whisper for multilingual transcription.☆39Updated 3 years ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆20Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year