kanodiaayush / make-doc-listenableLinks
Turn a doc into plaintext which you can listen to using TTS
☆21Updated 2 years ago
Alternatives and similar repositories for make-doc-listenable
Users that are interested in make-doc-listenable are comparing it to the libraries listed below
Sorting:
- Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.☆118Updated last year
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Updated 2 years ago
- Algorithmic composition of modern classical music in the twelve-tone technique.☆13Updated 8 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- Transcription and Diarization based on OpenAI's Whisper☆24Updated 4 months ago
- LLM plugin for embeddings using sentence-transformers☆74Updated 8 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Python notebook to run OpenAI's Whisper model with speaker identification☆80Updated 3 years ago
- web based editor for subtitles and transcripts☆142Updated last year
- Zoom Audio Transcription offline☆32Updated 5 years ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- Perform OCR upon entire videos to look for credentials or similar.☆43Updated 3 years ago
- LLM plugin for clustering embeddings☆82Updated last year
- Speaker diarization service☆25Updated 6 months ago
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- Read files (pdf/png/jpg) with OCR and rename using AI.☆24Updated 2 years ago
- Glanceables is a handy macOS desktop app that turns parts of websites into easy-to-view widgets. This app makes it simpler to keep tabs o…☆58Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- Generate captions for images with Salesforce BLIP☆123Updated last year
- An easy-to-use library and command-line tool for TTS☆15Updated 8 months ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆45Updated 2 years ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆20Updated last year
- Example python project demonstrating how to create a native macOS GUI with AppKit and PyObjC☆30Updated 5 months ago
- iOS Safari Extension to convert web pages to Markdown text☆44Updated 3 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- Local & Private LLM that drafts responses LIKE you automatically☆84Updated last year
- Some tough questions to test new models.☆28Updated last year
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆22Updated last year
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Updated 3 weeks ago