kanodiaayush / make-doc-listenableLinks
Turn a doc into plaintext which you can listen to using TTS
☆20Updated 2 years ago
Alternatives and similar repositories for make-doc-listenable
Users that are interested in make-doc-listenable are comparing it to the libraries listed below
Sorting:
- Python notebook to run OpenAI's Whisper model with speaker identification☆80Updated 2 years ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- web based editor for subtitles and transcripts☆142Updated last year
- LLM plugin for embeddings using sentence-transformers☆72Updated 5 months ago
- Concise answers to search queries using Google and GPT-3. Includes citations.☆81Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- Generate captions for images with Salesforce BLIP☆122Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆33Updated 10 months ago
- https://ollama.com/search?o=newest☆32Updated this week
- Whisper combined with Silero VAD, for improved long-form transcriptions☆53Updated 2 years ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆198Updated 7 months ago
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Self-editing GPT-4 application☆70Updated 2 years ago
- Better Bookmarks Search w/ Transformers☆197Updated last year
- Embedding models from Jina AI☆65Updated last year
- A growing list of fake podcasts generated by Notebook LM☆33Updated 10 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆30Updated last month
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Zoom Audio Transcription offline☆32Updated 5 years ago
- Quality News - Towards a fairer ranking formula for Hacker News☆83Updated this week
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Speaker diarization service☆24Updated 3 months ago
- OpenAI Whisper + davinci for podcast summarization☆71Updated 2 years ago
- Praetor is a lightweight finetuning data and prompt management tool☆67Updated 10 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Detect whether or not an audio file was generated by NotebookLM☆140Updated 10 months ago