ualiawan / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
β22Updated 3 years ago
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- turnkey self-hosted offline transcription and diarization service with llm summaryβ918Updated 3 weeks ago
- π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.β611Updated 8 months ago
- Dictation app based on the OpenAI speech-to-text modelsβ210Updated last year
- Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.β643Updated 11 months ago
- A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.β353Updated last year
- Handy voice dictation using whisper.β23Updated 2 months ago
- OCRmyPDF EasyOCR pluginβ98Updated 4 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β911Updated last month
- Provides OCR (Optical Character Recognition) services through web applicationsβ704Updated 2 years ago
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β217Updated last year
- LLM Chain querying a scientific Zotero library, with citationsβ441Updated 2 years ago
- LLM plugin providing access to models running on an Ollama serverβ352Updated last month
- Multimodal RAG to search and interact locally with technical documents of any kindβ283Updated 3 weeks ago
- open source audio and video transcription softwareβ470Updated last month
- macOS OCR command-line tool for almost any image formatβ61Updated 11 months ago
- Too Long, Didn't Watch: End-to-End Rolling Summarizer of Long Videosβ363Updated 5 months ago
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and beyβ¦β835Updated last month
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ118Updated last year
- Chat with your codeβ157Updated 8 months ago
- Fully neural approach for text chunkingβ406Updated 3 months ago
- Fast and secure translation on your local machine, powered by marian and Bergamot.β591Updated 10 months ago
- Mac Menu bar app for accurate speech-to-text with multimodal LLMsβ82Updated 2 weeks ago
- Python Package for FSRS Spaced Repetitionβ382Updated 3 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,275Updated 10 months ago
- Convert your ChatGPT export (ZIP) into clean Markdown text files with inline media, and generate data visualizations like word clouds andβ¦β821Updated this week
- Streaming Markdown renderer for tui clisβ335Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β250Updated this week
- A hub for various industry-specific schemas to be used with VLMs.β539Updated last month
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ535Updated 2 years ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β887Updated 2 months ago