google-research / inksightLinks
☆694Updated 2 months ago
Alternatives and similar repositories for inksight
Users that are interested in inksight are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆669Updated 4 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆872Updated 11 months ago
- Official implementation of the paper "Watermark Anything with Localized Messages"☆1,056Updated 3 months ago
- Detect and extract tables to markdown and csv☆750Updated 7 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆866Updated 6 months ago
- Talk to any ArXiv paper using ChatGPT☆530Updated last year
- Visualise your CSV files in seconds without sending your data anywhere☆513Updated 3 months ago
- ☆442Updated last year
- Math OCR model that outputs LaTeX and markdown☆1,075Updated 7 months ago
- OCR Benchmark☆562Updated 3 months ago
- Animating R1's thoughts.☆384Updated 7 months ago
- MCP server and CLI tool for searching and downloading documents from Anna's Archive☆549Updated 2 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆528Updated this week
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…☆790Updated 7 months ago
- Provides OCR (Optical Character Recognition) services through web applications☆699Updated last year
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…☆1,075Updated last month
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.☆585Updated 3 months ago
- Whisper with Medusa heads☆856Updated last month
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆785Updated last year
- A superfast full-text search application☆1,129Updated last week
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs☆353Updated 7 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, …☆480Updated last week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,752Updated 6 months ago
- Curated resources for discovering, reading, and working with arXiv papers☆340Updated 3 months ago
- Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine!☆472Updated this week
- Examples and guides for using the VLM Run API☆293Updated 2 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system☆775Updated 3 weeks ago
- Code behind Arxiv Papers☆528Updated last year
- A hub for various industry-specific schemas to be used with VLMs.☆533Updated 3 months ago
- Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quic…☆377Updated last week