google-research / inksightLinks
β688Updated last month
Alternatives and similar repositories for inksight
Users that are interested in inksight are comparing it to the libraries listed below
Sorting:
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β661Updated 2 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β859Updated 4 months ago
- Animating R1's thoughts.β383Updated 5 months ago
- Detect and extract tables to markdown and csvβ749Updated 6 months ago
- Official implementation of the paper "Watermark Anything with Localized Messages"β1,042Updated last month
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β865Updated 10 months ago
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and beyβ¦β771Updated 5 months ago
- Visualise your CSV files in seconds without sending your data anywhereβ511Updated last month
- MCP server and CLI tool for searching and downloading documents from Anna's Archiveβ486Updated 3 weeks ago
- β442Updated 10 months ago
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems fβ¦β1,063Updated 2 months ago
- Math OCR model that outputs LaTeX and markdownβ1,066Updated 6 months ago
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programsβ350Updated 5 months ago
- OpenCV+YOLO+LLAVA powered video surveillance systemβ766Updated this week
- Examples and guides for using the VLM Run APIβ286Updated 3 weeks ago
- Code behind Arxiv Papersβ526Updated last year
- Talk to any ArXiv paper using ChatGPTβ527Updated last year
- π discover story relationshipsβ337Updated last month
- Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine!β465Updated 2 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.β604Updated 5 months ago
- A passive recording project allows you to have complete control over your data. Automatically take screenshots of all your screens, indexβ¦β1,289Updated last week
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooksβ183Updated last month
- Provides OCR (Optical Character Recognition) services through web applicationsβ685Updated last year
- Open-source framework for exporting your personal data.β1,443Updated 7 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, β¦β475Updated last week
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.β524Updated last week
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Softwareβ321Updated 6 months ago
- Code release for "LLMs can see and hear without any training"β447Updated 2 months ago
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)β667Updated 4 months ago
- Local Video-LLM powered AI Baby Monitorβ397Updated 2 months ago