google-research / inksightLinks

☆688

Alternatives and similar repositories for inksight

Users that are interested in inksight are comparing it to the libraries listed below

Sorting:

ses4255 / Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆661Updated 2 months ago
lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
☆859Updated 4 months ago
dhealy05 / frames_of_mind
Animating R1's thoughts.
☆383Updated 5 months ago
VikParuchuri / tabled
Detect and extract tables to markdown and csv
☆749Updated 6 months ago
facebookresearch / watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
☆1,042Updated last month
yigitkonur / llm-ocr
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…
☆865Updated 10 months ago
thiswillbeyourgithub / AnkiAIUtils
AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…
☆771Updated 5 months ago
visprex / visprex
Visualise your CSV files in seconds without sending your data anywhere
☆511Updated last month
iosifache / annas-mcp
MCP server and CLI tool for searching and downloading documents from Anna's Archive
☆486Updated 3 weeks ago
punnerud / Local_Knowledge_Graph
☆442Updated 10 months ago
DonTizi / rlama
A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…
☆1,063Updated 2 months ago
VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,066Updated 6 months ago
mohsen1 / llm-debugger-vscode-extension
VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs
☆350Updated 5 months ago
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆766Updated this week
vlm-run / vlmrun-cookbook
Examples and guides for using the VLM Run API
☆286Updated 3 weeks ago
imelnyk / ArxivPapers
Code behind Arxiv Papers
☆526Updated last year
evanhu1 / talk2arxiv
Talk to any ArXiv paper using ChatGPT
☆527Updated last year
herol3oy / austen
📚 discover story relationships
☆337Updated last month
awwaiid / ghostwriter
Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine!
☆465Updated 2 months ago
therealoliver / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆604Updated 5 months ago
arkohut / pensieve
A passive recording project allows you to have complete control over your data. Automatically take screenshots of all your screens, index…
☆1,289Updated last week
ash80 / RLHF_in_notebooks
RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
☆183Updated last month
OCR4all / OCR4all
Provides OCR (Optical Character Recognition) services through web applications
☆685Updated last year
Surfer-Org / Protocol
Open-source framework for exporting your personal data.
☆1,443Updated 7 months ago
thiswillbeyourgithub / wdoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, …
☆475Updated last week
clemlesne / scrape-it-now
Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.
☆524Updated last week
souzatharsis / tamingLLMs
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
☆321Updated 6 months ago
facebookresearch / MILS
Code release for "LLMs can see and hear without any training"
☆447Updated 2 months ago
C-Loftus / QuickPiperAudiobook
With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)
☆667Updated 4 months ago
zeenolife / ai-baby-monitor
Local Video-LLM powered AI Baby Monitor
☆397Updated 2 months ago