google-research / inksight
☆683 · Updated 2 weeks ago
Alternatives and similar repositories for inksight
Users who are interested in inksight are comparing it to the libraries listed below.
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆656 · Updated last month
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ☆854 · Updated 4 months ago
- Talk to any ArXiv paper using ChatGPT ☆526 · Updated last year
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… ☆863 · Updated 9 months ago
- Official implementation of the paper "Watermark Anything with Localized Messages" ☆1,032 · Updated 3 weeks ago
- Send Morse code via ⏮️ ⏸️ ⏯️ ☆412 · Updated 8 months ago
- Animating R1's thoughts. ☆383 · Updated 4 months ago
- Detect and extract tables to markdown and csv ☆748 · Updated 5 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content. ☆520 · Updated last month
- A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words … ☆638 · Updated this week
- ☆444 · Updated 9 months ago
- Math OCR model that outputs LaTeX and markdown ☆1,065 · Updated 5 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆775 · Updated 11 months ago
- Curated resources for discovering, reading, and working with arXiv papers ☆311 · Updated last month
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs ☆346 · Updated 5 months ago
- Code behind Arxiv Papers ☆523 · Updated last year
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆594 · Updated 4 months ago
- Code release for "LLMs can see and hear without any training" ☆447 · Updated 2 months ago
- Visualise your CSV files in seconds without sending your data anywhere ☆510 · Updated 3 weeks ago
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey… ☆760 · Updated 4 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,688 · Updated 4 months ago
- Super simple MLX (apple silicon) CLIP based photo similarity web app ☆484 · Updated last year
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, … ☆468 · Updated this week
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!) ☆663 · Updated 3 months ago
- Examples and guides for using the VLM Run API ☆281 · Updated this week
- Your filesystem as a vector database ☆385 · Updated 2 months ago
- Realtime AI speech with OpenAI Realtime API and Gemini Live API on Arduino ESP32 with Secure Websockets and Deno edge functions with >15 … ☆1,061 · Updated this week
- Semantic Image Search CLI tool. ☆554 · Updated 9 months ago
- A passive recording project allows you to have complete control over your data. Automatically take screenshots of all your screens, index… ☆1,273 · Updated last week
- Handwriting synthesis with Harfbuzz WASM. ☆479 · Updated 10 months ago