google-research / inksight
☆643Updated last month
Alternatives and similar repositories for inksight:
Users that are interested in inksight are comparing it to the libraries listed below
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆813Updated 4 months ago
- Visualise your CSV files in seconds without sending your data anywhere☆465Updated 3 weeks ago
- Semantic Image Search CLI tool.☆541Updated 4 months ago
- Official implementation of the paper "Watermark Anything with Localized Messages"☆947Updated last week
- A superfast full-text search application☆984Updated last month
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,520Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆789Updated last month
- 🔄 CLI to convert Webpages to PDFs 🚀☆1,195Updated this week
- 5ire is a cross-platform desktop AI assistant, MCP client. It compatible with major service providers, supports local knowledge base and…☆693Updated this week
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)☆614Updated this week
- Math OCR model that outputs LaTeX and markdown☆1,008Updated this week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆743Updated 5 months ago
- ☆433Updated 4 months ago
- ☆1,159Updated 4 months ago
- Private and on-device speech recognition keyboard and service for Android.☆500Updated 2 weeks ago
- Local realtime voice AI☆2,188Updated last week
- Fast, reliable, and free document scanner app for iPhone☆944Updated 4 months ago
- Generate audiobooks from e-books☆1,183Updated this week
- Detect and extract tables to markdown and csv☆723Updated last week
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆216Updated last month
- first base model for full-duplex conversational audio☆1,691Updated 3 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,773Updated 2 months ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆241Updated this week
- Handwriting synthesis with Harfbuzz WASM.☆435Updated 5 months ago
- An Open Source implementation of Notebook LM with more flexibility and features☆939Updated 2 months ago
- ☆976Updated 2 months ago
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.☆878Updated 3 weeks ago
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…☆636Updated 3 weeks ago
- Gantt charts with only Javascript, CSS, HTML and YAML☆210Updated 2 weeks ago