orasik / parsevisionLinks
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.
☆85Updated last year
Alternatives and similar repositories for parsevision
Users that are interested in parsevision are comparing it to the libraries listed below
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆312Updated last month
- LLM plugin providing access to Mistral models using the Mistral API☆197Updated 3 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆280Updated 3 weeks ago
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Convert URLs into LLM-friendly markdown chunks☆65Updated last year
- ai for jq☆244Updated last year
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆176Updated last year
- See Through Your Models☆401Updated 4 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆128Updated 2 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.☆175Updated last year
- Replace OpenAI with Llama.cpp Automagically.☆325Updated last year
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆222Updated 10 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 8 months ago
- ☆39Updated 11 months ago
- Enforce structured output from LLMs 100% of the time☆248Updated last year
- Structured Output Is All You Need!☆59Updated last year
- Action library for AI Agent☆225Updated 7 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆196Updated 8 months ago
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document.☆123Updated 8 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆225Updated 10 months ago
- ☆113Updated 4 months ago
- ☆121Updated last month
- Implement recursion using English as the programming language and an LLM as the runtime.☆237Updated 2 years ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆89Updated 11 months ago
- Tools for LLM agents.☆60Updated 10 months ago
- OCR Benchmark☆587Updated 2 weeks ago
- Embedding models from Jina AI☆65Updated last year
- GoalChain for goal-orientated LLM conversation flows☆70Updated 11 months ago
- Applying the ideas of Deepseek R1 to computer use☆216Updated 9 months ago