orasik / parsevisionLinks
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.
☆84Updated last year
Alternatives and similar repositories for parsevision
Users that are interested in parsevision are comparing it to the libraries listed below
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆308Updated this week
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆280Updated last month
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆198Updated 7 months ago
- Import unstructured data (text and images) into structured tables☆154Updated 5 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.☆174Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆54Updated 11 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23☆152Updated 4 months ago
- converts url content into JSON with a simple prefix☆71Updated last year
- LLM plugin providing access to Mistral models using the Mistral API☆196Updated 2 months ago
- Dabarqus is incredibly fast RAG that runs everywhere.☆60Updated 8 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆306Updated 3 weeks ago
- Data extraction with LLM on CPU☆68Updated last year
- Action library for AI Agent☆224Updated 6 months ago
- ☆113Updated 3 months ago
- Embedding models from Jina AI☆65Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 8 months ago
- Replace OpenAI with Llama.cpp Automagically.☆324Updated last year
- Deep Research for your internal data☆338Updated 4 months ago
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆176Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆66Updated 7 months ago
- The library for character-driven AI experiences.☆89Updated last year
- Automatically reformat any JSON into any schema with AI☆335Updated 6 months ago
- ☆92Updated last year
- A non-official CLI for Llama Index Parser☆215Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆87Updated 10 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆217Updated 11 months ago
- Some tough questions to test new models.☆28Updated last year
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆170Updated last year
- Fully neural approach for text chunking☆372Updated 5 months ago