ikantkode / qwen2.5VLM-OCRLinks
A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.
☆20Updated last week
Alternatives and similar repositories for qwen2.5VLM-OCR
Users that are interested in qwen2.5VLM-OCR are comparing it to the libraries listed below
Sorting:
- PDF to MD UI - User Interface to Convert PDF to MarkDown for LLM and RAG☆40Updated 2 weeks ago
- Datu Core AI Analyst open-source☆23Updated last week
- “A locally hosted, memory-aware AI microservice—designed for cultural continuity, decentralized intelligence, and ethical autonomy.”☆27Updated 3 months ago
- Agent MCP for ffmpeg☆202Updated 2 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆29Updated 6 months ago
- AIVA (AI Virtual Assistant) Mock Interviews is an interactive platform that simulates real interview scenarios using AI-generated questio…☆59Updated 5 months ago
- An API for VoiceCraft.☆25Updated last year
- Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface☆24Updated last week
- generate informative knowledge graph from text using open source models , ollama☆21Updated last week
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆67Updated 3 months ago
- Exploring retrieval systems for language models☆14Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 10 months ago
- Makes a improved prompts from a basic prompt☆42Updated 2 months ago
- CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate resear…☆217Updated last week
- deep hermes, but decides how to respond based on its OWN decision, no need for system prompts.☆40Updated 4 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆17Updated 4 months ago
- A python script designed to translate large amounts of text with an LLM and the Ollama API☆108Updated last month
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆113Updated last week
- Docling with Ollama - RAG on Local Files with Local Models☆73Updated 7 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆28Updated 5 months ago
- Long-Term Memory & Context Management for LLMs☆66Updated last week
- A real-time shared memory layer for multi-agent LLM systems.☆47Updated 2 months ago
- Open Source Study Assistant☆43Updated last year
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆100Updated 9 months ago
- Limopola is an AI platform that allows you to communicate with a wide range of AI models. It features autonomous agents, model-agnostic r…☆106Updated last week
- Add AI capabilities to your file system using Ollama, Groq, OpenAi and other's api☆199Updated 7 months ago
- Finally, an open source Youtube Summarizer extension☆75Updated 4 months ago
- the npc shell built with npcpy☆28Updated last week
- Link you Ollama models to LM-Studio☆141Updated last year
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆112Updated 3 weeks ago