ses4255 / Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆18 Updated this week
Alternatives and similar repositories for Versatile-OCR-Program:
Users who are interested in Versatile-OCR-Program are comparing it to the repositories listed below.
- 360M model running in the browser on WebGPU ☆21 Updated 7 months ago
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!) ☆17 Updated last month
- Master PDF Summarization with Google Bard ☆12 Updated last year
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m… ☆22 Updated 5 months ago
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 Updated 6 months ago
- convert natural language into technical diagrams ☆12 Updated 3 months ago
- An interface for llama.cpp, ChatGPT, and Gemini ☆25 Updated last week
- ☆12 Updated 2 weeks ago
- Streamable multi-format serialization with schema ☆22 Updated 3 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆25 Updated last month
- ☆27 Updated 6 months ago
- ☆25 Updated 7 months ago
- ☆22 Updated 5 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations. ☆19 Updated 9 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆33 Updated last week
- Piazza-Updater automates updates to a Weaviate database with real-time vectorial data. By continuously searching the internet and integra… ☆30 Updated 4 months ago
- Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whethe… ☆43 Updated 7 months ago
- ☆12 Updated 7 months ago
- Dashb.io - Minimalist's Dashboard and Widgets. ☆14 Updated last year
- An autonomous Mall assistant that can answer user queries using tools. Powered by LLMs. ☆14 Updated last year
- Gateway and load balancer to your LLM inference endpoints ☆21 Updated 5 months ago
- Use GPTparser with your OpenAI API to scrape & parse files into structured JSON files. ☆13 Updated last year
- Search a JSON path and get the value fast ☆21 Updated last month
- ☆27 Updated 11 months ago
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source. ☆28 Updated 4 months ago
- Documentation for the Krixik Python client. ☆38 Updated 4 months ago
- Dockerized FastAPI wrapper around the recognize-anything image recognition models ☆25 Updated last year
- 🛤️ Pathik - High-Performance Web Crawler ⚡ ☆25 Updated last week
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆33 Updated 11 months ago