OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
☆29Feb 4, 2026Updated last month
Alternatives and similar repositories for OCRVerse
Users that are interested in OCRVerse are comparing it to the libraries listed below.
- ☆61Aug 5, 2025Updated 7 months ago
- ☆66Sep 6, 2025Updated 6 months ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆60Feb 10, 2026Updated last month
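- A legal document processing and judgment prediction model based on GLM-9B, jointly designed by China University of Political Science and Law and Beihang University☆29Sep 6, 2024Updated last year
- CodeAlert - Red Alert: Rise of Artificial Intelligence!!! Welcome to the first AI agent Red Alert game hackathon☆40Oct 16, 2025Updated 5 months ago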
- ☆49Mar 20, 2026Updated last week
- PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows☆44Jul 7, 2025Updated 8 months ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆38Sep 27, 2023Updated 2 years ago
- Structured workflow prompts for AI-assisted software development. Guides engineers through PRD creation, architecture design, task manage…☆36Jul 13, 2025Updated 8 months ago
- ☆69Oct 10, 2025Updated 5 months ago
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- ☆50Jan 23, 2023Updated 3 years ago
- Gemini API for OCR☆15Nov 17, 2025Updated 4 months ago
- 🤡 An up-to-date & curated list of awesome KBQA papers, methods & resources.☆10Jul 14, 2022Updated 3 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 2 years ago
- [IJCV 2022] Information-Theoretic Odometry Learning☆16Apr 19, 2023Updated 2 years ago
- ☆48Feb 7, 2025Updated last year
- Notes on Linux experience☆60Feb 21, 2022Updated 4 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Feb 3, 2023Updated 3 years ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- ☆18Jun 7, 2024Updated last year
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆19Jun 13, 2024Updated last year
- Scripts used to manage and automate Travis CI builds for Matomo and plugins.☆29Nov 6, 2024Updated last year
- An extension for the GitHub CLI application that displays your current contribution graph☆14Aug 3, 2021Updated 4 years ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆141Oct 10, 2025Updated 5 months ago
- Reference implementations for visual descriptors of IMGpedia☆23Oct 20, 2022Updated 3 years ago
- 3D-printed parallel gripper compatible with Feetech STS3215 and Waveshare ST3215 servos. Ready-to-use solution for the SO-ARM 100/SO-ARM …☆76Feb 23, 2026Updated last month
- LinVT: Empower Your Image-level Large Language Model to Understand Videos☆84Dec 30, 2024Updated last year
- An out-of-the-box AI tool for writing bid documents☆200Feb 4, 2026Updated last month
- Patch console methods to intercept output☆18Jul 27, 2022Updated 3 years ago
- Various mechanized proof files for fun.☆13Mar 9, 2026Updated 2 weeks ago
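- Designed for AI PCs and aimed at breaking the limits of traditional education: an integrated local large-model application platform offering personalized learning and teaching features such as an AI teaching assistant, interactive Q&A, intelligent question generation, syllabus and mind-map generation, and a code assistant, creating a new teaching and learning experience for students and teachers.☆75Apr 15, 2024Updated last year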
- ☆31Jan 17, 2026Updated 2 months ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 2 months ago
- Develop an AIGC application marketplace based on the Dify API, building the best application services for AIGC.☆95Feb 15, 2025Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago