☆273Mar 4, 2026Updated 2 months ago
Alternatives and similar repositories for FireRed-OCR
Users that are interested in FireRed-OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.☆123Apr 7, 2026Updated last month
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆23Oct 28, 2025Updated 6 months ago
- ☆11Feb 20, 2025Updated last year
- ComfyUI custom nodes for AudioX — generate sound effects and background music from video, powered by HKUSTAudio/AudioX.☆38Mar 12, 2026Updated 2 months ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆33Nov 21, 2025Updated 6 months ago
- Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"☆62Mar 17, 2026Updated 2 months ago
- ☆25Apr 6, 2026Updated last month
- [ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem☆22Apr 7, 2026Updated last month
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated last year
- ☆26Apr 21, 2026Updated last month
- Lets make video diffusion practical! Adding Start and end frame control to Framepack☆32Apr 20, 2025Updated last year
- Codex Intel Rebuilder - Run Codex Desktop on Intel Macs☆65Feb 18, 2026Updated 3 months ago
- ☆115Dec 28, 2025Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- High-performance Qwen3-TTS implementation | Instruction-driven · Zero-shot voice cloning · Streaming · RTF 0.55☆64Apr 4, 2026Updated last month
- 在 Mirai Console 中使用MCL管理包和其他高级功能☆10Nov 13, 2022Updated 3 years ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated 2 weeks ago
- A test web browser using rust and egui☆13Apr 13, 2025Updated last year
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆25Apr 9, 2026Updated last month
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- Graph view for your Typst notes with preview syncing☆14Dec 11, 2025Updated 5 months ago
- Using OpenVINO to speed up inference of PaddleOCR-VL model☆35Apr 23, 2026Updated last month
- remove bg☆13Feb 7, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization☆10Jul 13, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆38Nov 11, 2025Updated 6 months ago
- 不用搭建环境,解压即用,4G显存可用☆11Mar 1, 2025Updated last year
- This project demonstrates a real-time delivery location tracking system similar to Zomato/Swiggy, built using Spring Boot and Apache Kafk…☆28Dec 4, 2025Updated 5 months ago
- Speech AI training and inference tools☆36Jun 25, 2023Updated 2 years ago
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆72May 13, 2026Updated 2 weeks ago
- Safer rust wrapper over mnn☆23Mar 5, 2026Updated 2 months ago
- ComfyUI-InfiniteTalk-MultiImage☆74Jan 5, 2026Updated 4 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- Step-Audio-TTS-3B demo☆13Feb 25, 2025Updated last year
- ide-cap-chan is a utility for batch image captioning with natural language using various VL models☆14May 8, 2026Updated 2 weeks ago
- ☆81Apr 3, 2026Updated last month
- TAPFormer is a model that fuses images and events for high-frame-rate tracking any point (pixel) .☆43Apr 1, 2026Updated last month
- KToon is a Kotlin Multiplatform serialization library implementing the TOON format (Token-Oriented Object Notation). Think of it as JSON'…☆52Mar 23, 2026Updated 2 months ago
- Interactive TikZ graph generator for creating and customizing complex diagrams with nodes, edges, and text. Features include multi-graph …☆24Feb 16, 2026Updated 3 months ago