Official Repository of OmniCaptioner
☆168Apr 23, 2025Updated last year
Alternatives and similar repositories for OmniCaptioner
Users that are interested in OmniCaptioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆87Oct 29, 2025Updated 6 months ago
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated 10 months ago
- InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery☆1,289Apr 29, 2026Updated last week
- [T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning met…☆104Aug 17, 2024Updated last year
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy☆73Jan 22, 2025Updated last year
- This is the open-source code for TokenCarve.☆26Jan 23, 2026Updated 3 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 8 months ago
- ☆24Aug 9, 2025Updated 8 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 10 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆43Jun 24, 2025Updated 10 months ago
- [ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing"☆102Feb 3, 2026Updated 3 months ago
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"☆129Oct 2, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆27Mar 3, 2025Updated last year
- ☆11Nov 12, 2018Updated 7 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 7 months ago
- Multimodal Document Intelligence Platform☆41Apr 10, 2026Updated 3 weeks ago
- ☆13Jan 9, 2018Updated 8 years ago
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…☆330Aug 27, 2025Updated 8 months ago
- Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)☆44Apr 21, 2026Updated 2 weeks ago
- Powerful Python-based tool for scraping Tweets, user data, and trends from Twitter without needing API access or authentication, offering…☆130Jan 4, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Comprehensive AI-powered urban development optimization platform that combines deep learning and reinforcement learning for data-driven b…☆35Nov 26, 2025Updated 5 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 8 months ago
- [Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,196Oct 16, 2025Updated 6 months ago
- Two languages, one purpose: turning words into geometry.☆160Dec 31, 2025Updated 4 months ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 months ago
- A SAR domain-specific language defined in CXX & Python. Keywords: AST, MLIR, LLVM, FPGA HLS. Currently under development...☆17Mar 28, 2026Updated last month
- ☆73May 17, 2025Updated 11 months ago
- [arXiv 2024] Is Oracle Pruning the True Oracle?☆26Jan 10, 2025Updated last year
- Your codebase was probably AI-generated. Get a better handle on it. Noodles creates interactive diagrams that visualize how your code act…☆1,270Mar 13, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Jun 18, 2025Updated 10 months ago
- The accepted paper for cvpr2025.☆56Dec 9, 2025Updated 4 months ago
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,501Apr 9, 2026Updated 3 weeks ago
- 🏭 Mega Scale Multimodal DataPipeline for SOTA Foundation Models☆362Mar 25, 2026Updated last month
- Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach, CVPR 2024☆25Jul 25, 2024Updated last year
- [ AAAI26 ]: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation”☆19Mar 26, 2026Updated last month
- Source code for LDPTrace: Locally Differentially Private Trajectory Synthesis. VLDB 2023.☆101Nov 13, 2023Updated 2 years ago