InternScience/OmniCaptioner

Official Repository of OmniCaptioner

☆168

Alternatives and similar repositories for OmniCaptioner

Users who are interested in OmniCaptioner are comparing it to the repositories listed below.

InternScience / MME-Reasoning
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45 · Jun 17, 2025 · Updated last year
InternScience / Chimera
(ICCV-2025 Official Code) Improving Generalist Model with Domain-Specific Experts
☆87 · Oct 29, 2025 · Updated 9 months ago
InternScience / InternAgent
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
☆1,388 · Jun 10, 2026 · Updated last month
ch3cook-fdu / Vote2Cap-DETR
[T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning met…
☆104 · Aug 17, 2024 · Updated last year
pockebot / openpocket
🐹 An Intelligent Phone That Never Sleeps.
☆836 · Jul 9, 2026 · Updated 2 weeks ago
InternScience / Dolphin
(ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback
☆44 · Jun 24, 2025 · Updated last year
SomeB1oody / RustyML
A high-performance machine learning library in pure Rust, offering statistical utilities, ML algorithms and neural networks.
☆341 · Updated this week
InternScience / AdaptiveDiffusion
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
☆73 · Jan 22, 2025 · Updated last year
Peyton-Chen / RegionE
[ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing"
☆109 · Feb 3, 2026 · Updated 5 months ago
devchilll / scope
Configurable Multi-layered AI Agentic Safety Framework
☆330 · Jun 27, 2026 · Updated last month
InternScience / SurveyForge
(ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…
☆333 · Aug 27, 2025 · Updated 11 months ago
Sibo-Zhao / OpenPraxis
An OpenClaw-native knowledge retention skill that turns raw inputs into structured practice so you can use what you know, not just store …
☆415 · Mar 10, 2026 · Updated 4 months ago
leoli646 / Adapter-X
Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision
☆11 · Jul 22, 2024 · Updated 2 years ago
InternScience / MLEvolve
MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive sea…
☆406 · Jul 14, 2026 · Updated 2 weeks ago
agentic-in / elephant-agent
Personal-Model First Self Evolving AI Agent 🐘
☆578 · Jun 1, 2026 · Updated last month
agentic-in / inferoa
Inference-native Tokenmaxxing Agent Harness for Loop Engineering
☆486 · Jun 18, 2026 · Updated last month
Alpha-Innovator / DocParser
☆18 · Jan 13, 2025 · Updated last year
ConchFeng / conch-cpp
🚀 Production-ready C++ framework: Build games, trading systems & network apps in minutes. Qt6 | libuv | C++23 | Docker Ready
☆25 · Feb 12, 2026 · Updated 5 months ago
aivolcano / CiteScan
Scan the Hallucination Citation of Academic papers. Convert second-hand citation to official version
☆233 · Apr 1, 2026 · Updated 3 months ago
vul337 / IDFuzz
Official code repository for the research paper IDFuzz: Intelligent Directed Grey-box Fuzzing (USENIX Security 2025)
☆92 · Jan 31, 2026 · Updated 5 months ago
duoan / mega-data-factory
🏭 Mega Scale Multimodal DataPipeline for SOTA Foundation Models
☆370 · May 12, 2026 · Updated 2 months ago
InternScience / Agents-A1
Scaling the Horizon, Not the Parameters
☆523 · Jul 16, 2026 · Updated last week
Sunshine-Ye / Beta-DARTS
Official implementation of β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search (CVPR22 oral).
☆86 · Mar 29, 2022 · Updated 4 years ago
InternScience / GeoX
[ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
☆49 · Jan 25, 2025 · Updated last year
InternScience / TrustGeoGen
Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"
☆23 · Sep 1, 2025 · Updated 10 months ago
ShawnTan86 / TokenCarve
This is the open-source code for TokenCarve.
☆25 · Jan 23, 2026 · Updated 6 months ago
zjyaccount / MTMSD
Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification
☆16 · Sep 7, 2024 · Updated last year
ReflexioAI / claude-smart
Turns corrections into Preferences, Project-specific skills, and Shared skills for Claude Code, Codex, and OpenCode.
☆757 · Updated this week
beita6969 / ScienceClaw
🔬🦞 A self-evolving AI research colleague for scientists. 285 skills, zero hallucination, persistent memory.
☆869 · Jun 8, 2026 · Updated last month
HankYe / Once-for-Both
[CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
☆16 · Jul 1, 2024 · Updated 2 years ago
Zhengsh123 / FREE-Merging
The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)
☆16 · Jun 26, 2025 · Updated last year
Hunyuan-PromptEnhancer / PromptEnhancer
[CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
☆3,739 · Jun 10, 2026 · Updated last month
lxtGH / DenseWorld-1M
Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"
☆129 · Oct 2, 2025 · Updated 9 months ago
Peyton-Chen / Sparse-vDiT
The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …
☆52 · Jun 6, 2025 · Updated last year
SKYLENAGE-AI / DeepVision-103K
Codebase for DeepVision-103K
☆22 · Feb 21, 2026 · Updated 5 months ago
JiakangYuan / HelixFormer
☆16 · Nov 12, 2022 · Updated 3 years ago
BOBrown / SegNet_Source
The code of source-only training for our method
☆11 · Mar 3, 2022 · Updated 4 years ago
ReflexioAI / reflexio
Make your agents improve themselves. Reflexio is an AI agent self-improvement harness that enables your AI agents to continuously learn f…
☆324 · Updated this week
Alpha-VLLM / Lumina-Accessory
☆119 · Apr 25, 2025 · Updated last year