Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition
☆31May 14, 2025Updated last year
Alternatives and similar repositories for SICOG
Users that are interested in SICOG are comparing it to the libraries listed below.
- [ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback☆69Jun 3, 2026Updated 2 weeks ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- ☆11Oct 2, 2024Updated last year
- ☆37Sep 28, 2022Updated 3 years ago
- 150 books on information security (continuously updated)☆15Feb 16, 2023Updated 3 years ago
- [EMNLP 2025] Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations☆44Jan 14, 2026Updated 5 months ago
- [TKDE 2024] Robust Knowledge Adaptation for Dynamic Graph Neural Networks☆11Apr 11, 2024Updated 2 years ago
- [CVPR 2025 Highlight] XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆57Oct 31, 2025Updated 7 months ago
- The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…☆31Apr 9, 2026Updated 2 months ago
- One-click setup of a nextcloud personal cloud drive with docker-compose☆12Nov 26, 2021Updated 4 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.☆28Nov 18, 2025Updated 7 months ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆11Jul 29, 2024Updated last year
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆94Jul 13, 2025Updated 11 months ago
- ☆45Jan 4, 2026Updated 5 months ago
- Official Implementation of CODE☆17Sep 26, 2024Updated last year
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆67Mar 11, 2026Updated 3 months ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- ☆11Oct 24, 2024Updated last year
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling☆152Mar 31, 2026Updated 2 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆38Jul 11, 2024Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆70Jan 28, 2026Updated 4 months ago
- ☆101May 16, 2024Updated 2 years ago
- One-click setup of a WordPress personal blog with docker-compose☆21Nov 26, 2021Updated 4 years ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆25Jan 31, 2025Updated last year
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆15Mar 7, 2026Updated 3 months ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 8 months ago
- ☆43Jun 6, 2025Updated last year
- [TMM 2023] Official Implementation of "Bidirectional Translation Between UHD-HDR and HD-SDR Videos"☆10Aug 8, 2024Updated last year
- Sphinx theme for NLTK☆16Nov 7, 2021Updated 4 years ago
- Official implementation for "Diffusion Instruction Tuning"☆35Apr 1, 2026Updated 2 months ago
- ☆67Feb 27, 2026Updated 3 months ago
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug…☆24May 16, 2025Updated last year
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated last year
- ☆24Aug 17, 2024Updated last year
- Simulating a 2D Hovering SpaceX Grasshopper with a Thrust Vector Control engine.☆12Dec 28, 2015Updated 10 years ago
- Adds ITU BT.2100 PQ signaling to PNG images☆15Sep 11, 2017Updated 8 years ago
- DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation☆93Feb 26, 2026Updated 3 months ago
- ☆19Apr 24, 2021Updated 5 years ago