☆22Jul 9, 2025Updated 9 months ago
Alternatives and similar repositories for VisuLogic-Train
Users that are interested in VisuLogic-Train are comparing it to the libraries listed below.
- ☆37Aug 18, 2025Updated 8 months ago
- ☆18Nov 30, 2025Updated 5 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆90Sep 23, 2025Updated 7 months ago
- ☆51Oct 28, 2024Updated last year
- PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images☆16Dec 4, 2024Updated last year
- ☆11Dec 20, 2024Updated last year
- ☆12Jul 4, 2024Updated last year
- ☆15Dec 9, 2024Updated last year
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆34Aug 5, 2025Updated 8 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 6 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆187Jun 5, 2025Updated 10 months ago
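- First Social Group Intelligence Algorithm Competition [Task 1: Fake Account Detection in Social Media Public Opinion], third-place (0.8248) solution☆12May 30, 2024Updated last year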
- Repository containing code for CoRL 2020 paper on "Learning Object Manipulation Skills via Approximate State Estimation from Real Videos"☆17Dec 15, 2021Updated 4 years ago
- [NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning☆267Oct 18, 2025Updated 6 months ago
- A collection of research papers related to Natural Language Reasoning☆10May 27, 2022Updated 3 years ago
- [TCSVT'22] Official Implementation of STI-VQA☆12Oct 18, 2023Updated 2 years ago
- A collection of awesome think with videos papers.☆98Dec 1, 2025Updated 4 months ago
- [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm…☆12Dec 5, 2025Updated 4 months ago
- The code of our paper: Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling.☆17Jan 10, 2024Updated 2 years ago
- [ACL 2025] RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis☆25Aug 8, 2025Updated 8 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- [ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"☆10Aug 2, 2024Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 5 months ago
- "Blind Image Quality Assessment for Pathological Microscopic Image under Screen and Immersion Scenarios"☆15Aug 29, 2023Updated 2 years ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- Official implementation for "CONVIQT: Contrastive Video Quality Estimator"☆25Jun 14, 2022Updated 3 years ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated 2 months ago
- ☆68Feb 4, 2026Updated 2 months ago
- This is the code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024)☆12Aug 27, 2024Updated last year
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 23, 2026Updated last week
- The official implementation of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 9 months ago
- ☆15Jul 17, 2025Updated 9 months ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- ☆10Apr 13, 2020Updated 6 years ago
- code for G2LTraj☆19Dec 23, 2024Updated last year
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆57Mar 16, 2026Updated last month
- ☆15Apr 11, 2026Updated 2 weeks ago