FanbinLu / STEVE-R1
R1-like Computer-use Agent
☆55Updated this week
Alternatives and similar repositories for STEVE-R1:
Users that are interested in STEVE-R1 are comparing it to the libraries listed below
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆133Updated last week
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆63Updated 8 months ago
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing☆123Updated last week
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy☆64Updated 2 months ago
- Improving Generalist Model with Domain-Specific Experts☆85Updated 2 months ago
- ☆67Updated last week
- Official implementation of paper "Multi-Level Collaboration in Model Merging"☆40Updated 2 weeks ago
- ☆51Updated 2 weeks ago
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆44Updated last month
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models☆100Updated 8 months ago
- Source code for ICLR2025 paper "NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation".☆71Updated 3 weeks ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,077Updated last week
- code based for rectified flow☆105Updated last month
- Official repository of MMGenBench☆119Updated 2 weeks ago
- Codebase for Iterative DPO Using Rule-based Rewards☆227Updated last month
- A Tiny structure of pytorch for learning;☆56Updated 8 months ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆170Updated 4 months ago
- Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆15Updated last month
- Efficient DiT architecture for text2any tasks, ICLR2025☆399Updated last month
- ☆207Updated last month
- Code Efficiency Benchmark☆76Updated 2 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆36Updated 2 months ago
- [CVPR 2025] The official code for "Olympus: A Universal Task Router for Computer Vision Tasks"☆53Updated 3 weeks ago
- [NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆100Updated 6 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆43Updated 8 months ago
- JittorGeometric is a Jittor-based graph machine learning library.☆152Updated this week
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆170Updated this week
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆57Updated 3 months ago
- 🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies☆189Updated 2 weeks ago
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆265Updated 3 months ago