weihao-bo / ViLoMemLinks
ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
☆45Updated 2 months ago
Alternatives and similar repositories for ViLoMem
Users that are interested in ViLoMem are comparing it to the libraries listed below
Sorting:
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated last week
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆147Updated 6 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 8 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Updated last week
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆65Updated 3 weeks ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Updated 2 months ago
- ☆43Updated 8 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆136Updated 5 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆110Updated 10 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆129Updated 6 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆67Updated 11 months ago
- The code and data of We-Math 2.0.☆164Updated 5 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization☆99Updated 2 weeks ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆251Updated 4 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆54Updated 9 months ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆55Updated last month
- Open-source Agentic RL for LLMs — RLAnything & DemyAgent☆223Updated last week
- ☆103Updated last month
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆36Updated 11 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Updated 11 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆29Updated 4 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- MemEvolve & EvolveLab☆158Updated last month
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Updated 2 weeks ago
- ☆144Updated 9 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Updated 11 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 8 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago