menloresearch / visual-thinkerLinks
☆158Updated last month
Alternatives and similar repositories for visual-thinker
Users that are interested in visual-thinker are comparing it to the libraries listed below
Sorting:
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 5 months ago
- Build your own visual reasoning model☆385Updated last week
- Tina: Tiny Reasoning Models via LoRA☆260Updated 3 weeks ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆290Updated this week
- Code for the paper: "Learning to Reason without External Rewards"☆295Updated last week
- minimal GRPO implementation from scratch☆90Updated 3 months ago
- Exploring Applications of GRPO☆230Updated last month
- ☆207Updated 4 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆218Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆68Updated 3 months ago
- From scratch implementation of a vision language model in pure PyTorch☆222Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆339Updated 6 months ago
- ☆178Updated 6 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆315Updated 7 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆175Updated this week
- ☆88Updated last week
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆53Updated 3 weeks ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆311Updated 8 months ago
- A simple unified framework for evaluating LLMs☆219Updated 2 months ago
- [ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs☆302Updated last month
- Scaling RL on advanced reasoning models☆100Updated this week
- ☆56Updated 7 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆238Updated last month
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆476Updated 3 weeks ago
- Code for ExploreTom☆84Updated 6 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆57Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆219Updated last month
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆29Updated 4 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆108Updated 2 months ago