menloresearch / visual-thinker
☆131Updated last month
Alternatives and similar repositories for visual-thinker:
Users who are interested in visual-thinker are comparing it to the libraries listed below:
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆168Updated 2 months ago
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- Build your own visual reasoning model☆312Updated last week
- ☆185Updated last month
- ☆82Updated 3 weeks ago
- Rethinking Step-by-step Visual Reasoning in LLMs☆279Updated 2 months ago
- From scratch implementation of a vision language model in pure PyTorch☆207Updated 10 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆193Updated this week
- Cerule - A Tiny Mighty Vision Model☆67Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆138Updated last month
- Easy to use, Highly Performant Knowledge Distillation for LLMs☆55Updated this week
- Video+code lecture on building nanoGPT from scratch☆66Updated 9 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆310Updated 3 months ago
- working implementation of deepseek MLA☆38Updated 2 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆163Updated this week
- EvaByte: Efficient Byte-level Language Models at Scale☆85Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆83Updated last week
- An automated tool for discovering insights from research paper corpora☆137Updated 9 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆75Updated 3 weeks ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆83Updated last week
- minimal GRPO implementation from scratch☆62Updated 2 weeks ago
- Code for ExploreTom☆78Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆52Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 3 weeks ago
- ☆56Updated 4 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆142Updated last month
- ⚖️ Awesome LLM Judges ⚖️☆86Updated last month
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆105Updated last week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆131Updated last month
- Clue inspired puzzles for testing LLM deduction abilities☆31Updated this week