zhijie-group / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆51Updated 2 weeks ago
Alternatives and similar repositories for SIFT:
Users that are interested in SIFT are comparing it to the libraries listed below
- ☆16Updated 2 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆62Updated 3 weeks ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆33Updated 2 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆46Updated 4 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆98Updated last week
- ☆42Updated last month
- ☆76Updated 2 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 5 months ago
- Open-Pandora: On-the-fly Control Video Generation☆32Updated 3 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- ☆36Updated this week
- Large Language Models Can Self-Improve in Long-context Reasoning☆67Updated 4 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆51Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆148Updated last week
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆95Updated 2 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆40Updated 2 weeks ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆74Updated 2 weeks ago
- Knowledge Unlearning for Large Language Models☆20Updated 2 weeks ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆64Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆101Updated this week
- ☆71Updated last week
- ☆79Updated last week
- Code for Heima☆37Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated 3 weeks ago
- ☆83Updated 2 weeks ago
- ☆70Updated 2 weeks ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆23Updated 6 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆54Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆55Updated last month