tsa18 / ConciseHintLinks
[Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆21Updated 4 months ago
Alternatives and similar repositories for ConciseHint
Users that are interested in ConciseHint are comparing it to the libraries listed below
Sorting:
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.☆115Updated 3 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Updated 11 months ago
- SSRL: Self-Search Reinforcement Learning☆206Updated 5 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆65Updated 2 weeks ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆89Updated 3 months ago
- ☆144Updated 9 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆116Updated last month
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆47Updated 6 months ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆110Updated 2 weeks ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Updated 7 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated 2 years ago
- ☆71Updated 3 months ago
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Updated 7 months ago
- ☆19Updated 8 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆104Updated last week
- Training Proactive and Personalized LLM Agents☆98Updated 2 weeks ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Updated last year
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆250Updated 4 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆53Updated last year
- ☆87Updated last year
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆40Updated 3 months ago
- Pivotal Token Search☆144Updated last month
- The official github repo for "Diffusion Language Models are Super Data Learners".☆220Updated 3 months ago
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.☆110Updated this week
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 9 months ago
- ☆50Updated 4 months ago
- Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)☆333Updated this week
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆52Updated 7 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Updated last week