shirley-wu / cot_decoding
☆45 · Updated last year
Alternatives and similar repositories for cot_decoding:
Users who are interested in cot_decoding are comparing it to the repositories listed below.
- Easy to use, High Performant Knowledge Distillation for LLMs ☆65 · Updated last week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆139 · Updated 2 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆150 · Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆107 · Updated 7 months ago
- ☆53 · Updated 11 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆91 · Updated 2 weeks ago
- entropix style sampling + GUI ☆26 · Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 · Updated 3 months ago
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling ☆90 · Updated 2 weeks ago
- The first dense retrieval model that can be prompted like an LM ☆71 · Updated 7 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆173 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆77 · Updated last year
- ☆48 · Updated 6 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆121 · Updated last year
- ☆44 · Updated 11 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆32 · Updated 2 weeks ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆143 · Updated 7 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆90 · Updated last month
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- ☆24 · Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite ☆90 · Updated last year
- ☆121 · Updated 10 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆22 · Updated last month
- ☆85 · Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language ☆145 · Updated 3 months ago
- accompanying material for sleep-time compute paper ☆73 · Updated this week
- FuseAI Project ☆85 · Updated 3 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati… ☆41 · Updated 2 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆54 · Updated 7 months ago
- ☆115 · Updated 3 weeks ago