shirley-wu / cot_decoding
☆45Updated last year
Alternatives and similar repositories for cot_decoding:
Users that are interested in cot_decoding are comparing it to the libraries listed below
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆150Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆58Updated this week
- ☆53Updated 10 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆138Updated last month
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆120Updated last year
- ☆112Updated last week
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 11 months ago
- ☆48Updated 5 months ago
- ☆51Updated 5 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆221Updated 5 months ago
- A pipeline for LLM knowledge distillation☆100Updated last week
- ☆24Updated 6 months ago
- ☆33Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆49Updated last week
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling☆88Updated this week
- The first dense retrieval model that can be prompted like an LM☆69Updated 6 months ago
- Train your own SOTA deductive reasoning model☆83Updated last month
- ☆45Updated last month
- Evaluating LLMs with fewer examples☆148Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆105Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆144Updated 2 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated 10 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆84Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 2 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆142Updated 6 months ago
- ☆81Updated last month