john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34Updated 6 months ago
Alternatives and similar repositories for implicit-ins:
Users that are interested in implicit-ins are comparing it to the libraries listed below
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆31Updated last month
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆48Updated last month
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆50Updated 2 weeks ago
- ☆59Updated 7 months ago
- ☆16Updated 3 months ago
- ☆19Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆28Updated 3 weeks ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆36Updated last year
- ☆43Updated last month
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 weeks ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆36Updated 10 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- Exploration of automated dataset selection approaches at large scales.☆35Updated last month
- Long Context Extension and Generalization in LLMs☆53Updated 6 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 4 months ago
- ☆55Updated last month
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 5 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆41Updated 8 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆37Updated 3 weeks ago
- ☆64Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆41Updated 5 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆82Updated 6 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆20Updated last month
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆88Updated last month
- Knowledge Unlearning for Large Language Models☆25Updated last week
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆57Updated 11 months ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆25Updated 7 months ago
- ☆76Updated 2 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year