terminal-agent / reptileLinks
π» Terminal-Agent with Human-in-the-Loop Learning
β34Updated 2 weeks ago
Alternatives and similar repositories for reptile
Users that are interested in reptile are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejectionβ55Updated last year
- Use the tokenizer in parallel to achieve superior accelerationβ20Updated last year
- β20Updated 3 months ago
- The code and data for the paper JiuZhang3.0β49Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiwβ¦β31Updated last year
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAβ¦β30Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMsβ81Updated 2 years ago
- β50Updated 5 months ago
- Long Context Extension and Generalization in LLMsβ62Updated last year
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"β47Updated 6 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ120Updated 8 months ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verificationβ73Updated 6 months ago
- The rule-based evaluation subset and code implementation of Omni-MATHβ26Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":β44Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.β64Updated last year
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"β22Updated 10 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generationβ33Updated 3 months ago
- β34Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"β78Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"β75Updated 8 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.β52Updated last year
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β120Updated last year
- LongAttn οΌSelecting Long-context Training Data via Token-level Attentionβ15Updated 6 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ70Updated 6 months ago
- β30Updated last year
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradientβ64Updated 5 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scalingβ182Updated 6 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"β110Updated 3 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracyβ77Updated 3 months ago
- instruction-following benchmark for large reasoning modelsβ44Updated 5 months ago