uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆22Updated 6 months ago
Alternatives and similar repositories for COPS
Users that are interested in COPS are comparing it to the libraries listed below
Sorting:
- ☆37Updated 7 months ago
- ☆24Updated 3 weeks ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆42Updated 5 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆35Updated 2 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 11 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆32Updated 3 weeks ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆45Updated 3 months ago
- ☆95Updated last month
- Exploration of automated dataset selection approaches at large scales.☆40Updated 2 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆41Updated 3 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆53Updated last year
- ☆27Updated 3 weeks ago
- ☆17Updated 4 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆77Updated this week
- FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models☆43Updated 3 weeks ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆112Updated last year
- Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)☆63Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆84Updated 7 months ago
- ☆78Updated 8 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆46Updated 2 months ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆14Updated 7 months ago
- [Preprint] A Generalizable and Purely Unsupervised Self-Training Framework☆56Updated 3 weeks ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 7 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆14Updated last month
- Long Context Extension and Generalization in LLMs☆54Updated 7 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆22Updated last week