OpenMOSS / Lorsa
☆17Updated last week
Alternatives and similar repositories for Lorsa:
Users that are interested in Lorsa are comparing it to the libraries listed below
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- ☆31Updated 4 months ago
- ☆20Updated 4 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆27Updated 7 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated 2 months ago
- ☆25Updated 7 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆12Updated 3 weeks ago
- ☆78Updated 8 months ago
- ☆25Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- ☆16Updated 2 months ago
- Lottery Ticket Adaptation☆39Updated 5 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- ☆48Updated 6 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆9Updated last month
- ☆27Updated 3 weeks ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆45Updated 2 weeks ago
- The repository contains code for Adaptive Data Optimization☆24Updated 5 months ago
- ☆17Updated 4 months ago
- ☆33Updated 10 months ago
- Official Code Release for "Training a Generally Curious Agent"☆20Updated last month
- Official repo of paper LM2☆39Updated 2 months ago
- Knowledge Unlearning for Large Language Models☆25Updated this week
- Exploration of automated dataset selection approaches at large scales.☆39Updated 2 months ago
- Simple repository for training small reasoning models☆27Updated 3 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 2 months ago
- The original Shared Recurrent Memory Transformer implementation☆24Updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing☆25Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year