locuslab / scaling_laws_data_filtering
☆60Updated 5 months ago
Related projects: ⓘ
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 8 months ago
- A Closer Look into Mixture-of-Experts in Large Language Models☆38Updated last month
- ☆45Updated 7 months ago
- 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆52Updated 3 weeks ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆48Updated last week
- Code implementation of synthetic continued pretraining☆13Updated this week
- ☆87Updated 4 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆63Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly"☆121Updated 3 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore, paper coming soon☆18Updated this week
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models☆42Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆39Updated 7 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆49Updated last month
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆42Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆81Updated 2 weeks ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- ☆22Updated 3 months ago
- Self-Alignment with Principle-Following Reward Models☆144Updated 6 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆19Updated 3 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆39Updated 6 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆36Updated 2 months ago
- ☆13Updated last month
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆54Updated 6 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆59Updated 7 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆65Updated last month
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆24Updated 2 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆72Updated 6 months ago
- 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training☆79Updated this week
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆20Updated 6 months ago