formll / resolving-scaling-law-discrepanciesView external linksLinks
☆20Nov 4, 2025Updated 3 months ago
Alternatives and similar repositories for resolving-scaling-law-discrepancies
Users that are interested in resolving-scaling-law-discrepancies are comparing it to the libraries listed below
Sorting:
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Sep 26, 2024Updated last year
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆33Sep 28, 2025Updated 4 months ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated 10 months ago
- ☆10Mar 6, 2022Updated 3 years ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Oct 15, 2025Updated 4 months ago
- ☆11Mar 13, 2023Updated 2 years ago
- ☆14Dec 25, 2024Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆187Jan 19, 2026Updated 3 weeks ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated last year
- csl: PyTorch-based Constrained Learning☆12Jun 1, 2022Updated 3 years ago
- ☆13Jul 2, 2025Updated 7 months ago
- Analysing ML conference data and plotting interesting statistics.☆11Aug 4, 2023Updated 2 years ago
- ☆33Jan 25, 2026Updated 3 weeks ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- Post-processing for fair classification☆16Jun 30, 2025Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆13Oct 23, 2021Updated 4 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 10 months ago
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆61Feb 21, 2025Updated 11 months ago
- ☆20Mar 3, 2025Updated 11 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- ☆18Mar 19, 2025Updated 10 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 8 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- ☆13Mar 14, 2024Updated last year
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models☆19May 24, 2025Updated 8 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- ☆23Dec 18, 2024Updated last year
- ☆17Feb 4, 2025Updated last year
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- ☆20Apr 16, 2025Updated 10 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 5 months ago
- ☆19Mar 25, 2025Updated 10 months ago
- DPO, but faster 🚀☆47Dec 6, 2024Updated last year