[NeurIPS'24 Spotlight] Observational Scaling Laws
☆60Oct 2, 2024Updated last year
Alternatives and similar repositories for ObsScaling
Users that are interested in ObsScaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT☆52Dec 8, 2023Updated 2 years ago
- ☆64Apr 9, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- ☆13Mar 7, 2022Updated 4 years ago
- An Inspect extension for agentic cyber evaluations☆24Feb 24, 2026Updated last month
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 4 months ago
- ☆15May 17, 2024Updated last year
- ☆59Mar 9, 2023Updated 3 years ago
- ☆24Jan 27, 2026Updated 2 months ago
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Jun 3, 2024Updated last year
- Experiments from "The Generalization-Stability Tradeoff in Neural Network Pruning": https://arxiv.org/abs/1906.03728.☆14Oct 23, 2020Updated 5 years ago
- ☆13Aug 17, 2020Updated 5 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆39Nov 30, 2025Updated 3 months ago
- ☆47May 21, 2025Updated 10 months ago
- ☆42May 23, 2023Updated 2 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆50Mar 2, 2026Updated 3 weeks ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last month
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Jun 25, 2020Updated 5 years ago
- World-Gymnast: Training Robots with Reinforcement Learning in a World Model☆30Feb 11, 2026Updated last month
- Official implementation of Categorical Flow Maps on text.☆48Feb 16, 2026Updated last month
- Comp 781 Project☆10Jan 2, 2026Updated 2 months ago
- ☆16Jul 23, 2024Updated last year
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Test-time-training on nearest neighbors for large language models☆49Apr 18, 2024Updated last year
- Code released for our ECCV 2022 paper "Interpretable Open-Set Domain Adaptation via Angular Margin Separation".☆23Dec 26, 2022Updated 3 years ago
- ☆16Jul 17, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Mar 18, 2024Updated 2 years ago
- ☆28Oct 22, 2024Updated last year
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆135Feb 15, 2026Updated last month
- ☆29Dec 19, 2025Updated 3 months ago
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆26Feb 18, 2025Updated last year
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Jul 1, 2023Updated 2 years ago