[NeurIPS'24 Spotlight] Observational Scaling Laws
☆60Oct 2, 2024Updated last year
Alternatives and similar repositories for ObsScaling
Users that are interested in ObsScaling are comparing it to the libraries listed below
Sorting:
- Language models scale reliably with over-training and on downstream tasks☆100Apr 2, 2024Updated last year
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- An Inspect extension for agentic cyber evaluations☆22Feb 24, 2026Updated last week
- A package dedicated for running benchmark agreement testing☆17Sep 18, 2025Updated 5 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 9 months ago
- ☆64Apr 9, 2024Updated last year
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- ✌ CLoG: Benchmarking Continual Learning of Image Generation Models☆20Jun 10, 2024Updated last year
- ♠️TrucoBench: Qual é o melhor LLM no truco? Resultados, análises e insights estratégicos.☆19Feb 24, 2025Updated last year
- ☆43May 23, 2023Updated 2 years ago
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Mar 18, 2024Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆15Apr 15, 2024Updated last year
- ☆20Nov 4, 2025Updated 4 months ago
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- A toolkit for scaling law research ⚖☆57Jan 27, 2025Updated last year
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆78May 2, 2025Updated 10 months ago
- ☆60Mar 9, 2023Updated 2 years ago
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆27Feb 4, 2023Updated 3 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- This repository collects lecture slides, assignments (CAs), code notebooks, reports, and reference papers used in the "Deep Generative Mo…☆17Feb 14, 2026Updated 2 weeks ago
- Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image inf…☆79Jun 25, 2024Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Sep 26, 2024Updated last year
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 9 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- Code for Continuously Changing Corruptions (CCC) benchmark + evaluation☆41Aug 21, 2024Updated last year
- Scaling Data-Constrained Language Models☆342Jun 28, 2025Updated 8 months ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Dec 27, 2022Updated 3 years ago
- scrape, clean and model IPO data with supervised ML☆10Aug 20, 2020Updated 5 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- 2019年“OPPO TOP高校创新科技大赛”的参赛项目——“盲人眼镜”,基于“raspberry-web-app”三端交互策略实现盲人听书、导航、聊天等功能☆10Feb 20, 2022Updated 4 years ago
- ☆14Jan 23, 2026Updated last month