mcleish7 / gemstone-scaling-lawsView external linksLinks
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆33Sep 28, 2025Updated 4 months ago
Alternatives and similar repositories for gemstone-scaling-laws
Users that are interested in gemstone-scaling-laws are comparing it to the libraries listed below
Sorting:
- Official implementation of GOAT model (ICML2023)☆38Jul 3, 2023Updated 2 years ago
- What do we learn from inverting CLIP models?☆58Mar 6, 2024Updated last year
- ☆20Nov 4, 2025Updated 3 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- ☆18Oct 12, 2022Updated 3 years ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Apr 3, 2024Updated last year
- ☆31Feb 8, 2026Updated last week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆198May 28, 2024Updated last year
- ☆33Nov 27, 2023Updated 2 years ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆31Oct 26, 2023Updated 2 years ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆29Jan 10, 2026Updated last month
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Sep 26, 2024Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆70Feb 5, 2024Updated 2 years ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆70Feb 22, 2024Updated last year
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- ☆11Oct 20, 2023Updated 2 years ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated last year
- ☆11Oct 20, 2023Updated 2 years ago
- An official repository for GPTailor☆16Jun 29, 2025Updated 7 months ago
- Pytorch ImageNet1k Loader with Bounding Boxes.☆13Jan 23, 2022Updated 4 years ago
- ☆14Mar 2, 2025Updated 11 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ☆21Jul 21, 2025Updated 6 months ago
- ☆33Jan 7, 2025Updated last year
- Training vision models with full-batch gradient descent and regularization☆39Feb 14, 2023Updated 3 years ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Sep 23, 2023Updated 2 years ago
- ☆16Jul 17, 2022Updated 3 years ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- ☆19Mar 25, 2025Updated 10 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 5 months ago
- DPO, but faster 🚀☆47Dec 6, 2024Updated last year
- [NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning☆72Feb 11, 2025Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated 11 months ago
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning☆17May 14, 2023Updated 2 years ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Dec 21, 2025Updated last month
- ☆16Jul 23, 2024Updated last year
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago