☆31Nov 30, 2025Updated 4 months ago
Alternatives and similar repositories for inductive-bias-probes
Users that are interested in inductive-bias-probes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Benchmarking Optimizers for LLM Pretraining☆57Dec 30, 2025Updated 3 months ago
- ☆16Mar 22, 2025Updated last year
- ☆52Mar 30, 2026Updated 3 weeks ago
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆34Jul 5, 2023Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- Code for "Towards Optimal Correlational Object Search" | ICRA 2022☆21Jul 10, 2024Updated last year
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆71Updated this week
- ☆61Sep 17, 2025Updated 7 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 6 months ago
- We study toy models of skill learning.☆33Feb 3, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 4 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 6 months ago
- The color scheme of your GitHub contribution graph, bringing a vibrant and unique touch to your profile page.☆15Mar 25, 2025Updated last year
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 6 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆65Jan 26, 2026Updated 2 months ago
- Simple MoE - Day 17 of 365 Days of Repos☆18Jan 17, 2025Updated last year
- General tips to drive your research at Mila☆22May 14, 2024Updated last year
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- Library that provides metrics to assess representation quality☆27Feb 5, 2025Updated last year
- An implementation of a Meta Harness for Hermes.☆70Apr 7, 2026Updated last week
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆29Feb 25, 2025Updated last year
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- A python package to design and debug RL agents.☆33Apr 2, 2026Updated 2 weeks ago
- ☆84Aug 31, 2023Updated 2 years ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆86Jan 12, 2025Updated last year
- Reproducing GPT on the TinyStories dataset☆19Jan 18, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- ☆17Feb 4, 2025Updated last year
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- Official implementation of "CellFlux: Simulating Cellular Morphology Changes via Flow Matching" (ICML 2025)☆37Sep 3, 2025Updated 7 months ago
- HyperPose☆12Nov 6, 2025Updated 5 months ago
- A template project to both illustrate and serve as an example for plugin creations on top of the manim.☆20Apr 30, 2021Updated 4 years ago