☆29Nov 30, 2025Updated 3 months ago
Alternatives and similar repositories for inductive-bias-probes
Users that are interested in inductive-bias-probes are comparing it to the libraries listed below
Sorting:
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 weeks ago
- ☆25Feb 20, 2026Updated 2 weeks ago
- ☆47Jul 21, 2025Updated 7 months ago
- Benchmarking Optimizers for LLM Pretraining☆54Dec 30, 2025Updated 2 months ago
- ☆57Sep 17, 2025Updated 5 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆66Updated this week
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Using machine learning to improve simulations of a dynamical system☆10Apr 24, 2019Updated 6 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Pytorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 4 months ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022☆11Sep 17, 2024Updated last year
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- HyperPose☆12Nov 6, 2025Updated 4 months ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- The color scheme of your GitHub contribution graph, bringing a vibrant and unique touch to your profile page.☆15Mar 25, 2025Updated 11 months ago
- ☆11Jun 20, 2023Updated 2 years ago
- C++-Animation-(Standard-Template-Library)-Engine,or CASTLE for short,is a C++ plotting and animation engine created by BiliBili uploader …☆11Jan 17, 2021Updated 5 years ago
- This is the notebooks for videos in my Bilibili Channel (https://space.bilibili.com/32773300?spm_id_from=333.1007.0.0)☆29Nov 6, 2025Updated 4 months ago
- ☆34Feb 4, 2026Updated last month
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- ☆13Oct 2, 2023Updated 2 years ago
- IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025☆30Oct 1, 2025Updated 5 months ago
- ☆17Feb 4, 2025Updated last year
- Reproducing GPT on the TinyStories dataset☆19Jan 18, 2024Updated 2 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 10 months ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- Library that provides metrics to assess representation quality☆23Feb 5, 2025Updated last year
- Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594☆14Aug 11, 2025Updated 6 months ago
- ☆20Feb 8, 2025Updated last year
- Code for "Towards Optimal Correlational Object Search" | ICRA 2022☆21Jul 10, 2024Updated last year
- A template project to both illustrate and serve as an example for plugin creations on top of the manim.☆20Apr 30, 2021Updated 4 years ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆84Jan 12, 2025Updated last year
- Code for steering and monitoring with concepts vectors in LLMs. https://arxiv.org/abs/2502.03708☆25Aug 10, 2025Updated 6 months ago
- ☆18Oct 3, 2025Updated 5 months ago