A Mechanistic Interpretability Analysis of Grokking
☆27Sep 26, 2022Updated 3 years ago
Alternatives and similar repositories for Grokking
Users that are interested in Grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆16Apr 15, 2024Updated last year
- ☆19Mar 5, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Feb 14, 2024Updated 2 years ago
- ☆12Oct 5, 2020Updated 5 years ago
- Code for reproducing key results in the paper "Improving the Neural GPU Architecture for Algorithm Learning" by Karlis Freivalds, Renars …☆13Jul 4, 2018Updated 7 years ago
- ☆33Feb 15, 2026Updated last month
- Code for reproducing key results in the paper "Neural Shuffle-Exchange Networks - Sequence Processing in O(n log n) Time" by Kārlis Freiv…☆11Apr 10, 2020Updated 5 years ago
- use pip in IPython☆13Oct 29, 2024Updated last year
- Scripts for quantifying stuff from my life☆19Nov 1, 2015Updated 10 years ago
- playing with gpt4☆14Mar 17, 2023Updated 3 years ago
- Collection of small Three.js examples relevant to math visualization☆12Aug 27, 2014Updated 11 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mechanistic Interpretability Visualizations using React☆331Dec 18, 2024Updated last year
- ☆16Jul 20, 2023Updated 2 years ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆18Nov 24, 2023Updated 2 years ago
- DBT spectral analysis scripts for matlab☆10May 27, 2018Updated 7 years ago
- ☆13Oct 10, 2019Updated 6 years ago
- Exceptions to the ABC conjecture in Lean☆20Jan 26, 2026Updated 2 months ago
- Digital texts in Prakrit☆10Sep 14, 2025Updated 6 months ago
- An Offline Wikipedia Dump Reader in Javascript that probably only works on Chrome☆19Dec 23, 2011Updated 14 years ago
- JAX port of FLUX.1 models using flax.nnx☆24Sep 28, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR'24] Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for Molecule Generation☆28Feb 24, 2025Updated last year
- ☆21Mar 18, 2025Updated last year
- An exploration of LLM steering☆25Jun 15, 2024Updated last year
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 3 years ago
- ☆18Jun 12, 2023Updated 2 years ago
- Earth mover's distance on Nvidia GPUS☆16Sep 10, 2016Updated 9 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…☆15Dec 6, 2024Updated last year
- Simple WireGuard interface monitoring☆13Mar 9, 2026Updated 2 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Mar 15, 2025Updated last year
- Code for computing Hypergraph Co-Optimal Transport distances☆17Jan 19, 2023Updated 3 years ago
- building a game engine using rust-lang from scratch!☆10May 25, 2017Updated 8 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- A Next.js chatbot app demonstrating seamless integration with window.ai.☆15Jun 25, 2023Updated 2 years ago
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- Statistical analysis methods for comparing prompt and model performance in LLM evaluations.☆84Updated this week