A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics research.
☆317Feb 19, 2026Updated 2 months ago
Alternatives and similar repositories for pico-train
Users that are interested in pico-train are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.☆113Feb 19, 2026Updated 2 months ago
- Engine for collecting, uploading, and downloading model activations☆28Apr 2, 2025Updated last year
- Fully Open Language Models with Stellar Performance☆317Nov 14, 2025Updated 5 months ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 10 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆38Apr 8, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository showcases the usage of GenAI to chat with Media Contant enhancing user's experience.☆13Dec 28, 2024Updated last year
- BunCurl2 is a blazing fast, fetch-like HTTP client built with Bun and cURL in TypeScript.☆23Mar 11, 2026Updated last month
- ☆14Feb 1, 2024Updated 2 years ago
- ☆30Nov 5, 2024Updated last year
- CM6_control_software☆10Feb 28, 2023Updated 3 years ago
- ☆12Apr 19, 2022Updated 4 years ago
- Improve safety, security, and privacy of AI systems at build, deploy and run stages.☆39Jan 27, 2026Updated 2 months ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Mapping out the "memory" of neural nets with data attribution☆53Updated this week
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 6 months ago
- Stock OpenWRT with patches that enable HW offload on Mono Gateway☆36Feb 9, 2026Updated 2 months ago
- My Solution to Assignments of CS234(Stanford / Fall 2019)☆15Sep 3, 2020Updated 5 years ago
- ☆12Jan 10, 2023Updated 3 years ago
- Official Code Repo for the Paper: "How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions", In NeurIPS 2…☆41Oct 31, 2022Updated 3 years ago
- Tools for basic array manipulation and help dealing with the different flavors of arrays in Julia☆12Sep 24, 2024Updated last year
- ☆24Mar 7, 2025Updated last year
- Find the aesthetic score of your images using a neural network predictor☆14Mar 14, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated last year
- JSON encoder and decoder for python written in C/C++☆10Jan 22, 2024Updated 2 years ago
- ☆27Mar 26, 2026Updated 3 weeks ago
- Generate Click options from msgspec types☆11Feb 1, 2025Updated last year
- Storing long contexts in tiny caches with self-study☆261Mar 23, 2026Updated 3 weeks ago
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆25Apr 13, 2026Updated last week
- RuboCop plugin for Slim template.☆13Sep 15, 2025Updated 7 months ago
- Cache your API calls with a single line of code. No mocks, no fixtures. Just faster, cleaner code.☆25Apr 11, 2026Updated last week
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆5,497Apr 11, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- my nixOS configuration☆16Aug 20, 2024Updated last year
- ☆30Mar 18, 2026Updated last month
- ☆14Jul 25, 2024Updated last year
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆11Aug 20, 2024Updated last year
- This repository contains a ready-to-use boilerplate for quickly setting up and working with crewai. It provides essential configurations …☆11Sep 11, 2024Updated last year
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- macOS window selector☆13Aug 27, 2024Updated last year