A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics research.
☆317Feb 19, 2026Updated last month
Alternatives and similar repositories for pico-train
Users that are interested in pico-train are comparing it to the libraries listed below.
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.☆113Feb 19, 2026Updated last month
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Mar 22, 2026Updated last week
- Engine for collecting, uploading, and downloading model activations☆28Apr 2, 2025Updated 11 months ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 9 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆38Apr 8, 2023Updated 2 years ago
- BunCurl2 is a blazing fast, fetch-like HTTP client built with Bun and cURL in TypeScript.☆23Mar 11, 2026Updated 2 weeks ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆27Oct 6, 2024Updated last year
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 7 months ago
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- ☆10Jun 19, 2019Updated 6 years ago
- C++ and Python libraries for neural networks.☆18Nov 19, 2025Updated 4 months ago
- Mapping out the "memory" of neural nets with data attribution☆50Updated this week
- Figma Clone with real time Collaboration☆15Mar 1, 2024Updated 2 years ago
- LLM tokenizer in Zig☆15Dec 7, 2025Updated 3 months ago
- Syntactic evaluation sets, attribute-varying grammars, and code for replicating the CLAMS paper. ACL 2020.☆17Nov 26, 2024Updated last year
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆24Jan 26, 2026Updated 2 months ago
- ☆12Jan 10, 2023Updated 3 years ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated 2 months ago
- ☆25Mar 7, 2025Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 6 months ago
- Aspect Definition Language. A powerful and succinct replacement for XML, JSON, YAML, etc.☆12Jan 6, 2026Updated 2 months ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated last year
- Storing long contexts in tiny caches with self-study☆251Updated this week
- Generate Click options from msgspec types☆11Feb 1, 2025Updated last year
- RuboCop plugin for Slim template.☆12Sep 15, 2025Updated 6 months ago
- Guide: from fragile multi-agent app to prod ready with orra - code and resources.☆14Mar 24, 2025Updated last year
- Graphs in go☆19Dec 22, 2022Updated 3 years ago
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆5,325Mar 19, 2026Updated last week
- Package for estimating the entropy of a mixture distribution☆15Oct 28, 2017Updated 8 years ago
- ☆19Jul 4, 2025Updated 8 months ago
- This repository contains a ready-to-use boilerplate for quickly setting up and working with crewai. It provides essential configurations …☆11Sep 11, 2024Updated last year
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆11Aug 20, 2024Updated last year
- A JSON-formatted registry file to describe NFTs within a given wallet that should and should not be showcased☆16Sep 7, 2021Updated 4 years ago
- turn small javascript functions into GPT function calls☆12Aug 23, 2023Updated 2 years ago
- PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models☆46Mar 16, 2026Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,829Mar 20, 2026Updated last week
- Provides a zero-server web interface for use with Ollama local LLMs, plus AI search via Perplexity (API key required) and ima…☆33Mar 15, 2026Updated 2 weeks ago