A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics research.
☆318Feb 19, 2026Updated 3 months ago
Alternatives and similar repositories for pico-train
Users that are interested in pico-train are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.☆116Feb 19, 2026Updated 3 months ago
- Engine for collecting, uploading, and downloading model activations☆28Apr 2, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- Fully Open Language Models with Stellar Performance☆319May 13, 2026Updated 2 weeks ago
- ADAG: Transluce's MLP neuron-level circuit tracing library☆26Apr 10, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 11 months ago
- Auto-Browse: AI Enabled Browser Automation☆18Jul 7, 2025Updated 10 months ago
- BunCurl2 is a blazing fast, fetch-like HTTP client built with Bun and cURL in TypeScript.☆23Mar 11, 2026Updated 2 months ago
- ☆14May 25, 2023Updated 3 years ago
- ☆16Nov 30, 2022Updated 3 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Gradient Descent optimizers for Julia☆12May 26, 2020Updated 6 years ago
- C++ and Python libraries for neural networks.☆18May 18, 2026Updated last week
- A harness for small llms☆93Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- A collection of Claude commands and utilities☆27May 23, 2026Updated last week
- AGridable is a Python library which makes formatting tables in your Dash app a breeze.☆15Jun 4, 2024Updated last year
- An active learning library for Pytorch based on Lightning-Fabric.☆79May 4, 2024Updated 2 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆13Sep 19, 2024Updated last year
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆15Apr 6, 2026Updated last month
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 8 months ago
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆5,752May 18, 2026Updated last week
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.☆12Sep 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JSON encoder and decoder for python written in C/C++☆11Jan 22, 2024Updated 2 years ago
- Storing long contexts in tiny caches with self-study☆268Mar 23, 2026Updated 2 months ago
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆26May 13, 2026Updated 2 weeks ago
- Agent Skills for working with Tuist projects☆32Mar 27, 2026Updated 2 months ago
- Cache your API calls with a single line of code. No mocks, no fixtures. Just faster, cleaner code.☆26May 20, 2026Updated last week
- ☆14Jul 25, 2024Updated last year
- ☆19Jul 4, 2025Updated 10 months ago
- ☆32Mar 18, 2026Updated 2 months ago
- This repository contains a ready-to-use boilerplate for quickly setting up and working with crewai. It provides essential configurations …☆11Sep 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- turn small javascript functions into GPT function calls☆12Aug 23, 2023Updated 2 years ago
- Reacts component for interacting with the Ethereum Name Service.☆16Aug 27, 2023Updated 2 years ago
- Tools that make use of Pinata API in Python☆14Oct 23, 2021Updated 4 years ago
- macOS window selector☆13Aug 27, 2024Updated last year
- ☆138Mar 20, 2025Updated last year