A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics research.
☆317Feb 19, 2026Updated 2 months ago
Alternatives and similar repositories for pico-train
Users that are interested in pico-train are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.☆114Feb 19, 2026Updated 2 months ago
- Engine for collecting, uploading, and downloading model activations☆28Apr 2, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆20Apr 10, 2025Updated last year
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 10 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆39Apr 8, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆22Aug 30, 2025Updated 8 months ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- Auto-Browse: AI Enabled Browser Automation☆18Jul 7, 2025Updated 10 months ago
- BunCurl2 is a blazing fast, fetch-like HTTP client built with Bun and cURL in TypeScript.☆23Mar 11, 2026Updated last month
- ☆14Feb 1, 2024Updated 2 years ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- Trakbit option analysis tool☆57Apr 29, 2026Updated last week
- Efficiently computing & storing token n-grams from large corpora☆27Oct 6, 2024Updated last year
- A harness for small llms☆81Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- C++ and Python libraries for neural networks.☆18Nov 19, 2025Updated 5 months ago
- ☆16Oct 1, 2023Updated 2 years ago
- MCPify all the projects!☆29Nov 28, 2025Updated 5 months ago
- TopoLM: brain-like spatio-functional organization in a topographic language model☆29May 23, 2025Updated 11 months ago
- An active learning library for Pytorch based on Lightning-Fabric.☆79May 4, 2024Updated 2 years ago
- OpenEXR and Radiance HDR image viewer☆14Apr 27, 2026Updated last week
- Agent Skills for working with Tuist projects☆33Mar 27, 2026Updated last month
- ☆19Apr 27, 2012Updated 14 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Guide: from fragile multi-agent app to prod ready with orra - code and resources.☆14Mar 24, 2025Updated last year
- None of the other digital drawing tools were my jam, so I wrote my own.☆13Mar 18, 2026Updated last month
- Create, run, rate, and iterate on your Claude Skills☆65Jan 13, 2026Updated 3 months ago
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆5,610Updated this week
- ☆34Mar 9, 2025Updated last year
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- MCP Server leveraging crawl4ai for web scraping and LLM-based content extraction (Markdown, text snippets, smart extraction). Designed fo…☆27Aug 12, 2025Updated 8 months ago
- ☆138Mar 20, 2025Updated last year
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,935May 1, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆34Aug 26, 2025Updated 8 months ago
- MCP Toggle is a simple GUI tool to help you manage MCP servers across clients seamlessly.☆16Apr 18, 2025Updated last year
- First open-source implementation of Google TurboQuant (ICLR 2026) -- near-optimal KV cache compression for LLM inference. 5x compression …☆60Apr 17, 2026Updated 3 weeks ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- A tool for model sparse based on torch.fx☆13Jun 3, 2024Updated last year
- Securely run AI-generated code in stateful sandboxes that run forever.☆228Apr 17, 2025Updated last year
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year