pico-lm / pico-analyzeLinks
A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.
☆110Updated last week
Alternatives and similar repositories for pico-analyze
Users that are interested in pico-analyze are comparing it to the libraries listed below
Sorting:
- A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics …☆293Updated 2 weeks ago
- Provider-agnostic, open-source evaluation infrastructure for language models☆661Updated last week
- Observability and runtime visualization for JS/TS/Python code with zero code change☆134Updated 5 months ago
- Securely run AI-generated code in stateful sandboxes that run forever.☆224Updated 7 months ago
- Pixelagent — Multimodal stateful agents☆223Updated 5 months ago
- Build reliable AI Workflows and Agents with humans in the loop, structured outputs and durable execution.☆411Updated last week
- ☆159Updated last month
- ☆187Updated 4 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆612Updated this week
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆586Updated this week
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆115Updated last week
- The IDE of the future☆173Updated last month
- The specification for the Universal Tool Calling Protocol☆258Updated this week
- Live-bending a foundation model’s output at neural network level.☆270Updated 7 months ago
- See Through Your Models☆402Updated 4 months ago
- Managed Agent Posttraining☆60Updated last week
- ☆113Updated 4 months ago
- cmux lets you run Claude Code, Codex CLI, Amp, Gemini CLI, Cursor CLI, Opencode, and other coding agent CLIs in parallel across multiple …☆625Updated this week
- Routing on Random Forest (RoRF)☆222Updated last year
- Visual inference exploration & experimentation playground☆96Updated last year
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆269Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆112Updated this week
- An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questi…☆248Updated this week
- ☆141Updated last month
- An LLM-powered programming-by-example programming language.☆204Updated 11 months ago
- 🎲 codenames, but AI plays against AI. (OpenAI o1)☆51Updated 8 months ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆118Updated 5 months ago
- ArxivTok 📚: Browse ArXiv papers with a TikTok-style vertical swipe interface.☆90Updated 9 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆217Updated last week
- Awesome Code Sandboxing for AI☆201Updated 4 months ago