wandb / wandb-mcp-serverLinks
A collection of MCP (Model Context Protocol) tools and examples for wandb and weave
☆14Updated last week
Alternatives and similar repositories for wandb-mcp-server
Users that are interested in wandb-mcp-server are comparing it to the libraries listed below
Sorting:
- ☆230Updated last week
- An interface library for RL post training with environments.☆66Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆463Updated 2 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆277Updated this week
- ☆244Updated 7 months ago
- A holistic evaluation library for multi-modal generative models using Weave☆27Updated 11 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆584Updated last week
- Multi-backend recommender systems with Keras 3☆145Updated this week
- ☆271Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆325Updated last year
- ☆211Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆204Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 6 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆560Updated 2 months ago
- ⏰ AI conference deadline countdowns☆285Updated last week
- 🧠🔗 From idea to production in just few lines: Graph-Based Programmable Neuro-Symbolic LM Framework - a production-first LM framework bu…☆342Updated last week
- Training-Ready RL Environments + Evals☆132Updated this week
- ☆103Updated 3 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆209Updated last week
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆346Updated 4 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆104Updated last month
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆169Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆188Updated 7 months ago
- code for training & evaluating Contextual Document Embedding models☆198Updated 5 months ago
- PyTorch library for Active Fine-Tuning☆93Updated last month
- ☆159Updated 10 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆115Updated 2 months ago
- ☆92Updated 3 weeks ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆330Updated 11 months ago