willccbb / agent-engineeringLinks
Agent Engineering course files
☆71Updated 6 months ago
Alternatives and similar repositories for agent-engineering
Users that are interested in agent-engineering are comparing it to the libraries listed below
Sorting:
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆148Updated this week
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Updated 6 months ago
- ☆67Updated 8 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- ☆54Updated 9 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Updated 11 months ago
- Claude Deep Research config for Claude Code.☆225Updated 10 months ago
- Setup guide for ML training on NVIDIA DGX Spark (GB10 Blackwell, CUDA 13, aarch64)☆77Updated 3 weeks ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 10 months ago
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆94Updated 2 months ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆118Updated 7 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 8 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 3 months ago
- CLI for Recursive Language Models☆37Updated last week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 10 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ⚖️ Awesome LLM Judges ⚖️☆148Updated 9 months ago
- Lightly-reviewed collection of community environments☆210Updated last week
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆94Updated last week
- A framework for optimizing DSPy programs with RL☆308Updated 3 weeks ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- ☆51Updated 5 months ago
- ☆14Updated 9 months ago
- Inference-time scaling for LLMs-as-a-judge.☆327Updated 3 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Updated last year