willccbb / agent-engineeringLinks
Agent Engineering course files
☆71Updated 5 months ago
Alternatives and similar repositories for agent-engineering
Users that are interested in agent-engineering are comparing it to the libraries listed below
Sorting:
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 5 months ago
- Curated collection of community environments☆195Updated last week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 4 months ago
- ☆68Updated 7 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆121Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- ☆72Updated last month
- A framework for optimizing DSPy programs with RL☆302Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments.☆286Updated 2 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆166Updated 2 weeks ago
- ☆53Updated 8 months ago
- ☆14Updated 8 months ago
- SIMD quantization kernels☆93Updated 3 months ago
- Claude Deep Research config for Claude Code.☆224Updated 9 months ago
- look how they massacred my boy☆63Updated last year
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 6 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 9 months ago
- PageRank for LLMs☆51Updated 3 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- ☆57Updated 9 months ago
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆65Updated this week
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆75Updated 7 months ago
- rl from zero pretrain, can it be done? yes.☆282Updated 2 months ago
- Inference-time scaling for LLMs-as-a-judge.☆317Updated last month
- Rust Implementation of micrograd☆53Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 7 months ago
- a single interface around speech-to-speech foundation models☆27Updated 5 months ago
- Verbosity control for AI agents☆64Updated last year