willccbb / agent-engineering
Agent Engineering course files
☆42 Updated this week
Alternatives and similar repositories for agent-engineering
Users who are interested in agent-engineering are comparing it to the repositories listed below
- A reading list of relevant papers and projects on foundation model annotation ☆27 Updated 4 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆82 Updated 3 weeks ago
- ☆63 Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- ☆30 Updated 7 months ago
- ☆28 Updated last week
- rl from zero pretrain, can it be done? we'll see. ☆56 Updated this week
- Compiling useful links, papers, benchmarks, ideas, etc. ☆46 Updated 3 months ago
- NLP with Rust for Python 🦀🐍 ☆62 Updated last month
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆45 Updated 2 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated 2 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆131 Updated last month
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 Updated 9 months ago
- ☆47 Updated 4 months ago
- ☆38 Updated 11 months ago
- Simple repository for training small reasoning models ☆33 Updated 4 months ago
- ☆61 Updated last week
- SIMD quantization kernels ☆72 Updated this week
- ☆127 Updated 3 months ago
- LLM training in simple, raw C/CUDA ☆14 Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆80 Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆68 Updated 3 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆23 Updated 2 months ago
- A miniature version of Modal ☆20 Updated last year
- Rust Implementation of micrograd ☆52 Updated 11 months ago
- ☆50 Updated 2 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆58 Updated last month
- ☆39 Updated 2 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆112 Updated last month
- look how they massacred my boy ☆63 Updated 8 months ago