Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
☆59Dec 18, 2025Updated 2 months ago
Alternatives and similar repositories for GRL
Users that are interested in GRL are comparing it to the libraries listed below
Sorting:
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Sep 25, 2024Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆93Feb 4, 2026Updated last month
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆91Feb 23, 2026Updated last week
- ☆21Mar 6, 2020Updated 5 years ago
- ☆45May 27, 2025Updated 9 months ago
- ☆26Feb 27, 2022Updated 4 years ago
- This is system where images are trained and recognize of bumch of faces at a time☆23Oct 25, 2025Updated 4 months ago
- Code for Stable Control Representations☆26Apr 5, 2025Updated 11 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆47Jun 2, 2025Updated 9 months ago
- PyTorch implementation of the descriptor DEAL presented at NeurIPS 2021 "Extracting Deformation-Aware Local Features by Learning to Defor…☆31Jan 12, 2022Updated 4 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated 2 weeks ago
- Data recipes and robust infrastructure for training AI agents☆104Updated this week
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆66Dec 11, 2025Updated 2 months ago
- a size profiler for cuda binary☆72Jan 15, 2026Updated last month
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆247Updated this week
- ☆10Nov 17, 2022Updated 3 years ago
- Generating a cover letter using LLM given the job description and your resume☆10Feb 1, 2025Updated last year
- A collection of production-ready subagents for kilocode☆28Dec 16, 2025Updated 2 months ago
- TensorFlow 2 / Lite implementation of Ultra-Fast Structure-Aware Lane Detection☆12Aug 19, 2020Updated 5 years ago
- A Keras implementation of hybrid efficientnet swin transformer model.☆34Oct 14, 2023Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- This is the official GDSC repo with all of the source code presented in the video tutorials☆14Jun 27, 2023Updated 2 years ago
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago
- ☆14Mar 20, 2025Updated 11 months ago
- Bayesian Deep Ensembles via MILE: easy to use, scikit-learn compatible and fast (JAX powered)☆40Updated this week
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- An Awesome list of AI tools powered by ChatGPT / Whisper and Stable DIffusion or are useful to developers of that domain☆10Jul 26, 2023Updated 2 years ago
- ☆17Nov 18, 2025Updated 3 months ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 3 years ago
- ☆16Feb 22, 2025Updated last year
- ☆10Apr 12, 2025Updated 10 months ago
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction☆38Sep 13, 2023Updated 2 years ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆293Nov 7, 2025Updated 3 months ago
- ☆59May 21, 2025Updated 9 months ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11May 16, 2024Updated last year
- Visualize, create, and operate on pytrees in the most intuitive way possible.☆46Jan 11, 2025Updated last year