Rainlabuw / rl-enabled-distributed-assignment
Implementation of RL-Enabled Distributed Assignment (REDA)
☆14Updated 8 months ago
Alternatives and similar repositories for rl-enabled-distributed-assignment:
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
- ☆16Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- ☆18Updated last month
- a WIP architecture designed to allow transformers to think in a manner without tokens☆19Updated 11 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆17Updated this week
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 4 months ago
- LLM reads a paper and produce a working prototype☆51Updated 3 weeks ago
- LMQL implementation of tree of thoughts☆34Updated last year
- ☆38Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆41Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆17Updated this week
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 9 months ago
- ☆25Updated 6 months ago
- Automatic Prompt Optimization☆28Updated 10 months ago
- ☆38Updated 2 months ago
- Elevate your language models with insightful diversity metrics.☆11Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆16Updated this week
- Public repository containing METR's DVC pipeline for eval data analysis☆34Updated this week
- Automated Capability Discovery via Foundation Model Self-Exploration☆44Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆16Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- Entailment self-training☆25Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated 3 weeks ago
- AI_Powered_Dev_Search_Engine☆12Updated last year