Rainlabuw / rl-enabled-distributed-assignment
Implementation of RL-Enabled Distributed Assignment (REDA)
☆17Updated 10 months ago
Alternatives and similar repositories for rl-enabled-distributed-assignment
Users that are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below
Sorting:
- ☆18Updated 7 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆20Updated last month
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- ☆21Updated 3 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last month
- ☆35Updated 3 weeks ago
- LLM reads a paper and produce a working prototype☆57Updated last month
- ☆38Updated 9 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 2 months ago
- Elevate your language models with insightful diversity metrics.☆11Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆15Updated last year
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- ☆41Updated 5 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Very minimal (and stateless) agent framework☆44Updated 4 months ago
- Simple repository for training small reasoning models☆27Updated 3 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 6 months ago
- Based on the tree of thoughts paper☆48Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆11Updated 9 months ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated 3 months ago
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 11 months ago
- The original Shared Recurrent Memory Transformer implementation☆25Updated 3 months ago
- ☆20Updated last week
- ☆45Updated 7 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆23Updated last month
- ☆18Updated last year
- ☆48Updated 6 months ago