Rainlabuw / rl-enabled-distributed-assignment
Implementation of RL-Enabled Distributed Assignment (REDA)
☆14 Updated 7 months ago
Alternatives and similar repositories for rl-enabled-distributed-assignment:
Users who are interested in rl-enabled-distributed-assignment are comparing it to the libraries listed below:
- ☆14 Updated 4 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆14 Updated 11 months ago
- The Swarm Ecosystem ☆19 Updated 6 months ago
- ☆38 Updated 6 months ago
- ☆16 Updated 9 months ago
- ☆48 Updated 3 months ago
- ☆17 Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 11 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆13 Updated last month
- LLM reads a paper and produces a working prototype ☆48 Updated 2 weeks ago
- ☆20 Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality ☆19 Updated 4 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" using PyTorch and Zeta ☆13 Updated 3 months ago
- Very minimal (and stateless) agent framework ☆41 Updated last month
- LMQL implementation of tree of thoughts ☆33 Updated last year
- Clean RL implementation using MLX ☆28 Updated 11 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆40 Updated 8 months ago
- ☆18 Updated last year
- Elevate your language models with insightful diversity metrics. ☆11 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆40 Updated 10 months ago
- A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem. ☆16 Updated 3 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆72 Updated last month
- Automatic Prompt Optimization ☆26 Updated 9 months ago