allenai / genesysLinks
Source code and utilities for the Genesys distributed language model architecture discovery system.
☆130Updated 3 months ago
Alternatives and similar repositories for genesys
Users that are interested in genesys are comparing it to the libraries listed below
Sorting:
- A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]☆120Updated 4 months ago
- Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning☆77Updated 5 months ago
- A ReAct-Based Highly Robust Autonomous Agent Framework.☆208Updated last month
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆177Updated 5 months ago
- [ACL 2025 Findings] MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs https://arxiv.org/abs/2408.0…☆187Updated last week
- Accurate, private and configurable document retrieval LLM☆129Updated 2 weeks ago
- ☆52Updated last month
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆302Updated 2 months ago
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆76Updated 2 months ago
- Official implementation of RARE: Retrieval-Augmented Reasoning Modeling☆182Updated 4 months ago
- A general AI agent framework that can be adapted to various tasks and environments.☆102Updated 8 months ago
- Framework exploring ergonomic, lightweight multi-agent orchestration.☆116Updated last week
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆191Updated 4 months ago
- 一个基于多个大语言模型的智能学术范文写作系统,能够根据输入的开题报告或研究设计文档,自动生成包含引用的学术范文的各章节内容。☆222Updated 2 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆36Updated last week
- DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approac…☆374Updated 5 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆422Updated 2 weeks ago
- Zero Graph – Minimalist LLM framework designed for AI Agent programming☆104Updated 2 months ago
- [NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning☆153Updated 3 weeks ago
- Code and dataset of CodeSteer☆88Updated 6 months ago
- ☆163Updated this week
- AI powered tools playground☆120Updated 2 years ago
- Enable AI agents to understand and execute any DeFi protocol operations☆76Updated 8 months ago
- A Contextual RAG Bot Framework☆82Updated 11 months ago
- ☆186Updated last week
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆121Updated last month
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"☆158Updated 9 months ago
- Flexible RAG tools, Features semantic search, document indexing, and intelligent reranking with minimal intrusion design.☆88Updated last month
- AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning …☆94Updated 11 months ago
- AI solution for Patent Classification☆140Updated 5 years ago