REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems
☆42May 21, 2026Updated 3 weeks ago
Alternatives and similar repositories for REALM-Bench
Users that are interested in REALM-Bench are comparing it to the libraries listed below.
- My research paper notes, focusing on data mining/recommender/reinforcement learning.☆24Dec 4, 2021Updated 4 years ago
- A bot for an algorithmic trading competition that trades options using statistical arbitrage and delta and vega hedging☆12Jan 27, 2018Updated 8 years ago
- track golang trending in github☆22Updated this week
- Sparse Mixture of Learned Kernels for Interpretable and Efficient PPG Signal Quality Assessment and Artifact Segmentation☆22Jan 6, 2025Updated last year
- Joint Optimization of Cascade Ranking Models (WSDM 19)☆13Jun 21, 2022Updated 3 years ago
- The pytorch implementation of paper: A Graph-Enhanced Click Model for Web Search☆15Nov 17, 2021Updated 4 years ago
- AutoML framework for implementing automated machine learning on data streams☆15Jun 29, 2023Updated 2 years ago
- ☆84Mar 11, 2025Updated last year
- Summary of the procedures for non-Shanghai-registered university graduates to stay in Shanghai☆17Jan 24, 2018Updated 8 years ago
- Using DeepBSDE solver to price/hedge options & optimize portfolios under Black-Scholes, Heston and multiscale models.☆18Mar 20, 2020Updated 6 years ago
- ☆15Apr 26, 2025Updated last year
- [WWW'2025] "RTBAgent: A LLM-based Agent System for Real-Time Bidding"☆35Apr 14, 2025Updated last year
- [ WSDM '22 ] On Sampling Collaborative Filtering Datasets☆20Jan 13, 2022Updated 4 years ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution [ICSE 2026]☆31Nov 11, 2025Updated 7 months ago
- ☆17Aug 7, 2024Updated last year
- Official pytorch implementation of Spatial Relation Decomposition method (AAAI 23)☆23Dec 29, 2023Updated 2 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Pairs Trading in Python☆28Apr 25, 2021Updated 5 years ago
- This is the reading list of Large Language Model-Based Data Science Agent☆40Nov 3, 2025Updated 7 months ago
- Prompt templates for language models☆10Apr 7, 2026Updated 2 months ago
- ☆41Updated this week
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- ☆373Jul 23, 2025Updated 10 months ago
- This repository contains the official implementation of the paper "Robustness of Graph Neural Networks at Scale" (NeurIPS, 2021).☆31Jul 25, 2023Updated 2 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 4 years ago
- ☆36May 24, 2025Updated last year
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆23Jun 10, 2026Updated last week
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆13Aug 12, 2021Updated 4 years ago
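- Roadmap for machine learning, deep learning, adversarial neural networks (GAN), graph neural networks (GNN), NLP, and big data, with a large amount of source code (Python, PyTorch) to help readers absorb the fundamentals, get through interviews, and go from beginner to qualified…☆10Feb 25, 2020Updated 6 years ago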
- Elevate your language models with insightful diversity metrics.☆11Feb 4, 2024Updated 2 years ago
- Computational predictor of protein intrinsic disorder and its functions☆11Dec 4, 2023Updated 2 years ago
- Hedging using Deep Reinforcement Learning and Deep Learning☆27Mar 29, 2021Updated 5 years ago
- Model to predict kinase-ligand pKi values.☆12Jul 6, 2023Updated 2 years ago
- ☆14May 12, 2025Updated last year
- ☆11Apr 8, 2022Updated 4 years ago
- ☆14May 25, 2022Updated 4 years ago