REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems
☆36Dec 31, 2025Updated 3 months ago
Alternatives and similar repositories for REALM-Bench
Users that are interested in REALM-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering☆24Oct 26, 2025Updated 5 months ago
- A bot for an algorithmic trading competition that trades options using statistical arbitrage and delta and vega hedging☆12Jan 27, 2018Updated 8 years ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆22Nov 17, 2025Updated 5 months ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆41Dec 23, 2025Updated 3 months ago
- track golang trending in github☆21Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pairs Trading using Unsupervised Clustering and Deep Reinforcement Learning☆11Aug 19, 2023Updated 2 years ago
- A curated list of awesome resources, libraries, frameworks, and tools for multi-agent systems (MAS) research and development.☆28Feb 17, 2025Updated last year
- ☆11Oct 9, 2021Updated 4 years ago
- A PyTorch Implementation of Feature Boosting and Suppression☆18Sep 14, 2020Updated 5 years ago
- Implementaion of the WWW paper Implicit User Awareness Modeling via Candidate Items for CTR Prediction in Search Ads☆17Apr 27, 2022Updated 3 years ago
- The pytorch implementation of paper: A Graph-Enhanced Click Model for Web Search☆15Nov 17, 2021Updated 4 years ago
- ☆82Mar 11, 2025Updated last year
- KAIST medical VL research group☆20Dec 20, 2024Updated last year
- Making survival analysis work in TensorFlow☆19Jun 4, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- unofficial implementation of the CoT-decoding method for extract cot paths in an unsupervised way☆20Jan 11, 2026Updated 3 months ago
- ☆31Apr 2, 2025Updated last year
- 非沪籍高校毕业生留沪各项流程汇总☆17Jan 24, 2018Updated 8 years ago
- Using DeepBSDE solver to price/hedge options & optimize portfolios under Black-Scholes, Heston and multiscale models.☆18Mar 20, 2020Updated 6 years ago
- [WWW'2025] "RTBAgent: A LLM-based Agent System for Real-Time Bidding"☆32Apr 14, 2025Updated last year
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.☆23Jun 8, 2023Updated 2 years ago
- [ WSDM '22 ] On Sampling Collaborative Filtering Datasets☆20Jan 13, 2022Updated 4 years ago
- 面向高校青年教师的纵向科研项目资料汇总仓库☆34Aug 11, 2025Updated 8 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆56Apr 6, 2025Updated last year
- ☆16Feb 22, 2025Updated last year
- Prompt templates for language models☆10Apr 7, 2026Updated last week
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- ☆359Jul 23, 2025Updated 8 months ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 4 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆13May 12, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Updated this week
- 机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格…☆10Feb 25, 2020Updated 6 years ago
- Elevate your language models with insightful diversity metrics.☆11Feb 4, 2024Updated 2 years ago
- ☆38May 23, 2024Updated last year
- Demonstrate Function Calling code portability across 4 AI Models: OpenAI, AzureOpenAI, VertexAI Gemini and Mistral AI.☆13Jun 7, 2024Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 4 years ago
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year