REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems
☆41May 21, 2026Updated last week
Alternatives and similar repositories for REALM-Bench
Users that are interested in REALM-Bench are comparing it to the libraries listed below.
- My research paper notes, focusing on data mining, recommender systems, and reinforcement learning.☆24Dec 4, 2021Updated 4 years ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆23Nov 17, 2025Updated 6 months ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆41Dec 23, 2025Updated 5 months ago
- Code Developed for my Masters Thesis titled: A real-time independent and inexpensive PPG signal quality classification tool for vital sig…☆13Jun 21, 2020Updated 5 years ago
- A survey on deep research☆48Sep 9, 2025Updated 8 months ago
- A PyTorch Implementation of Feature Boosting and Suppression☆18Sep 14, 2020Updated 5 years ago
- Implementation of the WWW paper "Implicit User Awareness Modeling via Candidate Items for CTR Prediction in Search Ads"☆18Apr 27, 2022Updated 4 years ago
- PyTorch implementation of the paper "A Graph-Enhanced Click Model for Web Search"☆15Nov 17, 2021Updated 4 years ago
- ☆84Mar 11, 2025Updated last year
- GRASP (Greedy Randomized Adaptive Search Procedure) Function for TSP problems.☆18Dec 22, 2022Updated 3 years ago
- Unofficial implementation of the CoT-decoding method for extracting CoT paths in an unsupervised way☆20Jan 11, 2026Updated 4 months ago
- A summary of the procedures for university graduates without Shanghai household registration who want to stay in Shanghai☆17Jan 24, 2018Updated 8 years ago
- [ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders of magnitude less data.☆23Jun 8, 2023Updated 2 years ago
- [ WSDM '22 ] On Sampling Collaborative Filtering Datasets☆20Jan 13, 2022Updated 4 years ago
- Official PyTorch implementation of the Spatial Relation Decomposition method (AAAI 23)☆23Dec 29, 2023Updated 2 years ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆58Apr 6, 2025Updated last year
- ☆16Feb 22, 2025Updated last year
- Prompt templates for language models☆10Apr 7, 2026Updated last month
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Feb 1, 2021Updated 5 years ago
- ☆41Nov 22, 2025Updated 6 months ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- ☆35May 24, 2025Updated last year
- Experiment code for the RecSys '21 paper "Mitigating Confounding Bias in Recommendation via Information Bottleneck"☆19Apr 6, 2022Updated 4 years ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- Code for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆12Aug 12, 2021Updated 4 years ago
- ☆34Jul 4, 2025Updated 10 months ago
- Naturally annotated text-category pairs for text classification☆10Sep 10, 2021Updated 4 years ago
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆66Jul 22, 2025Updated 10 months ago
- ☆13May 12, 2025Updated last year
- Demonstrate Function Calling code portability across 4 AI Models: OpenAI, AzureOpenAI, VertexAI Gemini and Mistral AI.☆13Jun 7, 2024Updated last year
- ☆11Apr 8, 2022Updated 4 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated 3 months ago
- ☆14May 25, 2022Updated 4 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 4 years ago
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆21Oct 28, 2025Updated 7 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- An Image Recognition tutorial written for the HyperionDev blog☆10Dec 19, 2017Updated 8 years ago