REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems
☆30Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for REALM-Bench
Users that are interested in REALM-Bench are comparing it to the libraries listed below
Sorting:
- My research paper notes, focusing on data mining/recommender/reinforcement learning. 我的论文笔记,主要聚焦于数据挖掘、推荐系统、强化学习☆23Dec 4, 2021Updated 4 years ago
- unofficial implementation of the CoT-decoding method for extract cot paths in an unsupervised way☆21Jan 11, 2026Updated last month
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- Some resources (books, paper, video and online courses) about ML,DL,DM☆12Mar 14, 2021Updated 4 years ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆38Dec 23, 2025Updated 2 months ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated last month
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆11Aug 12, 2021Updated 4 years ago
- ☆11Dec 5, 2024Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- These are the official datasets used on the Medicare.gov Hospital Compare Website provided by the Centers for Medicare & Medicaid Service…☆10Mar 12, 2018Updated 7 years ago
- ☆16Feb 22, 2025Updated last year
- 机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格…☆10Feb 25, 2020Updated 6 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 3 years ago
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- Scripts and data to run AbDesign as described in Tools for protein science 2021☆14Nov 4, 2020Updated 5 years ago
- A central repository for curating and managing diverse datasets used in healthcare applications.☆11Jun 8, 2024Updated last year
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod…☆11Mar 10, 2021Updated 4 years ago
- ☆12Mar 1, 2025Updated last year
- AttentionDTA: prediction of drug–target binding affinity using attention model.https://ieeexplore.ieee.org/abstract/document/8983125☆13Aug 29, 2020Updated 5 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- CIKM'24☆10Oct 26, 2024Updated last year
- ☆11Apr 8, 2022Updated 3 years ago
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- A bot for an algorithmic trading competition that trades options using statistical arbitrage and delta and vega hedging☆12Jan 27, 2018Updated 8 years ago
- ☆13May 25, 2022Updated 3 years ago
- Force Fields☆14Oct 25, 2022Updated 3 years ago
- Knowledge-Based System'24☆12May 28, 2024Updated last year
- Cross-Care☆11Jun 24, 2024Updated last year
- 基于qwen3的医疗大模型研发全流程 0.分词训练 1.增量预训练 2.微调 3.强 化 4.量化 5.蒸馏 6.评估 7.lora模型合并 8.服务 9.部署☆30Jan 3, 2026Updated 2 months ago
- 基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.☆11Dec 16, 2022Updated 3 years ago
- Reading list for multimodal sequence learning☆14Sep 4, 2023Updated 2 years ago
- Demonstrate Function Calling code portability across 4 AI Models: OpenAI, AzureOpenAI, VertexAI Gemini and Mistral AI.☆13Jun 7, 2024Updated last year
- ☆13May 12, 2025Updated 9 months ago
- ☆12Jul 2, 2025Updated 8 months ago
- Pairs Trading using Unsupervised Clustering and Deep Reinforcement Learning☆11Aug 19, 2023Updated 2 years ago
- SynthEHRella is a benchmarking package used for evaluating synthetic Electronic Health Records (EHR) data generation methods.☆14Sep 17, 2025Updated 5 months ago