[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in the specialized technical domain of Operations Research (OR). The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
☆48 · Jun 7, 2025 · Updated 11 months ago
Alternatives and similar repositories for ORQA
Users that are interested in ORQA are comparing it to the libraries listed below.
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems" ☆118 · Feb 6, 2026 · Updated 3 months ago
- OptiBench and ReSocratic Synthesis Method ☆34 · Oct 2, 2025 · Updated 7 months ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab… ☆18 · Jun 11, 2024 · Updated last year
- Meta-black-box Optimization Platform ☆19 · Apr 28, 2025 · Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24. ☆18 · Feb 21, 2025 · Updated last year
- Evolution of Heuristics ☆319 · May 22, 2026 · Updated last week
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202… ☆14 · Jan 19, 2023 · Updated 3 years ago
- OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problems with Reasoning LLM ☆92 · Updated this week
- ☆12 · Jul 12, 2024 · Updated last year
- LED-Net: A lightweight and efficient dual-branch convolutional neural network designed to address the challenge of achieving high-perform… ☆16 · Sep 9, 2025 · Updated 8 months ago
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning. ☆33 · May 29, 2025 · Updated last year
- PyTorch Implementation of Weakly Supervised Pre-training - [IJCAI19] ☆12 · May 23, 2020 · Updated 6 years ago
- ☆10 · Jun 16, 2022 · Updated 3 years ago
- This is the pytorch demo code for Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain, (PTMDA) (IEEE Transactions on Ima… ☆11 · Apr 15, 2022 · Updated 4 years ago
- Implementation of Materials Discovery with Extreme properties via AI-Driven Combinatorial Chemistry ☆10 · May 8, 2024 · Updated 2 years ago
- This project is an implementation of two-step object detection (super-resolution and object detection) to address degradation of object d… ☆10 · May 29, 2021 · Updated 5 years ago
- U-Net neural network applied to FWI problems ☆13 · Dec 8, 2022 · Updated 3 years ago
- Code for "MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification" ☆10 · Aug 26, 2024 · Updated last year
- Accepted at ICCV '23 ☆16 · Oct 4, 2023 · Updated 2 years ago
- Implementation of the Biased Boundary Attack for the NeurIPS 2018 Adversarial Vision Challenge ☆13 · Jan 29, 2020 · Updated 6 years ago
- Force-directed graph layout implementation in Scala ☆25 · Apr 4, 2017 · Updated 9 years ago
- Flexible Job Shop Instances ☆68 · May 1, 2026 · Updated 3 weeks ago
- Error-controlled interaction discovery in machine learning models ☆22 · Jun 24, 2025 · Updated 11 months ago
- ☆19 · May 4, 2023 · Updated 3 years ago
- Reinforcement learning environment for job scheduling written in python. ☆25 · Dec 21, 2019 · Updated 6 years ago
- A geometric-driven semi-supervised approach for fishing activity detection from AIS data. ☆13 · Aug 24, 2022 · Updated 3 years ago
- ☆26 · Apr 6, 2026 · Updated last month
- [MICCAI-FLARE2022] Combining Self-Training and Hybrid Architecture for Semi-supervised Abdominal Organ Segmentation ☆11 · Aug 24, 2022 · Updated 3 years ago
- Deep Convolutional Generative Adversarial Networks, with some small improvements. ☆15 · Jul 28, 2016 · Updated 9 years ago
- Source code for "Improving Attention Mechanism in Graph Neural Networks via Cardinality Preservation" (IJCAI 2020) ☆17 · Jul 25, 2024 · Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18 · Apr 11, 2024 · Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference. ☆14 · Mar 27, 2024 · Updated 2 years ago
- ☆18 · May 5, 2026 · Updated 3 weeks ago
- 🧀 KoBART summarization using pytorch ☆13 · Jun 7, 2023 · Updated 2 years ago
- Optimization Modeling Using mip Solvers and large language models ☆268 · Nov 4, 2025 · Updated 6 months ago
- Anomaly detection from ships' Automatic Identification System (AIS) data ☆13 · Nov 2, 2024 · Updated last year
- ☆16 · Jul 11, 2023 · Updated 2 years ago
- Code for [MLMI 2019] [KiPA22 Challenge] [AMOS 2022 Challenge] [MICCAI 2022 Workshop] Boundary-Aware Network for Medical Image Segmentatio… ☆16 · Mar 21, 2023 · Updated 3 years ago
- The deeplabv3+ person segmentation android example. ☆28 · Feb 1, 2021 · Updated 5 years ago