Ablustrund / MPLSandbox

MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs.

☆178

Alternatives and similar repositories for MPLSandbox

Users who are interested in MPLSandbox are comparing it to the repositories listed below

yiyihum / da-code
[EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
☆74 Updated 3 weeks ago
ShuaiLyu0110 / SQL-o1
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
☆192 Updated 2 months ago
jincan333 / MAS-TTS
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning
☆76 Updated 3 months ago
code-philia / CoEdPilot
Source code for "CoEdPilot: Recommending Code Edits with Learned Prior Edit Relevance, Project-wise Awareness, and Interactive Nature"
☆101 Updated 4 months ago
yileijin / PayAttn
Official Implementation of "Pay Attention to What You Need"
☆43 Updated 5 months ago
Yueeeeeeee / HRPO
Hybrid Latent Reasoning via Reinforcement Learning
☆142 Updated 2 months ago
bird-bench / livesqlbench
☆100 Updated 2 weeks ago
huangd1999 / EffiBench
[NeurIPS 2024] EffiBench: Benchmarking the Efficiency of Automatically Generated Code
☆55 Updated 8 months ago
bird-bench / BIRD-Interact
[BIRD-INTERACT] Re-imagines Text-to-SQL evaluation through the lens of dynamic interactions.
☆111 Updated 3 weeks ago
chatsci / Aeiva
A general AI agent framework that can be adapted to various tasks and environments.
☆101 Updated 6 months ago
OpenDCAI / RARE
Official implementation of RARE: Retrieval-Augmented Reasoning Modeling
☆183 Updated 2 months ago
Mercury7353 / PyBench
LLM Benchmark for Code
☆30 Updated 11 months ago
syr-cn / AutoRefine
Search and Refine During Think: Autonomous Retrieval‑Augmented Reasoning of LLMs
☆89 Updated last month
Qcompiler / MixQ_Tensorrt_LLM
Mixed-precision inference with TensorRT-LLM
☆81 Updated 9 months ago
YangLinyi / GLUE-X
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …
☆93 Updated last year
tapilot-crossing / tapilot_code
☆45 Updated last year
duguodong7 / Awesome-Knowledge-Fusion
A collection of papers related to knowledge fusion
☆57 Updated 9 months ago
xiongsiheng / SWAP
[ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model
☆34 Updated 2 months ago
xuyang-sudo / AutoRLAIF
AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning …
☆94 Updated 9 months ago
MrYxJ / enhance_long
This tool (enhance_long) aims to enhance the long-context extrapolation capability of Llama 2 at the lowest possible cost, preferably without …
☆45 Updated last year
Ljyustc / SocraticLM
☆145 Updated 4 months ago
gersteinlab / ML-Bench
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆302 Updated this week
SHUMKASHUN / Plots
This repo contains my custom-styled, Python-based plots for NLP papers, along with my reproductions of plots from my favourite papers
☆39 Updated last year
ColinLu50 / SafeDelta
The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.
☆50 Updated last month
Qcompiler / vllm-mixed-precision
Support for mixed-precision inference with vLLM
☆85 Updated 2 weeks ago
Rafa-zy / QLASS
☆38 Updated 3 weeks ago
HSLiu-Initial / CtrlA
This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.
☆62 Updated 9 months ago
S1s-Z / NOVA
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
☆20 Updated last week
jzhoubu / vsearch
An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.
☆39 Updated 7 months ago
heng840 / AMIG
Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…
☆26 Updated last year