cognizant-ai-lab/neuro-san-benchmarking

General benchmarking apparatus for running multi-agent systems against benchmarks

☆46

Alternatives and similar repositories for neuro-san-benchmarking

Users who are interested in neuro-san-benchmarking are comparing it to the repositories listed below.

BJSwaroop / PortfolioCodeWithSwaroop
☆29 · Sep 16, 2023 · Updated 2 years ago
ARiSE-Lab / CYCLE_OOPSLA_24
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10 · Mar 8, 2024 · Updated 2 years ago
wssun / PromptCS
A Prompt Learning Framework for Source Code Summarization
☆14 · Dec 26, 2023 · Updated 2 years ago
VsonicV / es-at-scale
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
☆376 · Jun 26, 2026 · Updated 3 weeks ago
AngelRuizMoreno / ConcensusPharmacophore
Consensus pharmacophore for Drug Design
☆15 · Aug 22, 2025 · Updated 11 months ago
neurosnap / deck-continuations
☆14 · Apr 5, 2023 · Updated 3 years ago
alphaXiv / agents
TypeScript agents for real applications.
☆24 · Updated this week
HiteSit / PyMol_Fitter
Ready-To-Use Pymol Plugin for Docking and Minimization
☆12 · Nov 15, 2025 · Updated 8 months ago
ajac-zero / mock-ai
False LLM endpoints for testing
☆14 · May 21, 2025 · Updated last year
flowersteam / EAGER
☆10 · Oct 11, 2022 · Updated 3 years ago
UnixJunkie / molenc
MolEnc: a molecular encoder using rdkit and OCaml.
☆21 · May 12, 2026 · Updated 2 months ago
ejmichaud / precision-ml
☆13 · Feb 12, 2023 · Updated 3 years ago
jondot / vscode-hygen
This extension bundles Hygen into VSCode and offers seamless code generator functionality right into your editor.
☆20 · Sep 14, 2018 · Updated 7 years ago
GATECH-EIC / FracTrain
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10 · Feb 13, 2022 · Updated 4 years ago
cmpnd-ai / dspy-tutorial-deep-research
Learn DSPy's core abstractions while building a deep research agent.
☆44 · Mar 8, 2026 · Updated 4 months ago
facebookresearch / dual-system-for-visual-language-reasoning
Github repo for Peifeng's internship project
☆13 · Nov 7, 2023 · Updated 2 years ago
sparkle-reasoning / sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16 · Dec 12, 2025 · Updated 7 months ago
abrvkh / explainability_toolkit
☆14 · Dec 12, 2024 · Updated last year
3DStreet / aframe-loader-3dtiles-component
A-Frame component using 3D-Tiles
☆16 · Oct 2, 2024 · Updated last year
govtech-responsibleai / KnowOrNot
☆28 · Feb 11, 2026 · Updated 5 months ago
merlresearch / SMART
Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"
☆11 · Aug 10, 2023 · Updated 2 years ago
noambrown / acpc_poker_gui_client
Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI…
☆11 · Apr 25, 2015 · Updated 11 years ago
osmoai / vexo
Cheminformatics tools that work natively with Google tools such as Sheets and BigQuery
☆17 · Jul 12, 2024 · Updated 2 years ago
samaya-ai / frontier-finance
Samaya AI's FrontierFinance Benchmark Grader
☆17 · Jul 16, 2026 · Updated last week
tabzhangjx / MixupExplainer
☆10 · Jun 11, 2023 · Updated 3 years ago
forlilab / bottchscore
Calculate Böttcher score on small molecules (doi.org/10.1021/acs.jcim.5b00723)
☆15 · Sep 20, 2024 · Updated last year
The-Swarm-Corporation / IoTAgents
Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.
☆11 · Jan 27, 2025 · Updated last year
MaheepChaudhary / SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13 · Jan 26, 2025 · Updated last year
mega002 / qdmr-based-question-generation
The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".
☆12 · Oct 18, 2021 · Updated 4 years ago
goldberg-consulting / measured.one.inkwell-extension
Inkwell: Markdown to publication-quality PDF. Live preview, Pandoc + XeLaTeX compilation, runnable code blocks, and LaTeX template manage…
☆16 · Jul 2, 2026 · Updated 3 weeks ago
snu-larr / ibc_official
Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)
☆10 · Jul 6, 2023 · Updated 3 years ago
kyegomez / FastFF
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16 · Nov 11, 2024 · Updated last year
janwilmake / LLMTEXT-mcp
llms.txt -> MCP converter and other tools for the adoption of the `llms.txt` standard
☆49 · Jul 7, 2026 · Updated 2 weeks ago
nii-nlp / med-eval
Evaluation Pipeline for medical tasks.
☆12 · Apr 8, 2026 · Updated 3 months ago
Hoyyyaard / NavGPT
☆10 · Nov 16, 2023 · Updated 2 years ago
aws-samples / amazon-isv-plug-n-play
☆10 · Apr 26, 2023 · Updated 3 years ago
harbor-framework / harbor-index
A compact high-signal benchmark for evaluating frontier agents
☆21 · Updated this week
UKPLab / codeclarqa
Asking Clarification Questions for Code Generation in General-Purpose Programming Language
☆11 · May 26, 2023 · Updated 3 years ago
YuantianDing / HilbertProver
An Automatic Theorem Prover for Hilbert System, generating nearly-minimal proofs.
☆14 · Jan 21, 2025 · Updated last year