Wenyueh/inductive_reasoning_benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wenyueh/inductive_reasoning_benchmark)

Wenyueh / inductive_reasoning_benchmark

inductive reasoning benchmark with subregular hierarchy for string-to-string transformation

☆20

Alternatives and similar repositories for inductive_reasoning_benchmark

Users that are interested in inductive_reasoning_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

t54-labs / AgenticRiskStandard
View on GitHub
Agentic Risk Standard is a settlement-layer standard for trustworthy transactions with AI Agent
☆34Mar 29, 2026Updated 3 months ago
Wenyueh / game_theory
View on GitHub
How to create rational LLM-based agents? Using game-theoretic workflows!
☆110Jun 8, 2025Updated last year
AgentOptimizer / agentopt
View on GitHub
AgentOpt automatically finds the best LLM model combination for each step of your agent — optimizing for accuracy, cost, and latency.
☆85Updated this week
meituan-longcat / MineExplorer
View on GitHub
Reproduction code for paper "MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft"
☆17Jun 12, 2026Updated last month
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ElvishElvis / LCA-on-the-line
View on GitHub
LCA-on-the-line (ICML 2024 Oral)
☆14Feb 13, 2025Updated last year
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
whuAdv / AdvPattern
View on GitHub
☆10Mar 6, 2020Updated 6 years ago
dgcnz / FACT
View on GitHub
Code for [Re] On the Reproducibility of Post-Hoc Concept Bottleneck Models.
☆13Nov 27, 2024Updated last year
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
K-Kuyama / yet-another-UI-for-AW
View on GitHub
UI for ActivityWatch. Include category editor and viewer for multiple categorizations.
☆10Jan 31, 2024Updated 2 years ago
GongRzhe / Calendar-Autoauth-MCP-Server
View on GitHub
A Model Context Protocol (MCP) server for Google Calendar integration in Cluade Desktop with auto authentication support. This server ena…
☆12Mar 11, 2025Updated last year
keiji / region_cropper
View on GitHub
Help creating image dataset for machine learning.
☆10Nov 4, 2020Updated 5 years ago
haolunc / iGSM-Replication-physics-LLM
View on GitHub
This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.
☆17Sep 13, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lucianmarin / pybench
View on GitHub
Python benchmark tool inspired by Geekbench.
☆20Feb 21, 2026Updated 5 months ago
wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆20Oct 4, 2025Updated 9 months ago
hkust-nlp / model-task-align-rl
View on GitHub
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18Feb 9, 2026Updated 5 months ago
Trustworthy-and-Responsible-AI-Lab / Qu-ANTI-zation
View on GitHub
[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"
☆18Nov 9, 2021Updated 4 years ago
microsoft / social-reasoning-bench
View on GitHub
A benchmark to evaluate AI Agents in social domains.
☆18Updated this week
kenkenpa2126 / vanilla_transformer_from_scratch_with_JAX
View on GitHub
☆10Dec 18, 2023Updated 2 years ago
SCLBD / Effective_backdoor_defense
View on GitHub
☆14Oct 7, 2022Updated 3 years ago
a01sa01to / TitleAndURL_Picker
View on GitHub
Chrome Extension. As the name suggests.
☆10Jan 30, 2022Updated 4 years ago
zmzhang2000 / trustworthy-alignment
View on GitHub
Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
☆12Sep 2, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
giovannicoppola / alfred-yaanki
View on GitHub
yet another anki app
☆14Sep 9, 2024Updated last year
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆23Jun 15, 2025Updated last year
Jiaran-Ye / ImplicitReasoning
View on GitHub
Code for paper 'How does Transformer Learn Implicit Reasoning?'
☆18Jul 3, 2025Updated last year
amayuelas / multi-agent-attack
View on GitHub
MutliAgent Attack
☆15Oct 3, 2024Updated last year
Geralt-Targaryen / MC-Evaluation
View on GitHub
☆14May 21, 2024Updated 2 years ago
lyveng / pandas-hbase
View on GitHub
Pandas Helper Library for reading and writing DataFrames from and to HBase.
☆10Mar 8, 2018Updated 8 years ago
ubermenchh / mini-vllm
View on GitHub
☆21Jun 14, 2026Updated last month
llmrecsys / llmrecsys.github.io
View on GitHub
☆12Sep 23, 2023Updated 2 years ago
wumingqi / LLM-Math-Evaluation
View on GitHub
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.
☆21Jul 18, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
bbartoldson / TBA
View on GitHub
Official implementation of TBA for async LLM post-training.
☆31Nov 5, 2025Updated 8 months ago
ModelCloud / Device-SMI
View on GitHub
Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…
☆16Updated this week
DaisukeBekki / JSeM
View on GitHub
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13Apr 21, 2026Updated 3 months ago
tkm2261 / analytics_ansible
View on GitHub
Ansible for building kaggle environment
☆13Jul 30, 2019Updated 6 years ago
lisongx / atom-tidal
View on GitHub
Atom Editor plugin for tidalcycles(Since tidal version 0.8, the offical atom plugin atom-tidalcycles should be used)
☆14Mar 14, 2018Updated 8 years ago
dmis-lab / Outlier-Safe-Pre-Training
View on GitHub
[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models
☆39Nov 4, 2025Updated 8 months ago
tohinz / SVHN-Classifier
View on GitHub
Simple classifier to classify SVHN images, based on Keras with the Tensorflow backend.
☆17Feb 26, 2018Updated 8 years ago