thu-coai/AutoDetect

Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.

☆47

Alternatives and similar repositories for AutoDetect

Users who are interested in AutoDetect are comparing it to the libraries listed below.

Dongping-Chen / ISG
(ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.
☆31 · Aug 7, 2025 · Updated 11 months ago
jmnian / WRAG
Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"
☆16 · Oct 2, 2025 · Updated 9 months ago
illidanlab / RCA
Implementation for paper: RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection
☆16 · Oct 25, 2022 · Updated 3 years ago
thu-coai / SPaR
☆47 · Jun 11, 2025 · Updated last year
THUDM / ChatGLM-Math
☆82 · Apr 18, 2024 · Updated 2 years ago
Dongping-Chen / MixSet
(NAACL 2024) Official code repository for Mixset.
☆27 · Dec 4, 2024 · Updated last year
HowieHwong / DataGen
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆69 · Mar 8, 2025 · Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 9 months ago
JinaLeejnl / 2D-TPE
2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)
☆10 · Apr 15, 2025 · Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆25 · Nov 25, 2024 · Updated last year
Flossiee / HonestyLLM
[NeurIPS 2024] HonestLLM: Toward an Honest and Helpful Large Language Model
☆29 · Jun 10, 2025 · Updated last year
mozhu621 / SuperWriter
☆36 · Jun 5, 2025 · Updated last year
Job-Bench / job-bench-eval
Official eval scripts for JobBench
☆28 · Updated this week
foreverlasting1202 / QuestA
☆22 · Jan 2, 2026 · Updated 6 months ago
HypherX / Evolution-Analysis
☆25 · Dec 13, 2024 · Updated last year
leopoldwhite / Awesome-Inference-Time-Trustworthiness
☆15 · May 15, 2026 · Updated 2 months ago
othr-nlp / rage_toolkit
☆11 · Sep 27, 2024 · Updated last year
princeton-nlp / continual-factoid-memorization
Continual Memorization of Factoids in Large Language Models
☆12 · Nov 20, 2024 · Updated last year
JieyuZ2 / TaskMeAnything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
☆71 · Nov 27, 2024 · Updated last year
NUSTM / LLMs-Waver-In-Judgments
☆12 · Sep 23, 2024 · Updated last year
RUCAIBox / FIGA
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10 · May 5, 2024 · Updated 2 years ago
uservan / ThinkPO
☆17 · Aug 1, 2025 · Updated 11 months ago
VisualSphinx / VisualSphinx
☆17 · Jun 3, 2025 · Updated last year
zzh-SJTU / CRT-QA
The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…
☆13 · May 19, 2025 · Updated last year
hrwise-nlp / AppBench
Code for the EMNLP 2024 paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16 · Nov 4, 2024 · Updated last year
Liyan06 / ChartMuseum
[NeurIPS 2025] ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
☆24 · Apr 20, 2026 · Updated 3 months ago
WilliamZR / ProTrix
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
☆17 · Nov 15, 2024 · Updated last year
clayandgithub / rnn_ner
Chinese NER based on RNN
☆12 · Oct 14, 2016 · Updated 9 years ago
Junjie-Ye / ToolEyes
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74 · May 13, 2025 · Updated last year
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18 · Dec 19, 2024 · Updated last year
trestad / mitigating-reversal-curse
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14 · Aug 2, 2024 · Updated last year
RobustNLP / DeRTa
A novel approach to improve the safety of large language models, enabling them to transition effectively from an unsafe to a safe state.
☆72 · May 22, 2025 · Updated last year
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆254 · Nov 10, 2024 · Updated last year
jjbrophy47 / instance_based_interpretability
Existing literature about training-data analysis.
☆17 · Dec 17, 2021 · Updated 4 years ago
thu-coai / JailbreakDefense_GoalPriority
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆29 · Jul 9, 2024 · Updated 2 years ago
GasolSun36 / SURf
[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆11 · Oct 11, 2024 · Updated last year
RUCAIBox / SWE-World
☆49 · Mar 6, 2026 · Updated 4 months ago
PlusLabNLP / VISCO
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆13 · Jun 7, 2025 · Updated last year
yiqingxyq / RepoST
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24 · Mar 18, 2025 · Updated last year