A toolkit for testing and improving named entity recognition [ESEC/FSE'23]
☆11Aug 31, 2023Updated 2 years ago
Alternatives and similar repositories for TestNER
Users that are interested in TestNER are comparing it to the libraries listed below
Sorting:
- [ICSE'25] Aligning the Objective of LLM-based Program Repair☆23Mar 8, 2025Updated 11 months ago
- ☆11Jan 19, 2025Updated last year
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆31Aug 8, 2025Updated 6 months ago
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- A toolkit for testing machine translation [ICSE'20, '21, ESEC/FSE'20]☆33Nov 15, 2021Updated 4 years ago
- A toolkit for hybrid log parsing☆18Aug 23, 2023Updated 2 years ago
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- Structure-Invariant Testing for Machine Translation [ICSE'20]☆16Dec 17, 2020Updated 5 years ago
- A toolkit for Light Log Anomaly Detection [ICSE'24]☆22Feb 22, 2025Updated last year
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆35Aug 12, 2025Updated 6 months ago
- MTTM: Metamorphic Testing for Textual Content Moderation Software☆32Feb 10, 2023Updated 3 years ago
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆71May 22, 2025Updated 9 months ago
- AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection [ASE'23]☆41Feb 20, 2024Updated 2 years ago
- Implementation of MetaVQA.☆12Jul 3, 2021Updated 4 years ago
- A basic repository for a Clang-based tool, with CMake integration.☆10Sep 22, 2023Updated 2 years ago
- 🎓 A collection of Code Example Files, Programming Assignments and Final Project for "Introduction to Data, Signal, and Image Analysis wi…☆13Jan 30, 2022Updated 4 years ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 2 months ago
- A Layered Approach for Multi-Agent Path Finding☆12Jan 5, 2023Updated 3 years ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- ☆16Apr 7, 2025Updated 10 months ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Beyond Words: A Multimodal Exploration of Persuasion in Memes☆13Jun 8, 2024Updated last year
- ☆13Feb 14, 2024Updated 2 years ago
- ☆11Feb 1, 2023Updated 3 years ago
- Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.☆10Sep 19, 2022Updated 3 years ago
- ☆16Mar 22, 2025Updated 11 months ago
- [ICLR 2026] "When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms"☆26Feb 3, 2026Updated 3 weeks ago
- ☆12Apr 9, 2025Updated 10 months ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆58Oct 1, 2025Updated 4 months ago
- Multilingual safety benchmark for Large Language Models☆53Sep 1, 2024Updated last year
- An Open-source Factuality Evaluation Demo for LLMs☆24Updated this week
- AI Memory System - Consciousness continuity through intelligent memory curation and retrieval☆18Jan 28, 2026Updated last month
- ☆20Nov 15, 2024Updated last year
- A log compression tool (ASE2024)☆16Apr 15, 2025Updated 10 months ago
- Advancing the frontier of efficient AI☆53Feb 10, 2026Updated 2 weeks ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 4 months ago
- ☆13Sep 12, 2024Updated last year
- SanRazor is a sanitizer check reduction tool aiming to incur little overhead while retaining all important sanitizer checks.☆56Jun 6, 2021Updated 4 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year