Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"
☆27May 1, 2025Updated 10 months ago
Alternatives and similar repositories for RTP-LX
Users who are interested in RTP-LX are comparing it to the repositories listed below
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆20Dec 14, 2024Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆25Mar 4, 2025Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆12Feb 11, 2026Updated last month
- ☆18Dec 12, 2025Updated 3 months ago
- Benchmark of LLMs on real open-source projects against dependency hell, legacy toolchains, and complex build systems.☆53Dec 23, 2025Updated 3 months ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Jul 19, 2024Updated last year
- Remix example showing how to ensure the Suspense fallback is rendered on route change☆10Mar 15, 2024Updated 2 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- The Universal Anaphora Scorer☆15Sep 2, 2024Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆87Sep 12, 2024Updated last year
- Improved version of the technical workshops for the 10-day ML4G camp on safety of AI systems☆19Mar 7, 2026Updated 2 weeks ago
- PyCon KR 2019 tutorial: "Getting Started with Implementing NLP Papers Using Naver Movie Rating Data"☆13Aug 21, 2019Updated 6 years ago
- Code for the EMNLP 2024 paper "Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis"☆12Nov 17, 2024Updated last year
- Documentation effort for the BookCorpus dataset☆34Jun 2, 2021Updated 4 years ago
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆18Oct 26, 2024Updated last year
- VK apps + tensorflow-js demo app☆12May 17, 2019Updated 6 years ago
- Thin wrapper for OpenAI GPT APIs☆11Dec 24, 2024Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆27Apr 29, 2019Updated 6 years ago
- Implementation of AdaCQR (COLING 2025)☆13Dec 30, 2024Updated last year
- ☆14Sep 17, 2025Updated 6 months ago
- ☆16Mar 4, 2024Updated 2 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- Like a bear. My official site☆10Mar 15, 2026Updated last week
- Example showing how to use `ComponentErrorBoundary`☆16Jun 21, 2024Updated last year
- Python library written in Rust for creating/transporting/parsing AI characters between different frontends (TavernAI, SillyTavern, TextGe…☆21Nov 14, 2025Updated 4 months ago
- Sample application used in the Shuwa System book "Introduction to Front-End Development: Professional Development Tools, Design, and Implementation".☆13Aug 27, 2023Updated 2 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆12Jun 14, 2021Updated 4 years ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence☆22Nov 19, 2025Updated 4 months ago
- This repository is an experiment with an agent that searches documents and asks questions repeatedly in response to the main question. It …☆19Jul 17, 2023Updated 2 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 9 months ago
- A d3 visualization of the emergence of online echo chambers.☆17Nov 7, 2023Updated 2 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Dec 8, 2023Updated 2 years ago