Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles.
☆41Mar 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for icai
Users that are interested in icai are comparing it to the libraries listed below
Sorting:
- Feedback Forensics: An open-source toolkit to measure AI personality☆24Dec 2, 2025Updated 3 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- Code and data for automatic paraphrase dataset augmentation.☆11Mar 8, 2021Updated 5 years ago
- ☆12Dec 13, 2022Updated 3 years ago
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- ☆13Sep 27, 2022Updated 3 years ago
- Code and data from the paper 'Human Feedback is not Gold Standard'☆20Mar 6, 2026Updated 2 weeks ago
- Teaching Models to Express Their Uncertainty in Words☆39May 26, 2022Updated 3 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16May 12, 2020Updated 5 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- ☆17Mar 15, 2023Updated 3 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆13May 18, 2021Updated 4 years ago
- Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic)☆26Oct 17, 2023Updated 2 years ago
- ☆18Mar 23, 2025Updated 11 months ago
- 🏆 Ambassador Paper for Innovative Use of NLP for Building Educational Applications 2023: Is ChatGPT a Good Teacher Coach? Measuring Zero…☆14Jul 21, 2024Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Feb 24, 2023Updated 3 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated last month
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- ☆20May 12, 2022Updated 3 years ago
- A plugin for Figma that draws Sigils to a document☆21Jan 5, 2023Updated 3 years ago
- A tool for calling (and calling out to) large language models.☆16Aug 13, 2024Updated last year
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks☆14Feb 23, 2023Updated 3 years ago
- ☆148Jul 23, 2025Updated 7 months ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison☆18Dec 15, 2023Updated 2 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- 短视频内容理解与推荐竞赛☆12Feb 18, 2019Updated 7 years ago
- ☆121Jan 19, 2026Updated 2 months ago
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda☆18Jan 6, 2026Updated 2 months ago
- kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023)☆11Jul 26, 2023Updated 2 years ago
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp☆14Feb 18, 2023Updated 3 years ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
- Code for our ACL19 paper on argument generation☆14Nov 9, 2020Updated 5 years ago
- ☆21Feb 10, 2025Updated last year
- Distilling Model Failures as Directions in Latent Space☆47Feb 8, 2023Updated 3 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago