Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated last year
Alternatives and similar repositories for DSP
Users that are interested in DSP are comparing it to the libraries listed below
Sorting:
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Jun 3, 2024Updated last year
- ☆26May 30, 2023Updated 2 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]☆12May 20, 2022Updated 3 years ago
- ☆11Mar 20, 2023Updated 2 years ago
- Multi-thread version of simdjson☆15Jul 27, 2019Updated 6 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.☆12Mar 26, 2020Updated 5 years ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Jan 28, 2022Updated 4 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆363Dec 29, 2023Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"☆16Jul 4, 2022Updated 3 years ago
- Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning [NAACL 2022]☆19Jan 27, 2023Updated 3 years ago
- AI Alignment: A Comprehensive Survey☆136Nov 2, 2023Updated 2 years ago
- ☆50Mar 14, 2024Updated last year
- ☆49Jul 30, 2023Updated 2 years ago
- BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]☆52Oct 26, 2022Updated 3 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- ☆29Jan 16, 2023Updated 3 years ago
- An offical implementation of EHRDiff [TMLR]☆31Jun 25, 2024Updated last year
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆28Nov 22, 2022Updated 3 years ago
- Tools for content datamining and NLP at scale☆44Jun 20, 2024Updated last year
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆144Feb 24, 2025Updated last year
- vector quantization for stochastic gradient descent.☆35May 12, 2020Updated 5 years ago
- CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]☆81Jun 28, 2022Updated 3 years ago
- Research repo for AI aided drug discovery, de novo drug development and related topics☆38Jan 4, 2022Updated 4 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year
- ☆11Feb 28, 2024Updated 2 years ago
- Teaching Categories to Human Learners with Visual Explanations - CVPR 2018☆11Jun 21, 2022Updated 3 years ago
- ☆26Updated this week
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 6 months ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆19Updated this week
- Machine learning for molecules workshop 2022☆13Nov 30, 2022Updated 3 years ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Jul 16, 2025Updated 7 months ago