Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for DSP
Users that are interested in DSP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆54Jun 3, 2024Updated 2 years ago
- ☆26May 30, 2023Updated 3 years ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆33Dec 6, 2023Updated 2 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆12Oct 8, 2021Updated 4 years ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Jun 23, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A large-scale, fine-grained, diverse preference dataset (and models).☆368Dec 29, 2023Updated 2 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Jan 28, 2022Updated 4 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- ☆11Mar 20, 2023Updated 3 years ago
- Biomedical Entity Linking Benchmark☆14Dec 10, 2024Updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.☆12Mar 26, 2020Updated 6 years ago
- AI Alignment: A Comprehensive Survey☆137Nov 2, 2023Updated 2 years ago
- ☆21May 22, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]☆52Oct 26, 2022Updated 3 years ago
- Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning [NAACL 2022]☆19Jan 27, 2023Updated 3 years ago
- ☆50Jul 30, 2023Updated 2 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- A quick test of SIREN on out-of-sample tasks☆13Jul 4, 2020Updated 5 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Oct 29, 2023Updated 2 years ago
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Oct 16, 2023Updated 2 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Jul 31, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Social-AI papers across computing communities, courses, and dissertations.☆21Apr 8, 2026Updated 2 months ago
- LLMPerf is a library for validating and benchmarking LLMs☆11Aug 13, 2024Updated last year
- The implementation for the work "Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems".☆23Jun 11, 2025Updated last year
- Collection of papers for scalable automated alignment.☆92Oct 22, 2024Updated last year
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,606Nov 24, 2025Updated 7 months ago
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆27Nov 22, 2022Updated 3 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- The first real-world FL benchmark for legal NLP☆13Nov 29, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022)☆35Oct 17, 2023Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated 2 years ago
- An awesome & curated list of anything that might be useful for computer science students☆13Mar 27, 2023Updated 3 years ago
- Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…☆29Nov 2, 2023Updated 2 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios