Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for DSP
Users who are interested in DSP are comparing it to the libraries listed below.
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Jun 3, 2024Updated last year
- ☆26May 30, 2023Updated 2 years ago
- PyTorch implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Jan 28, 2022Updated 4 years ago
- Code and data for "An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity Detection".☆15Mar 26, 2022Updated 4 years ago
- Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]☆12May 20, 2022Updated 3 years ago
- ☆11Mar 20, 2023Updated 3 years ago
- Biomedical Entity Linking Benchmark☆14Dec 10, 2024Updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- ☆19Apr 22, 2024Updated last year
- ML Benchmarks in Algebraic Combinatorics☆25Jan 15, 2026Updated 2 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.☆12Mar 26, 2020Updated 6 years ago
- Multi-thread version of simdjson☆15Jul 27, 2019Updated 6 years ago
- AI Alignment: A Comprehensive Survey☆137Nov 2, 2023Updated 2 years ago
- ☆21May 22, 2023Updated 2 years ago
- BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]☆52Oct 26, 2022Updated 3 years ago
- Instruction Following Eval☆16Jan 16, 2025Updated last year
- ☆49Jul 30, 2023Updated 2 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- DeepEvolve is a research and coding agent for new algorithm discovery in different science domains with Deep Research and AlphaEvolve.☆126Oct 11, 2025Updated 5 months ago
- The implementation of LLMTreeRec☆14Dec 9, 2024Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Oct 29, 2023Updated 2 years ago
- Github Repo for ICML 2022 paper: Communication-Efficient Adaptive Federated Learning☆10Nov 18, 2022Updated 3 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Jul 31, 2023Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Calculating Expected Time for training LLM.☆39Apr 17, 2023Updated 2 years ago
- Code for AISTATS'25 paper - On the Power of Adaptive Weighted Aggregation in Heterogeneous Federated Learning and Beyond☆13Sep 23, 2025Updated 6 months ago
- Social-AI papers across computing communities, courses, and dissertations.☆21Jun 10, 2025Updated 9 months ago
- LLMPerf is a library for validating and benchmarking LLMs☆11Aug 13, 2024Updated last year
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆14Jul 11, 2023Updated 2 years ago
- Code for Neural Networks journal paper - StoCFL: A stochastically clustered federated learning framework for Non-IID data with dynamic cl…☆12Apr 28, 2024Updated last year
- The implementation for the work "Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems".☆22Jun 11, 2025Updated 9 months ago
- ☆10May 8, 2024Updated last year
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,595Nov 24, 2025Updated 4 months ago
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆27Nov 22, 2022Updated 3 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago