☆123May 2, 2024Updated 2 years ago
Alternatives and similar repositories for opinions_qa
Users that are interested in opinions_qa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Mar 4, 2024Updated 2 years ago
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆96May 25, 2023Updated 2 years ago
- ☆11Jul 7, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆22Jan 17, 2024Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Aug 21, 2024Updated last year
- Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"☆16Mar 6, 2026Updated last month
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆32Jul 22, 2024Updated last year
- ☆23Mar 8, 2024Updated 2 years ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆32Jun 16, 2024Updated last year
- Data and models for Misinfo Reaction Frames paper.☆14Jun 9, 2024Updated last year
- YesBut - Multimodal Satire Comprehension Dataset☆19Oct 23, 2024Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆80Jan 16, 2026Updated 3 months ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Dec 15, 2021Updated 4 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆25Dec 12, 2023Updated 2 years ago
- ☆15Oct 24, 2022Updated 3 years ago
- UniGraph: Learning a Unified Cross-Domain Foundation Model for Text-Attributed Graphs (KDD'25)☆28Jun 6, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS 2023] Model-enhanced Vector Index☆26May 9, 2024Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆119Oct 23, 2023Updated 2 years ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆112Nov 15, 2024Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer☆585Aug 7, 2025Updated 8 months ago
- ☆12May 18, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆59Oct 14, 2025Updated 6 months ago
- ☆19Jun 21, 2025Updated 10 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆185Mar 12, 2026Updated last month
- Project repository for "Evaluating the persuasive influence of political microtargeting with large language models" by Kobi Hackenburg an…☆11Jun 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jul 6, 2023Updated 2 years ago
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 4 years ago
- Reinforcement Learning via Regressing Relative Rewards☆40Dec 12, 2024Updated last year
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆66Jun 9, 2025Updated 10 months ago
- Overlooked Factors in Concept-based Explanations: Dataset Choice, Concept Learnability, and Human Capability (CVPR 2023)☆10Mar 14, 2023Updated 3 years ago
- ☆33Mar 13, 2025Updated last year