☆23Oct 4, 2024Updated last year
Alternatives and similar repositories for nyu-debate-modeling
Users that are interested in nyu-debate-modeling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jun 4, 2024Updated last year
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated 2 years ago
- Type-level API for standard collections☆33Jun 6, 2016Updated 9 years ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models☆36Jun 1, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jan 14, 2026Updated 3 months ago
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Implicit generative models and related stuff based on the MMD, in PyTorch☆16Sep 24, 2020Updated 5 years ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 4 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆30May 23, 2024Updated last year
- Bootstrap (Linear) Thompson Sampling☆13Jun 30, 2016Updated 9 years ago
- ☆36Jun 4, 2018Updated 7 years ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆29Sep 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Jun 19, 2024Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆22Aug 30, 2025Updated 7 months ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 10 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆62Mar 30, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""☆19Oct 11, 2024Updated last year
- Code for the paper "Larger and more instructable language models become less reliable"☆34Oct 9, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- TensorFlow code for paper: Learning Generative ConvNets via Multi-grid Modeling and Sampling☆25Apr 16, 2018Updated 8 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆81Aug 30, 2023Updated 2 years ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆129Mar 22, 2024Updated 2 years ago
- ☆33Jun 24, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last week
- Control Fusion 360 with any AI through Model Context Protocol (MCP)☆78Jan 28, 2026Updated 2 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆18Mar 2, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Jun 7, 2023Updated 2 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- This collection of hypnosis scripts has been curated from various open source repositories.☆22Apr 7, 2019Updated 7 years ago
- Debiasing Through Data Attribution☆13May 23, 2024Updated last year
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains☆17Mar 4, 2025Updated last year
- ☆10Mar 13, 2023Updated 3 years ago