jaehunjung1/cascaded-selective-evaluation

jaehunjung1 / cascaded-selective-evaluation

☆29

Alternatives and similar repositories for cascaded-selective-evaluation

Users who are interested in cascaded-selective-evaluation are comparing it to the repositories listed below.

allenai / hybrid-preferences
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29 · Jul 23, 2025 · Updated last year
jifan-chen / Fact-checking-via-Raw-Evidence
Code for the arXiv paper: Complex Claim Verification with Evidence Retrieved in the Wild
☆13 · Nov 27, 2023 · Updated 2 years ago
jaehunjung1 / impossible-distillation
☆18 · Jul 3, 2024 · Updated 2 years ago
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
stellalisy / alfa
Repository for the paper: Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning
☆18 · Feb 21, 2025 · Updated last year
paul-rottger / issuebench
Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"
☆17 · Mar 6, 2026 · Updated 4 months ago
jxnl / instructor-classify
☆37 · May 5, 2025 · Updated last year
seilna / CNN-Units-in-NLP
Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs
☆26 · Mar 9, 2019 · Updated 7 years ago
GXimingLu / IPA
Codebase for Inference-Time Policy Adapters
☆25 · Nov 3, 2023 · Updated 2 years ago
jjcherian / conformal-safety
☆35 · Nov 26, 2024 · Updated last year
allenai / MacGyver
Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?
☆30 · Mar 26, 2024 · Updated 2 years ago
jaehunjung1 / Maieutic-Prompting
☆52 · Oct 24, 2023 · Updated 2 years ago
modal-labs / ci-on-modal
A sample pattern for running CI tests on Modal
☆19 · Apr 12, 2025 · Updated last year
allenai / clarifydelphi
☆13 · Apr 24, 2024 · Updated 2 years ago
wade3han / normlens
An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…
☆10 · May 9, 2024 · Updated 2 years ago
Yebin46 / FLEUR
[ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model
☆17 · Apr 28, 2025 · Updated last year
amazon-science / llm-open-domain-table-reasoner
Official implementation of OpenTab (ICLR 2024)
☆14 · Mar 27, 2024 · Updated 2 years ago
JiwanChung / VisArgs
Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"
☆11 · Apr 11, 2025 · Updated last year
axiomic-ai / axiomic
Creating Generative AI apps that work
☆17 · Apr 14, 2025 · Updated last year
jaehunjung1 / prismatic-synthesis
☆28 · May 27, 2025 · Updated last year
AI-for-Animals / ahb
Animal Harm Assessment public repository
☆12 · May 3, 2026 · Updated 2 months ago
PlusLabNLP / Com2Sense
Dataset & Code for Com2Sense Benchmark
☆13 · Sep 8, 2021 · Updated 4 years ago
ucsb-goard-lab / EstrousNet
EstrousNet is a deep learning network that provides unbiased classification of estrous stage.
☆21 · Aug 28, 2025 · Updated 11 months ago
adlnlp / pdfvqa
☆18 · Jun 12, 2024 · Updated 2 years ago
wade3han / champagne
An official codebase for the paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos" (ICCV 23)
☆52 · Aug 13, 2023 · Updated 2 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
jiacheng-xu / lattice-generation
Code for Massive-scale Decoding for Text Generation using Lattices
☆44 · Jul 29, 2022 · Updated 3 years ago
waitwaitforget / ImageNet-Hierarchy-Visualization
Visualizing the hierarchical structure of ImageNet classes.
☆15 · Apr 8, 2018 · Updated 8 years ago
ritheshkumar95 / minimal_diffusion_models
☆16 · Dec 31, 2021 · Updated 4 years ago
lab-smile / DOMINO
☆11 · Nov 19, 2025 · Updated 8 months ago
vered1986 / self_talk
Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk"
☆79 · Jul 19, 2021 · Updated 5 years ago
UCSC-VLAA / ReasoningEval
Official repo of "Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains".
☆43 · Jun 6, 2025 · Updated last year
alisawuffles / DExperts
Code associated with the ACL 2021 DExperts paper
☆119 · May 24, 2023 · Updated 3 years ago
kharvd / nand2tetris
Nand2Tetris course at the "Intellektual" school
☆19 · May 11, 2017 · Updated 9 years ago
Social-Nav / tvss_nav
☆22 · Feb 4, 2026 · Updated 5 months ago
ZerojumpLine / ModelEvaluationUnderClassImbalance
[MICCAI 2022] Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores.
☆12 · Jun 7, 2024 · Updated 2 years ago
declare-lab / DoubleMix
Code for the COLING 2022 paper "DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification"
☆19 · Oct 19, 2022 · Updated 3 years ago
open-compass / ProSA
[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
☆29 · May 22, 2025 · Updated last year
zhang64-llnl / Mix-n-Match-Calibration
☆38 · Nov 13, 2020 · Updated 5 years ago