damanimehul / RLCRView external linksLinks
Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
☆53Aug 20, 2025Updated 5 months ago
Alternatives and similar repositories for RLCR
Users that are interested in RLCR are comparing it to the libraries listed below
Sorting:
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- ☆28Sep 13, 2021Updated 4 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning"☆11Mar 30, 2020Updated 5 years ago
- Test-Time Adaptation via Conjugate Pseudo-Labels☆42May 25, 2023Updated 2 years ago
- ☆12Sep 24, 2024Updated last year
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated 11 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 2 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆15Aug 15, 2025Updated 5 months ago
- Code accompanying our ICML 2020 paper on choice set optimization in group decision-making.☆11Jun 27, 2020Updated 5 years ago
- UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs☆11Apr 13, 2023Updated 2 years ago
- ☆12Nov 2, 2021Updated 4 years ago
- A list of all papers related to anomaly detection in NeurIPS 2020.☆10Jan 13, 2021Updated 5 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- Toolkit in Python for the acquisition, analysis and visualization of motion capture using IMU☆14May 19, 2021Updated 4 years ago
- ☆13Feb 14, 2022Updated 3 years ago
- ☆11Jan 21, 2021Updated 5 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 8 months ago
- Code release for Unsupervised Domain Adaptation via Distilled Discriminative Clustering published by Pattern Recognition in 2022☆11May 19, 2023Updated 2 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 4 years ago
- A Google Colab for DFDNet: Blind Face Restoration☆12Aug 9, 2021Updated 4 years ago
- ☆12Oct 28, 2022Updated 3 years ago
- Python package to accelerate research on generalized out-of-distribution (OOD) detection.☆15Jun 19, 2024Updated last year
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- ☆11Feb 5, 2024Updated 2 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Plugin to support creating and developing Mbed OS projects in CLion☆10May 28, 2021Updated 4 years ago
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- This repo is for our EMNLP2023 short paper (Findings): InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Langua…☆13Jan 11, 2024Updated 2 years ago
- ☆14Mar 2, 2025Updated 11 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated last year
- [NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration☆19Nov 30, 2025Updated 2 months ago
- Supercharging Imbalanced Data Learning WithCausal Representation Transfer☆12Nov 29, 2021Updated 4 years ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Oct 14, 2024Updated last year
- PyTorch implementations of the beta divergence loss.☆11Jan 31, 2022Updated 4 years ago
- SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models☆15Jun 24, 2024Updated last year
- A forked version of thesisdown for writing UNSW theses with bookdown and RMarkdown☆11Jan 12, 2018Updated 8 years ago
- A wrapper around pytorch module objects with a sklearn-like interface, allowing boilerplate-free training of complex neural nets.☆14Dec 25, 2017Updated 8 years ago