mila-iqia / Casande-RLLinks
Casande-RL
☆11Updated 2 years ago
Alternatives and similar repositories for Casande-RL
Users that are interested in Casande-RL are comparing it to the libraries listed below
Sorting:
- ☆27Updated 5 months ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆27Updated last year
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆70Updated 4 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆21Updated 9 months ago
- ACL24☆10Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆68Updated 2 months ago
- ☆26Updated 3 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆41Updated last year
- ☆30Updated last year
- MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs [NeurIPS 2024]☆30Updated last week
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- Source code for InBedder, an instruction-following text embedder☆27Updated 9 months ago
- ☆28Updated 4 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆26Updated last year
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆39Updated last year
- [NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models☆98Updated 11 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 8 months ago
- AbstainQA, ACL 2024☆27Updated 9 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆49Updated 8 months ago
- ☆18Updated 10 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆19Updated 3 weeks ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆20Updated 6 months ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆25Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆77Updated 6 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated last week
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering."☆40Updated 2 years ago
- ☆45Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- ☆36Updated 11 months ago