jxzhangjhu / awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
☆23Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-LLM-controlled-decoding-generation
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆22Updated 2 months ago
- Restore safety in fine-tuned language models through task arithmetic☆26Updated 7 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆57Updated 8 months ago
- ☆41Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆33Updated 3 months ago
- Multilingual safety benchmark for Large Language Models☆22Updated 2 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆29Updated 6 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆39Updated 3 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆68Updated 3 weeks ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆50Updated 2 months ago
- This repository contains data, code and models for contextual noncompliance.☆18Updated 3 months ago
- Min-K%++: Improved baseline for detecting pre-training data of LLMs https://arxiv.org/abs/2404.02936☆26Updated 5 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆67Updated 5 months ago
- We have released the code and demo program required for LLM with self-verification☆48Updated last year
- Official code for the paper: Evaluating Copyright Takedown Methods for Language Models☆15Updated 3 months ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Updated last year
- ☆26Updated last year
- Directional Preference Alignment☆49Updated last month
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆64Updated last year
- The Paper List on Data Contamination for Large Language Models Evaluation.☆74Updated this week
- ☆33Updated last year
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆44Updated last year
- "TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]☆28Updated this week
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆32Updated 10 months ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆97Updated last year
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆38Updated last month
- ☆15Updated 3 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆46Updated 4 months ago
- ☆27Updated 8 months ago