ZBWpro / PretCoTandKE
☆26 Updated 8 months ago
Alternatives and similar repositories for PretCoTandKE
Users who are interested in PretCoTandKE are comparing it to the repositories listed below.
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆52 Updated 7 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Updated last year
- Contrastive Chain-of-Thought Prompting ☆68 Updated 2 years ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking ☆33 Updated last month
- The paper list of multilingual pre-trained models (Continual Updated). ☆24 Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 Updated last year
- ☆36 Updated 6 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024) ☆24 Updated 2 months ago
- ☆35 Updated last year
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning ☆15 Updated last year
- Code for Robust Fine-tuning (RbFT) ☆16 Updated 11 months ago
- Codebase for Instruction Following without Instruction Tuning ☆36 Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆85 Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar… ☆53 Updated last year
- ☆62 Updated last year
- ☆53 Updated last year
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting ☆27 Updated 2 months ago
- RewardAnything: Generalizable Principle-Following Reward Models ☆45 Updated 7 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆39 Updated 2 years ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆27 Updated 3 months ago
- ☆50 Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆32 Updated last year
- ☆32 Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆48 Updated 6 months ago
- ☆70 Updated 2 years ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆53 Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following ☆16 Updated last year
- ☆22 Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza… ☆20 Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo… ☆16 Updated last year