d-f / llm-summarization
LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset
☆12Updated 3 months ago
Alternatives and similar repositories for llm-summarization
Users that are interested in llm-summarization are comparing it to the libraries listed below
Sorting:
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆12Updated last year
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆19Updated 6 months ago
- ☆26Updated 10 months ago
- ☆45Updated last month
- [NAACL '25] Rationale-Guided Retrieval Augmented Generation for Medical Question Answering☆24Updated 2 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆21Updated 7 months ago
- ☆30Updated 6 months ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆28Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- In-context learning, Fine-Tuning, RLHF on Flan-T5☆14Updated last year
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆34Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆46Updated 5 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆88Updated 6 months ago
- ☆13Updated 4 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆93Updated 3 months ago
- Official codebase for permutation self-consistency.☆18Updated last year
- ☆47Updated 11 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆40Updated 2 years ago
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆47Updated 2 weeks ago
- ☆43Updated 3 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆55Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- official repository for ListT5☆44Updated 3 months ago
- MIRAGE is a light benchmark to evaluate RAG performance.☆13Updated 3 weeks ago
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆12Updated last week
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 5 months ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated last year
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆45Updated 2 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 9 months ago