huawei-lin / LLMsEasyFinetune
An easy-to-run implementation for finetuning large language models (LLMs) such as Llama and Gemma, supporting full-parameter finetuning, LoRA, and QLoRA.
☆12 Updated last year
Alternatives and similar repositories for LLMsEasyFinetune
Users who are interested in LLMsEasyFinetune are comparing it to the libraries listed below.
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆38 Updated 2 years ago
- Contrastive Chain-of-Thought Prompting ☆68 Updated 2 years ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models" ☆49 Updated 2 years ago
- ☆39 Updated last year
- ☆22 Updated 3 years ago
- ☆18 Updated last year
- Methods and evaluation for aligning language models temporally ☆30 Updated last year
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning ☆53 Updated last year
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering." ☆41 Updated 2 years ago
- ☆41 Updated last week
- The git repository of Modular Prompted Chatbot paper ☆35 Updated 2 years ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆53 Updated 2 years ago
- ☆88 Updated 2 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch" ☆40 Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆79 Updated last year
- Data and code for EMNLP 2023 industry-track paper "Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-W… ☆30 Updated 2 years ago
- ☆41 Updated 2 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆78 Updated 2 years ago
- ☆39 Updated 3 years ago
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering ☆38 Updated last year
- official repository for ListT5 ☆48 Updated 2 months ago
- Clinical NLP Shared Task @ NAACL'24 ☆40 Updated 5 months ago
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa… ☆55 Updated 6 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆64 Updated 2 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions" ☆40 Updated 2 years ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors" ☆43 Updated 8 months ago
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models" ☆51 Updated 2 months ago
- ☆49 Updated last year
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration" ☆42 Updated last year
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering" ☆17 Updated 3 years ago