philschmid / deep-learning-pytorch-huggingface
☆624Updated this week
Related projects:
- ☆1,194Updated this week
- Official repository for ORPO☆409Updated 3 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆689Updated last week
- A collection of prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training chat LLMs such as ChatGPT.☆498Updated 5 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'☆1,130Updated 2 weeks ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,306Updated 5 months ago
- Minimalistic large language model 3D-parallelism training☆1,111Updated this week
- Code for fine-tuning Platypus fam LLMs using LoRA☆625Updated 7 months ago
- An open collection of methodologies to help with successful training of large language models.☆441Updated 7 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆657Updated 5 months ago
- Generative Representational Instruction Tuning☆525Updated 2 weeks ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆508Updated 6 months ago
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin…☆659Updated this week
- distributed trainer for LLMs☆521Updated 3 months ago
- Efficient Retrieval Augmentation and Generation Framework☆1,255Updated last week
- An open collection of implementation tips, tricks and resources for training large language models☆455Updated last year
- ReFT: Representation Finetuning for Language Models☆1,076Updated 2 weeks ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,352Updated 6 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆858Updated 4 months ago
- ☆1,456Updated 3 weeks ago
- Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆748Updated last year
- LLM Workshop by Sourab Mangrulkar☆322Updated 3 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,396Updated this week
- A library for advanced large language model reasoning☆1,124Updated 2 weeks ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆658Updated last year
- Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03…☆508Updated 9 months ago
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,099Updated 7 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,451Updated last month
- Best practices for distilling large language models.☆370Updated 7 months ago
- Expanding natural instructions☆941Updated 9 months ago