philschmid / knowledge-distillation-transformers-pytorch-sagemaker
☆42 · Updated 3 years ago
Alternatives and similar repositories for knowledge-distillation-transformers-pytorch-sagemaker:
Users who are interested in knowledge-distillation-transformers-pytorch-sagemaker are comparing it to the repositories listed below.
- Finetune mistral-7b-instruct for sentence embeddings ☆78 · Updated 9 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆179 · Updated 2 years ago
- DSIR large-scale data selection framework for language model training ☆241 · Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆67 · Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆142 · Updated 5 months ago
- Scalable training for dense retrieval models. ☆275 · Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆102 · Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆162 · Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆89 · Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 · Updated 2 years ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆125 · Updated 11 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆107 · Updated 7 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆186 · Updated 5 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆135 · Updated 3 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆92 · Updated last year
- Unofficial implementation of AlpaGasus ☆90 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆252 · Updated 7 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆157 · Updated 8 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆195 · Updated last year
- ☆271 · Updated last year
- contrastive decoding ☆193 · Updated 2 years ago
- ☆174 · Updated 2 years ago
- ☆251 · Updated last year
- Benchmarking library for RAG ☆167 · Updated this week
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆119 · Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆97 · Updated 11 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆54 · Updated 10 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆144 · Updated 9 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆67 · Updated 11 months ago
- Code for Zero-Shot Tokenizer Transfer ☆120 · Updated last month