philschmid / knowledge-distillation-transformers-pytorch-sagemaker
☆39 Updated 2 years ago
Alternatives and similar repositories for knowledge-distillation-transformers-pytorch-sagemaker:
Users who are interested in knowledge-distillation-transformers-pytorch-sagemaker are comparing it to the repositories listed below
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆106 Updated 6 months ago
- Unofficial implementation of AlpaGasus ☆90 Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆102 Updated last year
- Scalable training for dense retrieval models. ☆273 Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆74 Updated 8 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆124 Updated 10 months ago
- DSIR large-scale data selection framework for language model training ☆242 Updated 9 months ago
- ☆268 Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆137 Updated 4 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆162 Updated last year
- ☆173 Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆136 Updated 6 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆179 Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆89 Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆171 Updated 3 months ago
- contrastive decoding ☆190 Updated 2 years ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆118 Updated last month
- ☆124 Updated this week
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆132 Updated last month
- Scaling Data-Constrained Language Models ☆330 Updated 3 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆99 Updated 2 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆92 Updated last year
- Benchmarking library for RAG ☆154 Updated this week
- Reverse Instructions to generate instruction tuning data with corpus examples ☆207 Updated 10 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆113 Updated 4 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆71 Updated 7 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024) ☆204 Updated 7 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆171 Updated 2 weeks ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆129 Updated 2 months ago
- A Multilingual Replicable Instruction-Following Model ☆94 Updated last year