philschmid / knowledge-distillation-transformers-pytorch-sagemakerLinks

☆47

Alternatives and similar repositories for knowledge-distillation-transformers-pytorch-sagemaker

Users that are interested in knowledge-distillation-transformers-pytorch-sagemaker are comparing it to the libraries listed below

Sorting:

p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆258Updated last year
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆400Updated last year
bigscience-workshop / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆105Updated 2 years ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆94Updated 2 years ago
mzbac / llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
☆203Updated 2 years ago
nelson-liu / lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
☆358Updated last year
facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆299Updated 2 months ago
AIR-Bench / AIR-Bench
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆153Updated last month
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆86Updated last year
xfactlab / orpo
Official repository for ORPO
☆463Updated last year
naver / bergen
Benchmarking library for RAG
☆224Updated last month
AI21Labs / in-context-ralm
☆286Updated last year
jongwooko / distillm
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
☆226Updated 5 months ago
kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆247Updated last year
nlp-uoregon / Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
☆97Updated 2 years ago
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆394Updated last year
DaoD / INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
☆204Updated 8 months ago
nlp-uoregon / mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
☆130Updated last year
huggingface / cosmopedia
☆536Updated 9 months ago
ParticleMedia / RAGTruth
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆196Updated 9 months ago
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆181Updated 2 years ago
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆479Updated 2 years ago
asahi417 / lmppl
Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …
☆163Updated 2 months ago
facebookresearch / tart
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆163Updated last year
microsoft / Multilingual-Evaluation-of-Generative-AI-MEGA
Code for Multilingual Eval of Generative AI paper published at EMNLP 2023
☆70Updated last year
huggingface / datablations
Scaling Data-Constrained Language Models
☆340Updated 2 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆245Updated 9 months ago
akoksal / LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
☆215Updated last year
AlexTMallen / adaptive-retrieval
☆185Updated 2 months ago
allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆224Updated 9 months ago