philschmid / deep-learning-pytorch-huggingface
☆1,172 · Updated 2 months ago
Alternatives and similar repositories for deep-learning-pytorch-huggingface:
Users interested in deep-learning-pytorch-huggingface are comparing it to the libraries listed below.
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,482 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,671 · Updated last week
- Minimalistic large language model 3D-parallelism training ☆1,836 · Updated this week
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,464 · Updated 2 months ago
- AllenAI's post-training codebase ☆2,939 · Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,255 · Updated 2 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' ☆1,500 · Updated 3 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,736 · Updated 4 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆837 · Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,515 · Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,479 · Updated last year
- Recipes to scale inference-time compute of open models ☆1,066 · Updated 2 months ago
- Reference implementation for DPO (Direct Preference Optimization) ☆2,553 · Updated 8 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,190 · Updated 5 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆886 · Updated 2 months ago
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit… ☆1,007 · Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆1,843 · Updated 8 months ago
- ☆1,017 · Updated 4 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆693 · Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,366 · Updated this week
- Summarize existing representative LLMs text datasets. ☆1,259 · Updated last month
- Curated list of datasets and tools for post-training. ☆3,002 · Updated 3 months ago
- Official repository for ORPO ☆450 · Updated 11 months ago
- LLM Workshop by Sourab Mangrulkar ☆379 · Updated 10 months ago
- Generative Representational Instruction Tuning ☆624 · Updated last month
- distributed trainer for LLMs ☆573 · Updated 11 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆719 · Updated 7 months ago
- Reading list of Instruction-tuning. A trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022). ☆768 · Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆867 · Updated this week
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥 ☆1,455 · Updated 3 weeks ago