stanfordnlp / pyreft
ReFT: Representation Finetuning for Language Models
β1,373Updated 2 weeks ago
Alternatives and similar repositories for pyreft:
Users that are interested in pyreft are comparing it to the libraries listed below
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'β1,400Updated 3 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ970Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 π―β841Updated last week
- Bringing BERT into modernity via both architecture changes and scalingβ1,045Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β1,879Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,147Updated last week
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).β785Updated 2 weeks ago
- β2,289Updated this week
- Minimalistic large language model 3D-parallelism trainingβ1,386Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.β1,992Updated last month
- Generative Representational Instruction Tuningβ584Updated 2 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ1,654Updated 5 months ago
- A library for advanced large language model reasoningβ1,659Updated this week
- Code for Quiet-STaRβ698Updated 4 months ago
- A reading list on LLM based Synthetic Data Generation π₯β969Updated 2 months ago
- Recipes to scale inference-time compute of open modelsβ932Updated this week
- The official implementation of Self-Play Fine-Tuning (SPIN)β1,099Updated 8 months ago
- YaRN: Efficient Context Window Extension of Large Language Modelsβ1,398Updated 9 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"β831Updated last month
- β484Updated last month
- Tools for merging pretrained large language models.β5,113Updated last week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuningβ346Updated 4 months ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projectionβ1,481Updated 2 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ2,311Updated this week
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"β977Updated 3 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,β¦β1,930Updated 7 months ago
- β769Updated 3 weeks ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAIβ1,358Updated 9 months ago
- β996Updated last month
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality sβ¦β565Updated last week