chujiezheng / chat_templates
Chat Templates for π€ HuggingFace Large Language Models
β529Updated last week
Related projects β
Alternatives and complementary repositories for chat_templates
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality sβ¦β480Updated last week
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuningβ610Updated 5 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.β644Updated last month
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.β402Updated 2 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruningβ553Updated 8 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ788Updated this week
- Minimalistic large language model 3D-parallelism trainingβ1,229Updated last week
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.β528Updated 8 months ago
- β1,263Updated this week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuningβ334Updated 2 months ago
- This repo contains the source code for RULER: Whatβs the Real Context Size of Your Long-Context Language Models?β698Updated 2 weeks ago
- Official repository for ORPOβ419Updated 5 months ago
- An Open Source Toolkit For LLM Distillationβ352Updated last month
- Codebase for Merging Language Models (ICML 2024)β769Updated 6 months ago
- Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)β817Updated last week
- Implementation of paper Data Engineering for Scaling Language Models to 128K Contextβ435Updated 7 months ago
- Generative Representational Instruction Tuningβ562Updated this week
- β493Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β1,612Updated this week
- β447Updated 2 weeks ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ1,554Updated 2 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).β739Updated last week
- YaRN: Efficient Context Window Extension of Large Language Modelsβ1,341Updated 6 months ago
- ReFT: Representation Finetuning for Language Modelsβ1,147Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 π―β795Updated 2 months ago
- RewardBench: the first evaluation tool for reward models.β426Updated 2 weeks ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"β428Updated 6 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contextsβ276Updated 2 months ago
- FuseAI Projectβ448Updated 2 months ago
- β294Updated 5 months ago