chujiezheng / chat_templatesLinks
Chat Templates for ๐ค HuggingFace Large Language Models
โ693Updated 8 months ago
Alternatives and similar repositories for chat_templates
Users that are interested in chat_templates are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data โฆโ756Updated 5 months ago
- FuseAI Projectโ579Updated 6 months ago
- โ536Updated 9 months ago
- An Open Source Toolkit For LLM Distillationโ712Updated last month
- A collection of benchmarks and datasets for evaluating LLM.โ498Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsโ1,820Updated last week
- Codebase for Merging Language Models (ICML 2024)โ844Updated last year
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.โ739Updated 10 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuningโ661Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.โ547Updated last year
- Generative Representational Instruction Tuningโ664Updated last month
- โ893Updated last month
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.โ488Updated 11 months ago
- RewardBench: the first evaluation tool for reward models.โ624Updated 2 months ago
- Automatic evals for LLMsโ519Updated last month
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"โ506Updated 7 months ago
- Official repository for ORPOโ462Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Rewardโ913Updated 6 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuningโ360Updated 11 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).โ879Updated last month
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruningโ630Updated last year
- Recipes to scale inference-time compute of open modelsโ1,112Updated 3 months ago
- A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt ๆถๅฝๅ็งๅๆ ท็ๆไปคๆฐๆฎ้, ็จไบ่ฎญ็ป ChatLLM ๆจกๅใโ689Updated last year
- โ955Updated 6 months ago
- Evaluate your LLM's response with Prometheus and GPT4 ๐ฏโ979Updated 3 months ago
- The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]โ280Updated 5 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]โ566Updated 8 months ago
- Representation Engineering: A Top-Down Approach to AI Transparencyโ865Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"โ355Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Contextโ470Updated last year