ContextualAI / gritlm
Generative Representational Instruction Tuning
☆620 · Updated last month
Alternatives and similar repositories for gritlm:
Users who are interested in gritlm are comparing it to the repositories listed below.
- ☆515 · Updated 5 months ago
- Official repository for ORPO ☆448 · Updated 10 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆459 · Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆436 · Updated this week
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆547 · Updated 4 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆681 · Updated last month
- RewardBench: the first evaluation tool for reward models. ☆555 · Updated last month
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆480 · Updated 6 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆715 · Updated 6 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆833 · Updated last week
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning ☆395 · Updated 11 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models" ☆481 · Updated 3 months ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ☆584 · Updated this week
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆651 · Updated 10 months ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award] ☆589 · Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts" ☆340 · Updated last year
- distributed trainer for LLMs ☆572 · Updated 11 months ago
- ☆278 · Updated last year
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆598 · Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,438 · Updated last week
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning ☆725 · Updated 2 years ago
- Codebase for Merging Language Models (ICML 2024) ☆816 · Updated 11 months ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models", (https://arxiv.org/abs/2208.03… ☆533 · Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆354 · Updated 7 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆873 · Updated 2 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆463 · Updated last year
- ☆523 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆692 · Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆545 · Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,148 · Updated 11 months ago