NVIDIA / RULERLinks

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

☆1,320

Alternatives and similar repositories for RULER

Users that are interested in RULER are comparing it to the libraries listed below

Sorting:

jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,613Updated last year
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆659Updated last year
aphrodite-engine / aphrodite-engine
Large-scale LLM inference engine
☆1,562Updated this week
lmarena / arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
☆935Updated 3 months ago
gkamradt / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,050Updated last year
apoorvumang / prompt-lookup-decoding
☆572Updated last year
chujiezheng / chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
☆703Updated 9 months ago
codelion / optillm
Optimizing inference proxy for LLMs
☆2,988Updated last week
NousResearch / Hermes-Function-Calling
☆1,093Updated last year
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,987Updated this week
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,108Updated 4 months ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆746Updated last year
vllm-project / llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
☆2,073Updated this week
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,512Updated 2 months ago
FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆511Updated last year
arcee-ai / DistillKit
An Open Source Toolkit For LLM Distillation
☆734Updated 3 months ago
Leeroo-AI / mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆492Updated last year
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,895Updated this week
punica-ai / punica
Serving multiple LoRA finetuned LLM as one
☆1,097Updated last year
EQ-bench / EQ-Bench
A benchmark for emotional intelligence in large language models
☆365Updated last year
cpldcpu / MisguidedAttention
A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information
☆447Updated 2 months ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,522Updated 4 months ago
S-LoRA / S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
☆1,854Updated last year
huggingface / cosmopedia
☆541Updated 10 months ago
zhentingqi / rStar
☆962Updated 8 months ago
dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆667Updated 6 months ago
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,048Updated last year
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆661Updated last year
noamgat / lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
☆1,937Updated last month
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆878Updated 2 weeks ago