Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆41 · Sep 30, 2024 · Updated last year
Alternatives and similar repositories for Ruler
Users who are interested in Ruler are comparing it to the libraries listed below.
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆22 · May 15, 2025 · Updated last year
- ☆10 · Oct 27, 2023 · Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆60 · Jul 23, 2024 · Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated 2 years ago
- ☆31 · Sep 12, 2025 · Updated 8 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le… ☆14 · Jan 16, 2025 · Updated last year
- ☆17 · Apr 9, 2025 · Updated last year
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning" ☆58 · May 3, 2025 · Updated last year
- ☆14 · Jan 22, 2025 · Updated last year
- The official implementation of Preference Data Reward-Augmentation. ☆18 · May 1, 2025 · Updated last year
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆25 · Nov 17, 2024 · Updated last year
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆51 · Jul 1, 2025 · Updated 11 months ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought" ☆17 · Apr 3, 2025 · Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆19 · Apr 1, 2025 · Updated last year
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o… ☆19 · Oct 4, 2024 · Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆25 · Apr 6, 2026 · Updated 2 months ago
- the datasets of our paper ☆11 · Feb 26, 2024 · Updated 2 years ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents ☆50 · Feb 2, 2026 · Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆42 · May 6, 2026 · Updated last month
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst… ☆23 · Mar 16, 2025 · Updated last year
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆18 · Mar 11, 2025 · Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆30 · Aug 9, 2025 · Updated 10 months ago
- [ACL 2026] Repository of IPBench ☆22 · Apr 6, 2026 · Updated 2 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆22 · Dec 16, 2024 · Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory ☆62 · Feb 10, 2025 · Updated last year
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners ☆20 · May 28, 2024 · Updated 2 years ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"… ☆25 · Oct 15, 2025 · Updated 7 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 10 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆69 · Mar 17, 2026 · Updated 2 months ago
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation ☆117 · Jan 23, 2025 · Updated last year
- ☆13 · Aug 12, 2022 · Updated 3 years ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning ☆45 · Mar 20, 2024 · Updated 2 years ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning ☆52 · Oct 17, 2025 · Updated 7 months ago
- Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects (CVPR 2025) ☆27 · Apr 9, 2025 · Updated last year
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe… ☆57 · Jan 22, 2026 · Updated 4 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings ☆34 · Nov 12, 2024 · Updated last year
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios ☆13 · Jan 4, 2025 · Updated last year
- A new simple method for dataset distillation called Randomized Truncated Backpropagation Through Time (RaT-BPTT) ☆14 · Apr 21, 2024 · Updated 2 years ago