KYLN24 / CritiQLinks
Repository of the paper ''CritiQ: Mining Data Quality Criteria from Human Preferences". Code for CritiQ Flow & Training CritiQ Scorer.
☆18Updated 2 months ago
Alternatives and similar repositories for CritiQ
Users that are interested in CritiQ are comparing it to the libraries listed below
Sorting:
- Reformatted Alignment☆113Updated 10 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆119Updated last month
- ☆36Updated 11 months ago
- Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…☆83Updated 2 months ago
- ☆50Updated last year
- ☆108Updated last year
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆113Updated last month
- ☆96Updated last year
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆130Updated 3 weeks ago
- Scaling Preference Data Curation via Human-AI Synergy☆95Updated last month
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆14Updated 6 months ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆19Updated 2 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆56Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆129Updated 10 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆127Updated 3 months ago
- ☆182Updated last month
- ☆96Updated this week
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆51Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆35Updated 7 months ago
- Code and Data for the paper "Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works".☆19Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated 2 years ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated 10 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆113Updated 2 months ago
- ☆91Updated 2 months ago
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆123Updated 2 months ago
- ☆159Updated 3 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆55Updated 4 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 9 months ago
- ☆56Updated 9 months ago