ljcleo / debatrixLinks
LLM-based Multi-dimensional Debate Judge with Iterative Chronological Analysis
☆19Updated 4 months ago
Alternatives and similar repositories for debatrix
Users that are interested in debatrix are comparing it to the libraries listed below
Sorting:
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆128Updated last year
- repository for CharacterChat, a personalized social support system☆76Updated last year
- ☆96Updated last year
- ☆283Updated 8 months ago
- ☆147Updated last year
- Awesome papers for role-playing with language models☆218Updated last year
- Source code and demo for memory bank and SiliconFriend☆402Updated 2 years ago
- ☆142Updated 8 months ago
- ☆164Updated last year
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆38Updated last year
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆61Updated last year
- [ACL 2024] Official code for "IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation" (Theatr…☆47Updated 7 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64Updated 8 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated last year
- Chinese version implementation of Generative Agents: Interactive Simulacra of Human Behavior☆87Updated 2 years ago
- SOTA Math Opensource LLM☆334Updated 2 years ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆172Updated last month
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆161Updated 8 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆48Updated last year
- Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…☆90Updated 8 months ago
- [NeurIPS 2024] Personal Agentic AI for MultiAgent Cooperation☆87Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆220Updated last year
- ☆97Updated last year
- FireAct: Toward Language Agent Fine-tuning☆292Updated 2 years ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆143Updated 11 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated 2 years ago
- Scaling Preference Data Curation via Human-AI Synergy☆141Updated 7 months ago
- ☆36Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆112Updated 7 months ago
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago