IAAR-Shanghai / xFinder
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
☆150Updated 3 weeks ago
Alternatives and similar repositories for xFinder:
Users that are interested in xFinder are comparing it to the libraries listed below
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆157Updated 3 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆153Updated 2 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆306Updated this week
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆189Updated 5 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆146Updated 2 months ago
- Multi-Agent-GPT: 一款基于RAG和agent构建的多模态专家助手GPT。它集成了文本、图像和音频等模态工具。支持本地部署和私有数据库建设。☆214Updated last year
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools☆102Updated 2 weeks ago
- ☆204Updated 2 months ago
- A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Pape…☆114Updated 2 months ago
- Toolkit for Prompt Compression☆242Updated this week
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 3 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆58Updated 4 months ago
- OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is bui…☆270Updated 5 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆224Updated last week
- ☆117Updated 8 months ago
- ☆79Updated 6 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆109Updated 11 months ago
- Controllable Text Generation for Large Language Models: A Survey☆153Updated 5 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆180Updated 4 months ago
- Official code for paper "Learning to Use Tools via Cooperative and Interactive Agents"☆59Updated 10 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆142Updated 10 months ago
- Pytorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024☆119Updated 2 months ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆256Updated last month
- Object_Detection_Dataset_Conversion☆122Updated last month
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆201Updated 3 months ago
- [NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"☆164Updated last month
- Creating a simple Go module for Backend Teams' DevOps Workflow☆114Updated last month
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year