IAAR-Shanghai / xFinder
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
☆163 Updated last month
Alternatives and similar repositories for xFinder:
Users who are interested in xFinder are comparing it to the repositories listed below:
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey" ☆170 Updated 5 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads. ☆336 Updated last month
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin… ☆164 Updated 4 months ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback. ☆189 Updated 7 months ago
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools ☆105 Updated 2 months ago
- A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Pape… ☆114 Updated 4 months ago
- ☆204 Updated 4 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024) ☆154 Updated 4 months ago
- Toolkit for Prompt Compression ☆251 Updated 2 months ago
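- Multi-Agent-GPT: a multimodal expert assistant GPT built on RAG and agents. It integrates tools for text, image, and audio modalities, and supports local deployment and private database construction. ☆222 Updated last month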
- OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is bui… ☆273 Updated 7 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks! ☆242 Updated 2 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space" ☆146 Updated last year
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models ☆67 Updated 2 weeks ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts ☆201 Updated 6 months ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding ☆116 Updated 2 weeks ago
- Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference ☆146 Updated last month
- ☆117 Updated 10 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. ☆161 Updated 5 months ago
- ☆105 Updated 8 months ago
- Grimoire is All You Need for Enhancing Large Language Models ☆113 Updated last year
- Controllable Text Generation for Large Language Models: A Survey ☆167 Updated 7 months ago
- Official code for "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models" ☆160 Updated 2 weeks ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control. ☆60 Updated 6 months ago
- NeoBERT is an advanced model designed specifically for predicting the binding affinity between neoantigens and HLA. It is a variant of th… ☆148 Updated 3 months ago
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models ☆87 Updated last year
- Object_Detection_Dataset_Conversion ☆130 Updated 3 months ago
- Pytorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024 ☆121 Updated last month
- MoH: Multi-Head Attention as Mixture-of-Head Attention ☆236 Updated 5 months ago
- https://arxiv.org/abs/2408.02032 ☆102 Updated 3 months ago