open-compass / CompassJudger
☆67Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for CompassJudger
- ☆215Updated 3 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 6 months ago
- The demo, code and data of FollowRAG☆61Updated 2 weeks ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆190Updated 3 weeks ago
- Reformatted Alignment☆112Updated last month
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆155Updated 7 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆75Updated 3 weeks ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆44Updated last week
- [ACL 2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆230Updated 7 months ago
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆98Updated 3 weeks ago
- The code and data of DPA-RAG☆50Updated last month
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆66Updated 10 months ago
- FuseAI Project☆76Updated 2 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆169Updated last month
- Expert Specialized Fine-Tuning☆144Updated last month
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆30Updated 3 weeks ago
- The Hugging Face implementation of Fine-grained Late-interaction Multi-modal Retriever.☆68Updated 2 months ago
- ☆283Updated last month
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.☆38Updated 4 months ago
- ☆116Updated 5 months ago
- ☆128Updated last week
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆80Updated 7 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆90Updated this week
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated this week
- Generative Judge for Evaluating Alignment☆216Updated 9 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆216Updated 6 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆230Updated last week
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆186Updated 3 months ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆185Updated 2 weeks ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 6 months ago