open-compass / CompassJudger
☆71Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for CompassJudger
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 7 months ago
- ☆222Updated 3 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆193Updated last month
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆156Updated 7 months ago
- The demo, code and data of FollowRAG☆60Updated 3 weeks ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆231Updated 7 months ago
- ☆116Updated 5 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆93Updated 2 weeks ago
- ☆85Updated 2 weeks ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆170Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆213Updated last week
- Reformatted Alignment☆112Updated 2 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆129Updated 2 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆236Updated 3 weeks ago
- Expert Specialized Fine-Tuning☆148Updated 2 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆179Updated 5 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆101Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 3 weeks ago
- This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for E…☆358Updated this week
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆81Updated 7 months ago
- ☆287Updated 2 months ago
- FuseAI Project☆76Updated 3 months ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆154Updated this week
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆139Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆54Updated this week
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆69Updated 2 months ago
- ☆192Updated 3 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently☆159Updated last year
- Environments, tools, and benchmarks for general computer agents☆172Updated last month
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆75Updated last month