InternLM / InternLM-WQX
☆17Updated 2 months ago
Related projects: ⓘ
- code for Scaling Laws of RoPE-based Extrapolation☆68Updated 11 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆209Updated 5 months ago
- ☆148Updated 10 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆132Updated 10 months ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆277Updated this week
- An automated pipeline for evaluating LLMs for role-playing.☆118Updated this week
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆114Updated 2 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆156Updated 10 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆239Updated last month
- Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"☆143Updated 2 weeks ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆218Updated last week
- 大模型多维度中文对齐评测基准 (ACL 2024)☆293Updated last month
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆244Updated last week
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆37Updated 6 months ago
- 基于baichuan-7b的开源多模态大语言模型☆71Updated 9 months ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆64Updated 2 weeks ago
- OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆246Updated 3 weeks ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆172Updated 10 months ago
- Multimodal chatbot with computer vision capabilities integrated☆98Updated 4 months ago
- LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation☆194Updated 4 months ago
- ☆109Updated 5 months ago
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness☆200Updated last week
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆316Updated 5 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆121Updated 3 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆77Updated 2 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆38Updated 7 months ago
- ☆265Updated 4 months ago
- ☆185Updated last month
- Touchstone: Evaluating Vision-Language Models by Language Models☆75Updated 8 months ago
- LVBench: An Extreme Long Video Understanding Benchmark☆51Updated 3 weeks ago