InternLM / InternLM-WQX
☆19Updated 8 months ago
Alternatives and similar repositories for InternLM-WQX:
Users that are interested in InternLM-WQX are comparing it to the libraries listed below
- code for Scaling Laws of RoPE-based Extrapolation☆72Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆130Updated 9 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆264Updated 11 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆53Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 11 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆161Updated last week
- ☆166Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆55Updated last week
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- An automated pipeline for evaluating LLMs for role-playing.☆163Updated 6 months ago
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆125Updated last month
- ☆60Updated 2 months ago
- ☆46Updated this week
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆313Updated 6 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆148Updated last year
- LongQLoRA: Extent Context Length of LLMs Efficiently☆164Updated last year
- ☆45Updated 9 months ago
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆345Updated last year
- ☆264Updated 8 months ago
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs☆149Updated last week
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 9 months ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆370Updated this week
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆247Updated 3 months ago
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆97Updated 8 months ago
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆21Updated 3 months ago
- ☆29Updated 7 months ago
- Repository of LV-Eval Benchmark☆61Updated 7 months ago
- ☆225Updated 10 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆237Updated 5 months ago