InternLM / InternLM-WQX
☆19Updated 11 months ago
Alternatives and similar repositories for InternLM-WQX
Users that are interested in InternLM-WQX are comparing it to the libraries listed below
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆216Updated 4 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆121Updated 5 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆133Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆57Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆250Updated 6 months ago
- InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencie…☆393Updated 2 weeks ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- ☆150Updated last week
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆131Updated 2 months ago
- Repository of LV-Eval Benchmark☆67Updated 9 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆186Updated 3 months ago
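- An open-source multimodal large language model based on baichuan-7b☆73Updated last year
- SuperCLUE-Math6: An exploration of a new-generation, native-Chinese, multi-turn, multi-step mathematical reasoning dataset☆56Updated last year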
- a-m-team's exploration in large language modeling☆161Updated 3 weeks ago
- ☆48Updated last year
- A vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG☆58Updated 7 months ago
- ☆169Updated last year
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆160Updated 3 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆276Updated last year
- ☆229Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆348Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆160Updated this week
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆59Updated last month
- LongQLoRA: Extend Context Length of LLMs Efficiently☆166Updated last year
- An automated pipeline for evaluating LLMs for role-playing.☆187Updated 9 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆113Updated 2 months ago
- ☆191Updated 2 months ago
- ☆29Updated last year
- Text deduplication☆72Updated last year