InternLM / InternLM-WQX
☆19Updated 10 months ago
Alternatives and similar repositories for InternLM-WQX
Users that are interested in InternLM-WQX are comparing it to the libraries listed below
Sorting:
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆347Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆270Updated last year
- Repository of LV-Eval Benchmark☆65Updated 8 months ago
- ☆48Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆176Updated last month
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆386Updated this week
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆55Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 5 months ago
- ☆56Updated last year
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆116Updated 5 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆149Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆56Updated last month
- LongQLoRA: Extent Context Length of LLMs Efficiently☆164Updated last year
- ☆226Updated last year
- ☆280Updated 9 months ago
- Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆198Updated last month
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆224Updated last month
- [CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness☆363Updated this week
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆158Updated 2 months ago
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆56Updated 6 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆121Updated 4 months ago
- Mixture-of-Experts (MoE) Language Model☆186Updated 8 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆189Updated 2 months ago
- ☆168Updated last year
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆268Updated 3 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆151Updated 8 months ago