SomeoneKong / llm_long_context_bench202405
☆28Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm_long_context_bench202405
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆126Updated 5 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- ☆129Updated 4 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆209Updated this week
- Mixture-of-Experts (MoE) Language Model☆180Updated 2 months ago
- ☆78Updated last month
- 中文原生检索增强生成测评基准☆100Updated 7 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆78Updated last year
- Imitate OpenAI with Local Models☆85Updated 2 months ago
- Reformatted Alignment☆112Updated last month
- ☆40Updated 5 months ago
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆132Updated 7 months ago
- ☆128Updated last month
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆217Updated 6 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆124Updated 4 months ago
- Light local website for displaying performances from different chat models.☆85Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 7 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆220Updated 3 weeks ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆57Updated last year
- The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆99Updated 3 weeks ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- zero零训练llm调参☆30Updated last year
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆155Updated 8 months ago
- ☆193Updated 6 months ago
- large language model training-3-stages+deployment☆46Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆285Updated last month
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆50Updated 3 months ago
- ☆78Updated 7 months ago