BaichuanSEED / BaichuanSEED.github.io
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline"
☆18 Aug 28, 2024 Updated last year
Alternatives and similar repositories for BaichuanSEED.github.io
Users that are interested in BaichuanSEED.github.io are comparing it to the libraries listed below
- FlexiTokens ☆19 Dec 27, 2025 Updated last month
- ☆18 Nov 3, 2025 Updated 3 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆19 Nov 12, 2024 Updated last year
- The code implementation of Symbolic-MoE ☆46 Sep 2, 2025 Updated 5 months ago
- SWE-Exp: Experience-Driven Software Issue Resolution ☆35 Oct 17, 2025 Updated 3 months ago
- code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Princip… ☆23 Jul 26, 2025 Updated 6 months ago
- ☆25 Nov 19, 2025 Updated 2 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 Jul 19, 2024 Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆29 May 22, 2025 Updated 8 months ago
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System] ☆48 Oct 19, 2025 Updated 3 months ago
- This repository contains data, code and models for contextual noncompliance. ☆25 Jul 18, 2024 Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 Jun 28, 2025 Updated 7 months ago
- Train, tune, and infer Bamba model ☆137 Jun 4, 2025 Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆34 Mar 2, 2024 Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain" ☆81 Dec 20, 2024 Updated last year
- ☆18 Jun 10, 2025 Updated 8 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios ☆16 Oct 18, 2024 Updated last year
- ☆26 Updated this week
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models ☆79 Oct 16, 2024 Updated last year
- patches for huggingface transformers to save memory ☆34 Jun 2, 2025 Updated 8 months ago
- ☆11 Jun 22, 2025 Updated 7 months ago
- ☆72 Jan 29, 2026 Updated 2 weeks ago
- The official implementation of the paper "DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents" ☆28 Oct 23, 2025 Updated 3 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments ☆30 Oct 2, 2025 Updated 4 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 Jul 19, 2024 Updated last year
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆30 Oct 30, 2025 Updated 3 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models ☆34 Jun 14, 2024 Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆37 Jan 21, 2025 Updated last year
- (ToCa-v2) A new version of ToCa, with faster speed and better acceleration! ☆40 Mar 13, 2025 Updated 11 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆47 Jul 17, 2025 Updated 6 months ago
- FocusLLM: Scaling LLM’s Context by Parallel Decoding ☆44 Dec 8, 2024 Updated last year
- ☆72 Jun 10, 2025 Updated 8 months ago
- ☆49 Apr 4, 2025 Updated 10 months ago
- ☆131 May 29, 2025 Updated 8 months ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t… ☆10 Aug 19, 2018 Updated 7 years ago
- Emotion based music recommender system ☆11 Mar 26, 2025 Updated 10 months ago
- Continuous Pipelined Speculative Decoding ☆16 Jan 4, 2026 Updated last month
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 Oct 1, 2025 Updated 4 months ago