Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
☆268Nov 27, 2023Updated 2 years ago
Alternatives and similar repositories for ModelCenter
Users that are interested in ModelCenter are comparing it to the libraries listed below
Sorting:
- Model Compression for Big Models☆168Jun 30, 2023Updated 2 years ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆624Oct 27, 2025Updated 4 months ago
- Live Training for Open-source Big Models☆505May 30, 2023Updated 2 years ago
- Efficient Inference for Big Models☆587Jan 24, 2023Updated 3 years ago
- A List of Big Models☆347Jun 30, 2023Updated 2 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,041Sep 19, 2024Updated last year
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 2 months ago
- An Open-Source Package for Information Retrieval☆168Mar 9, 2026Updated last week
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,785Dec 5, 2023Updated 2 years ago
- 在线图书借阅系统 - 2017 THU OOP课大作业☆13Jul 1, 2018Updated 7 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆938Oct 6, 2022Updated 3 years ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆61Feb 20, 2024Updated 2 years ago
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)☆2,805Mar 13, 2024Updated 2 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Feb 28, 2025Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Jul 15, 2023Updated 2 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆116Oct 27, 2022Updated 3 years ago
- An Open-Source Framework for Prompt-Learning.☆4,838Jul 16, 2024Updated last year
- 百亿参数的中英文双语基座大模型☆2,413Jul 28, 2023Updated 2 years ago
- BMInf demos.☆16Oct 14, 2021Updated 4 years ago
- Gaokao Benchmark for AI☆108Jul 8, 2022Updated 3 years ago
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆396Apr 20, 2024Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆379Sep 25, 2024Updated last year
- Expanding natural instructions☆1,036Dec 11, 2023Updated 2 years ago
- ☆11Nov 27, 2022Updated 3 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆364Dec 29, 2023Updated 2 years ago
- Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"☆912Nov 25, 2023Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆195Jun 14, 2023Updated 2 years ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆56Oct 31, 2025Updated 4 months ago
- ☆32Mar 31, 2020Updated 5 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- ☆99Jul 25, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,438Mar 20, 2024Updated 2 years ago
- Finetune CPM-2☆82Mar 18, 2023Updated 3 years ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆591Dec 9, 2024Updated last year
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,769Aug 4, 2024Updated last year
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago