Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
☆270Nov 27, 2023Updated 2 years ago
Alternatives and similar repositories for ModelCenter
Users that are interested in ModelCenter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model Compression for Big Models☆169Jun 30, 2023Updated 2 years ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆625Oct 27, 2025Updated 5 months ago
- Live Training for Open-source Big Models☆505May 30, 2023Updated 2 years ago
- Efficient Inference for Big Models☆586Jan 24, 2023Updated 3 years ago
- A List of Big Models☆345Jun 30, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,041Sep 19, 2024Updated last year
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 3 months ago
- An Open-Source Package for Information Retrieval☆167Mar 16, 2026Updated 3 weeks ago
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,779Dec 5, 2023Updated 2 years ago
- 在线图书借阅系统 - 2017 THU OOP课大作业☆13Jul 1, 2018Updated 7 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆938Oct 6, 2022Updated 3 years ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆62Feb 20, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)☆2,816Mar 13, 2024Updated 2 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Feb 28, 2025Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Jul 15, 2023Updated 2 years ago
- An Open-Source Framework for Prompt-Learning.☆4,849Jul 16, 2024Updated last year
- 百亿参数的中英文双语基座大模型☆2,412Jul 28, 2023Updated 2 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- BMInf demos.☆16Oct 14, 2021Updated 4 years ago
- Gaokao Benchmark for AI☆108Jul 8, 2022Updated 3 years ago
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆399Apr 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆381Sep 25, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- Expanding natural instructions☆1,039Dec 11, 2023Updated 2 years ago
- Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"☆914Nov 25, 2023Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆367Dec 29, 2023Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆195Jun 14, 2023Updated 2 years ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆56Oct 31, 2025Updated 5 months ago
- CHATGPT-In-Jupyter☆11Jun 2, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆32Mar 31, 2020Updated 6 years ago
- ☆99Jul 25, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,437Mar 20, 2024Updated 2 years ago
- Finetune CPM-2☆82Mar 18, 2023Updated 3 years ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆592Dec 9, 2024Updated last year
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,770Aug 4, 2024Updated last year
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago