Model Compression for Big Models
☆168Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for BMCook
Users that are interested in BMCook are comparing it to the libraries listed below
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆267Nov 27, 2023Updated 2 years ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆621Oct 27, 2025Updated 4 months ago
- Efficient Inference for Big Models☆587Jan 24, 2023Updated 3 years ago
- Live Training for Open-source Big Models☆505May 30, 2023Updated 2 years ago
- BMInf demos.☆16Oct 14, 2021Updated 4 years ago
- ☆26Oct 2, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- Finetune CPM-1☆75Mar 18, 2023Updated 2 years ago
- ☆17Aug 5, 2025Updated 6 months ago
- ☆11Sep 16, 2024Updated last year
- BERT TensorRT model acceleration and deployment☆10Apr 1, 2022Updated 3 years ago
- Official PyTorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago
- This repo explores how to use AMR to address tasks that are difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- Using Bayesian inference to mine rule sets☆12Jan 9, 2020Updated 6 years ago
- Long Text Generation via Adversarial Training with Leaked Information☆11Mar 8, 2022Updated 3 years ago
- CCL 2023 Chinese Learner Text Correction evaluation☆30Jul 12, 2023Updated 2 years ago
- A 10-billion-parameter Chinese-English bilingual base model☆2,413Jul 28, 2023Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆120Dec 10, 2024Updated last year
- Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins☆2,786Dec 5, 2023Updated 2 years ago
- ☆17Dec 19, 2024Updated last year
- This repository contains PyTorch Implementation of CIKM 2022 research-track paper: OptEmbed: Learning Optimal Embedding Table for Click-t…☆18Oct 23, 2023Updated 2 years ago
- ☆38Feb 9, 2026Updated 2 weeks ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆22Jan 28, 2024Updated 2 years ago
- 2020 Language and Intelligence Technology Competition: recommendation-oriented dialogue task☆52Jun 17, 2021Updated 4 years ago
- A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆285Aug 13, 2023Updated 2 years ago
- Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆89Jun 27, 2023Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆101May 3, 2024Updated last year
- Unit Scaling demo and experimentation code☆16Mar 12, 2024Updated last year
- Jittor object detection SSD (Single Shot MultiBox Detector) implementation.☆14Mar 11, 2021Updated 4 years ago
- [SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction☆42Apr 5, 2023Updated 2 years ago
- ☆36Jul 7, 2025Updated 7 months ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆306Mar 11, 2023Updated 2 years ago
- [ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models☆61Apr 1, 2024Updated last year
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60May 28, 2024Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- [ACL 2025] An official PyTorch implementation of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆39May 28, 2025Updated 9 months ago
- Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Ex…☆17Apr 1, 2021Updated 4 years ago
- Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"☆16May 26, 2023Updated 2 years ago
- Sampling Matters in Deep Embedding Learning (ICCV'17)☆16Oct 16, 2018Updated 7 years ago