A more efficient GLM implementation!
☆54Feb 18, 2023Updated 3 years ago
Alternatives and similar repositories for one-glm
Users that are interested in one-glm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆73Jun 5, 2023Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆39Feb 10, 2023Updated 3 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 3 years ago
- A multi-task learning approach for conditioned response generation (NAACL 2021)☆12Nov 18, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- CUDA 12.2 HMM demos☆21Jul 26, 2024Updated last year
- ☆13Mar 27, 2023Updated 3 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- ☆13Apr 15, 2024Updated 2 years ago
- [CVPR-2023] Towards Any Structural Pruning☆17Apr 27, 2023Updated 3 years ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆51Aug 20, 2023Updated 2 years ago
- Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…☆26May 27, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Apr 8, 2026Updated 3 weeks ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆36Nov 22, 2024Updated last year
- This repository contains the code used for Ordered Memory paper☆29Jan 12, 2020Updated 6 years ago
- ☆37Jan 5, 2021Updated 5 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆404Jul 31, 2025Updated 9 months ago
- LORA微调BLOOMZ,参考BELLE☆25Mar 24, 2023Updated 3 years ago
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 4 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Dec 3, 2021Updated 4 years ago
- 使用bert训练MRPC数据集,写成API接口模式以及简易的html界面☆21Jun 30, 2019Updated 6 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 5 years ago
- ☆29May 13, 2024Updated last year
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆396Mar 26, 2023Updated 3 years ago
- transformers implement (architecture, task example, serving and more)☆96Mar 23, 2022Updated 4 years ago
- GLM (General Language Model)☆3,489Nov 3, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Slot-filling based Dialog Manager for Task-oriented Bot☆12Dec 29, 2016Updated 9 years ago
- ☆33Oct 4, 2024Updated last year
- ☆44Mar 29, 2023Updated 3 years ago
- pCLUE: 1000000+多任务提示学习数据集☆506Oct 4, 2022Updated 3 years ago
- 微信开放平台☆10Nov 15, 2016Updated 9 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago