A more efficient GLM implementation!
☆54Feb 18, 2023Updated 3 years ago
Alternatives and similar repositories for one-glm
Users that are interested in one-glm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆73Jun 5, 2023Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆39Feb 10, 2023Updated 3 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 3 years ago
- a single-header math library☆17Nov 7, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A multi-task learning approach for conditioned response generation (NAACL 2021)☆12Nov 18, 2022Updated 3 years ago
- CUDA 12.2 HMM demos☆21Jul 26, 2024Updated last year
- ☆13Mar 27, 2023Updated 3 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- ☆13Apr 15, 2024Updated 2 years ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆51Aug 20, 2023Updated 2 years ago
- Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…☆26May 27, 2022Updated 3 years ago
- The code base for the SCL implementation used in "Neural Structural Correspondence Learning for Domain Adaptation", CoNLL 2017 and in "Pi…☆22Jul 2, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Apr 8, 2026Updated last month
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆37Jan 5, 2021Updated 5 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆403Jul 31, 2025Updated 9 months ago
- LORA微调BLOOMZ,参考BELLE☆25Mar 24, 2023Updated 3 years ago
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 4 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- 使用bert训练MRPC数据集,写成API接口模式以及简易的html界面☆21Jun 30, 2019Updated 6 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- OpenVINO backend for Triton.☆37May 8, 2026Updated 2 weeks ago
- End-to-End Speech Processing Toolkit☆16Jan 20, 2025Updated last year
- ☆30May 13, 2024Updated 2 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- ☆88Nov 4, 2021Updated 4 years ago
- Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"☆26May 28, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆395Mar 26, 2023Updated 3 years ago
- transformers implement (architecture, task example, serving and more)☆96Mar 23, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Slot-filling based Dialog Manager for Task-oriented Bot☆12Dec 29, 2016Updated 9 years ago
- GLM (General Language Model)☆3,501Nov 3, 2023Updated 2 years ago
- ☆33Oct 4, 2024Updated last year
- ☆44Mar 29, 2023Updated 3 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- pCLUE: 1000000+多任务提示学习数据集☆508Oct 4, 2022Updated 3 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated 2 years ago