A more efficient GLM implementation!
☆54Feb 18, 2023Updated 3 years ago
Alternatives and similar repositories for one-glm
Users that are interested in one-glm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆74Jun 5, 2023Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆39Feb 10, 2023Updated 3 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 2 years ago
- a single-header math library☆17Nov 7, 2025Updated 4 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A multi-task learning approach for conditioned response generation (NAACL 2021)☆12Nov 18, 2022Updated 3 years ago
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- ☆13Mar 27, 2023Updated 3 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- ☆13Apr 15, 2024Updated last year
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆52Aug 20, 2023Updated 2 years ago
- Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…☆26May 27, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Mar 21, 2022Updated 4 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆35Nov 22, 2024Updated last year
- This repository contains the code used for Ordered Memory paper☆29Jan 12, 2020Updated 6 years ago
- LORA微调BLOOMZ,参考BELLE☆25Mar 24, 2023Updated 3 years ago
- Template Filling with Generative Transformers☆22Jun 8, 2021Updated 4 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- ☆10Dec 3, 2021Updated 4 years ago
- ☆104Jul 28, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- OpenVINO backend for Triton.☆37Mar 18, 2026Updated last week
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- ☆29May 13, 2024Updated last year
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"☆26May 28, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆397Mar 26, 2023Updated 3 years ago
- ☆29Nov 29, 2025Updated 3 months ago
- Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"☆10Oct 5, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- GLM (General Language Model)☆3,463Nov 3, 2023Updated 2 years ago
- A Slot-filling based Dialog Manager for Task-oriented Bot☆12Dec 29, 2016Updated 9 years ago
- ☆44Mar 29, 2023Updated 2 years ago
- A small framework mimics PyTorch using CuPy or NumPy☆56Feb 16, 2022Updated 4 years ago
- pCLUE: 1000000+多任务提示学习数据集☆508Oct 4, 2022Updated 3 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago