A more efficient GLM implementation!
☆54Feb 18, 2023Updated 3 years ago
Alternatives and similar repositories for one-glm
Users that are interested in one-glm are comparing it to the libraries listed below
Sorting:
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆74Jun 5, 2023Updated 2 years ago
- A multi-task learning approach for conditioned response generation (NAACL 2021)☆12Nov 18, 2022Updated 3 years ago
- ☆13Apr 15, 2024Updated last year
- a single-header math library☆17Nov 7, 2025Updated 3 months ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- ☆35Nov 22, 2024Updated last year
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- Template Filling with Generative Transformers☆23Jun 8, 2021Updated 4 years ago
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- 一个简单的股票预测算法,利用过去5天的涨幅,以及十余项市值因子和财务因子进行训练学习。训练数据被放在本地的mysql数据库中。☆26Jan 22, 2018Updated 8 years ago
- Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…☆26May 27, 2022Updated 3 years ago
- Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"☆26May 28, 2023Updated 2 years ago
- This repository contains the code used for Ordered Memory paper☆29Jan 12, 2020Updated 6 years ago
- LORA微调BLOOMZ,参考BELLE☆25Mar 24, 2023Updated 2 years ago
- ☆33Oct 4, 2024Updated last year
- ☆29May 13, 2024Updated last year
- using lear to do ner extraction☆29Mar 13, 2022Updated 3 years ago
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Mar 21, 2022Updated 3 years ago
- OpenVINO backend for Triton.☆37Feb 9, 2026Updated 3 weeks ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263May 9, 2024Updated last year
- Awesome Triton Resources☆39Apr 27, 2025Updated 10 months ago
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆32Nov 22, 2025Updated 3 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆397Mar 26, 2023Updated 2 years ago
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Oct 16, 2023Updated 2 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆405Jul 31, 2025Updated 7 months ago
- 机器学习使用过的API中文版及机器学习的理论知识☆13Jun 8, 2025Updated 8 months ago
- Implementation and data for the published paper "Recursive Neural Conditional Random Fields for Aspect-based Sentiment Analysis".☆30Mar 26, 2019Updated 6 years ago
- Neural (LSTM) version of the partial CRF model☆34Aug 4, 2019Updated 6 years ago
- Organized inventory of research using the Abstract Meaning Representation☆40Jan 2, 2026Updated 2 months ago
- ☆83Feb 10, 2026Updated 3 weeks ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- 医疗陪诊服务小程序☆12May 25, 2024Updated last year
- OpenCore EFI for ASRock B650M PG Lightning☆10May 1, 2024Updated last year