bojone / analytical-classificationLinks
逻辑回归和单层softmax的解析解
☆12Updated 4 years ago
Alternatives and similar repositories for analytical-classification
Users that are interested in analytical-classification are comparing it to the libraries listed below
Sorting:
- Python下shuffle几百G文件☆33Updated 4 years ago
- A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆117Updated 4 years ago
- Finetune CPM-1☆24Updated 4 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆57Updated 4 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Updated last year
- 简单的挖矿病毒查杀脚本☆18Updated 3 years ago
- ROUGE for multilingual Summarization☆25Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆46Updated 4 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Updated 2 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch☆17Updated 3 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- A pytorch &keras implementation and demo of Fastformer.☆189Updated 3 years ago
- reformer-pytorch中文版本,简单高效的生成模型。类似GPT2的效果☆16Updated 2 years ago
- A more efficient GLM implementation!☆54Updated 2 years ago
- ☆19Updated last year
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 3 years ago
- Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences".☆33Updated 2 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Fast instruction tuning with Llama2☆11Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- ☆23Updated 2 years ago
- 一些RNN的实现☆51Updated 2 years ago
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10Updated last year
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 4 years ago
- A pre-trained model with multi-exit transformer architecture.☆56Updated 2 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 5 years ago