bojone / analytical-classification
Analytical solutions for logistic regression and single-layer softmax
☆12 Updated 4 years ago
Alternatives and similar repositories for analytical-classification
Users that are interested in analytical-classification are comparing it to the libraries listed below
- Shuffling files of several hundred GB in Python ☆33 Updated 4 years ago
- A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆117 Updated 4 years ago
- Finetune CPM-1 ☆24 Updated 4 years ago
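- A Transformer-based single-model, multi-scale VAE ☆57 Updated 4 years ago
- A simple script for detecting and removing cryptomining malware ☆18 Updated 3 years ago
- A Transformer model based on the Gated Attention Unit (early preview version) ☆98 Updated 2 years ago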
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719 ☆22 Updated last year
- ROUGE for multilingual Summarization ☆25 Updated 3 years ago
- ☆19 Updated last year
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models" ☆24 Updated 3 years ago
- An end to end ASR Transformer model training repo ☆13 Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron … ☆33 Updated 2 years ago
- Source code and checkpoints for legal pre-trained language models. ☆15 Updated 4 years ago
- A more efficient GLM implementation! ☆54 Updated 2 years ago
- Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences". ☆33 Updated 2 years ago
- Notes of my introduction about NLP in Fudan University ☆37 Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ☆21 Updated 3 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train… ☆58 Updated last year
- A small framework mimics PyTorch using CuPy or NumPy ☆47 Updated 3 years ago
- Fast instruction tuning with Llama2 ☆11 Updated last year
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention". ☆44 Updated 3 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch ☆17 Updated 3 years ago
- A Python implementation of Toolformer using Huggingface Transformers ☆14 Updated 2 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD" ☆12 Updated 5 years ago
- An upgraded version of RoFormer ☆154 Updated 3 years ago
- Training LLaMA or MOSS with RLHF, optionally with LoRA ☆21 Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI ☆56 Updated 2 years ago
- KuaiSearch PERKS ☆12 Updated 3 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆41 Updated last year
- For the paper "Gaussian Transformer: A Lightweight Approach for Natural Language Inference" ☆28 Updated 5 years ago