fire / pytorch-nncp
☆13 · Updated 2 years ago
Alternatives and similar repositories for pytorch-nncp
Users that are interested in pytorch-nncp are comparing it to the libraries listed below
- This repository contains the source code and dataset link mentioned in the WWW 2022 accepted paper "TRACE: A Fast Transformer-based General-Pu… ☆30 · Updated 3 years ago
- Dzip: improved general-purpose lossless compression based on novel neural network modeling ☆76 · Updated 3 years ago
- An implementation of LLMzip using GPT-2 ☆13 · Updated 2 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them. ☆32 · Updated 2 years ago
- OpenVINO version of openai/whisper ☆182 · Updated 2 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch ☆231 · Updated last year
- Structural Pruning for LLaMA ☆54 · Updated 2 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details. ☆67 · Updated 3 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in PyTorch and Zeta ☆125 · Updated 3 weeks ago
- Customizable machine translation in C++ ☆56 · Updated last year
- QuIP quantization ☆61 · Updated last year
- Longitudinal Evaluation of LLMs via Data Compression ☆33 · Updated last year
- A converter and basic tester for RWKV ONNX ☆43 · Updated 2 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, … ☆87 · Updated 2 years ago
- Low-bit optimizers for PyTorch ☆138 · Updated 2 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,… ☆227 · Updated last year
- Implementation of Google's USM speech model in PyTorch ☆34 · Updated 3 weeks ago
- Unofficial PyTorch Implementation for pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350) ☆65 · Updated 3 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http… ☆106 · Updated 2 years ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates ☆473 · Updated last year
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers. ☆50 · Updated 2 years ago
- A repository for log-time feedforward networks ☆224 · Updated last year
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible… ☆98 · Updated 2 weeks ago
- SparseGPT + GPTQ Compression of LLMs like LLaMA, OPT, Pythia ☆42 · Updated 2 years ago
- A fusion of a linear layer and a cross entropy loss, written for PyTorch in Triton. ☆75 · Updated last year
- Root Mean Square Layer Normalization ☆261 · Updated 2 years ago
- Efficient kernel for RMS normalization with fused operations, including both forward and backward passes and compatibility with PyTorch. ☆12 · Updated last year
- RWKV, in easy-to-read code ☆72 · Updated 10 months ago