bazingagin / npc_gzipLinks
Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors
☆1,776Updated 2 years ago
Alternatives and similar repositories for npc_gzip
Users that are interested in npc_gzip are comparing it to the libraries listed below
Sorting:
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,462Updated last year
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,011Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,477Updated 6 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,063Updated last year
- Run inference on MPT-30B using CPU☆575Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆893Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆52Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,349Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,498Updated last year
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆977Updated last year
- Explore large language models in 512MB of RAM☆1,196Updated 3 months ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆730Updated last year
- C++ implementation for BLOOM☆806Updated 2 years ago
- Inference Llama 2 in one file of pure 🔥☆2,119Updated 2 weeks ago
- String-to-String Algorithms for Natural Language Processing☆556Updated last year
- Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"☆711Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆821Updated 2 years ago
- A curated list of awesome transformer models.☆666Updated 2 years ago
- LOMO: LOw-Memory Optimization☆989Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,008Updated 10 months ago
- ☆587Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆771Updated last year
- Convolutions for Sequence Modeling☆901Updated last year
- Explore and understand your training and validation data.☆848Updated 10 months ago
- ☆1,494Updated 2 years ago
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆2,015Updated 9 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,132Updated last year
- The Official Python Client for Lamini's API☆2,541Updated 6 months ago
- Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised…☆3,094Updated last year
- ☆1,052Updated last year