bazingagin / npc_gzip
Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors
☆1,773 Updated 2 years ago
Alternatives and similar repositories for npc_gzip
Users that are interested in npc_gzip are comparing it to the libraries listed below
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,062 Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,479 Updated 3 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆892 Updated last year
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,007 Updated last year
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆728 Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆2,011 Updated 7 months ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,463 Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI ☆2,047 Updated last year
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,345 Updated last year
- LOMO: LOw-Memory Optimization ☆989 Updated last year
- Run inference on MPT-30B using CPU ☆575 Updated 2 years ago
- Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A ☆964 Updated last year
- Foundation Architecture for (M)LLMs ☆3,099 Updated last year
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings ☆1,997 Updated 7 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333 ☆1,120 Updated last year
- SGPT: GPT Sentence Embeddings for Semantic Search ☆870 Updated last year
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆967 Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics ☆2,597 Updated 2 months ago
- Convolutions for Sequence Modeling ☆895 Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript ☆599 Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 Updated 2 years ago
- Salesforce open-source LLMs with 8k sequence length. ☆721 Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆2,496 Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆48 Updated 2 years ago
- Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour… ☆2,663 Updated 10 months ago
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity. ☆3,523 Updated last year
- The Official Python Client for Lamini's API ☆2,543 Updated 4 months ago
- Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" ☆709 Updated last year
- Explore large language models in 512MB of RAM ☆1,195 Updated 3 weeks ago
- C++ implementation for BLOOM ☆809 Updated 2 years ago