zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆111Updated last year
Alternatives and similar repositories for llama_infer:
Users that are interested in llama_infer are comparing it to the libraries listed below
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- ☆105Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year
- Unofficial implementation of AlpaGasus☆90Updated last year
- A Multilingual Replicable Instruction-Following Model☆94Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 4 months ago
- ☆96Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆175Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 5 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆179Updated 2 years ago
- ☆73Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Updated last year
- ☆178Updated last year
- Code and models for BERT on STILTs☆53Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 6 months ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆48Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- reStructured Pre-training☆98Updated 2 years ago
- ☆96Updated 2 years ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆96Updated 11 months ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆172Updated last month
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year