zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
Alternatives and similar repositories for llama_infer:
Users that are interested in llama_infer are comparing it to the libraries listed below
- ☆104Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- ☆96Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 6 months ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆48Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 7 months ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆95Updated last year
- ☆178Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- ☆72Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- Code and models for BERT on STILTs☆53Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆185Updated last month
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆179Updated 2 years ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆206Updated 10 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year