cedrickchee / transformers-llamaLinks

LLaMA implementation for HuggingFace Transformers

☆38

Alternatives and similar repositories for transformers-llama

Users that are interested in transformers-llama are comparing it to the libraries listed below

Sorting:

deep-diver / gradio-chat
HuggingChat like UI in Gradio
☆71Updated 2 years ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117Updated 2 years ago
FreedomIntelligence / GPT-API-Accelerate
The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…
☆23Updated 8 months ago
geronimi73 / 3090_shorts
minimal scripts for 24GB VRAM GPUs. training, inference, whatever
☆40Updated last week
lxe / llama-tune
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
☆51Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated last year
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆43Updated 8 months ago
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆71Updated 9 months ago
kyegomez / phi-1
Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation
☆76Updated last year
my-other-github-account / llm-humaneval-benchmarks
☆84Updated 2 years ago
khaimt / qa_expert
This repo is for handling Question Answering, especially for Multi-hop Question Answering
☆67Updated last year
LAION-AI / blade2blade
Adversarial Training and SFT for Bot Safety Models
☆40Updated 2 years ago
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57Updated 2 months ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆41Updated 7 months ago
togethercomputer / Llama-2-7B-32K-Instruct
☆84Updated last year
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆100Updated 11 months ago
soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆43Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
toufunao / SCM4LLMs
☆33Updated 2 years ago
kyegomez / Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
☆98Updated last year
gigio1023 / alpaca-lora-for-huggingface
Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel
☆24Updated 2 years ago
swj0419 / detect-pretrain-code-contamination
☆76Updated last year
laramohan / wikillm
LLMs as Collaboratively Edited Knowledge Bases
☆45Updated last year
deep-diver / LLM-Pref-Mark-UI
☆37Updated 2 years ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆41Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103Updated last month
THU-KEG / ChatLog
⏳ ChatLog: Recording and Analysing ChatGPT Across Time
☆99Updated last year
mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆14Updated 2 years ago