galatolofederico/vanilla-llama

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/galatolofederico/vanilla-llama)

galatolofederico / vanilla-llama

Plain pytorch implementation of LLaMA

☆187

Alternatives and similar repositories for vanilla-llama

Users that are interested in vanilla-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

henrywoo / pyllama
View on GitHub
LLaMA: Open and Efficient Foundation Language Models
☆2,785Nov 8, 2023Updated 2 years ago
cedrickchee / llama
View on GitHub
Inference code for LLaMA 2 models
☆30Jul 7, 2024Updated last year
nju-websoft / KnowLA
View on GitHub
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024
☆16Jul 29, 2024Updated last year
Jingyu6 / speculative_prefill
View on GitHub
☆59May 19, 2025Updated 11 months ago
AmericanPresidentJimmyCarter / yal-discord-bot
View on GitHub
Yet Another LLaMA/ALPACA Discord Bot
☆69Apr 15, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ConiferLabsWA / flan-ul2-alpaca
View on GitHub
☆33Apr 23, 2023Updated 3 years ago
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,945Jul 29, 2024Updated last year
pointnetwork / point-alpaca
View on GitHub
☆404Mar 22, 2023Updated 3 years ago
tloen / llama-int8
View on GitHub
Quantized inference code for LLaMA models
☆1,040Mar 17, 2023Updated 3 years ago
swayfreeda / IGITBuildingReconstruction
View on GitHub
A softeware for image based building modeling.
☆15Nov 26, 2014Updated 11 years ago
zphang / minimal-llama
View on GitHub
☆456Oct 15, 2023Updated 2 years ago
wangcunxiang / Graph-aS-Tokens
View on GitHub
☆10Nov 29, 2024Updated last year
FSoft-AI4Code / CodeCapybara
View on GitHub
Open-source Self-Instruction Tuning Code LLM
☆172Apr 26, 2023Updated 3 years ago
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookexperimental / protoquant
View on GitHub
Prototype routines for GPU quantization written using PyTorch.
☆21Apr 15, 2026Updated 3 weeks ago
linjianz / tf-flownet
View on GitHub
This is an implementation of FlowNet with tensorflow
☆10Aug 5, 2017Updated 8 years ago
qwopqwop200 / GPTQ-for-LLaMa
View on GitHub
4 bits quantization of LLaMA using GPTQ
☆3,072Jul 13, 2024Updated last year
nju-websoft / MBE
View on GitHub
Inductive Knowledge Graph Reasoning for Multi-batch Emerging Entities, CIKM 2022
☆17Dec 27, 2022Updated 3 years ago
AetherCortex / Llama-X
View on GitHub
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,606Aug 30, 2023Updated 2 years ago
fbsamples / pytorch-quantization-workshop
View on GitHub
Code for a workshop hosted at the MLOps World Summit '22
☆18Jun 14, 2022Updated 3 years ago
colorfulscoop / sbert-ja
View on GitHub
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11Aug 8, 2021Updated 4 years ago
Koorye / pose-estimation
View on GitHub
基于Pytorch实现的姿态估计，实现了Hourglass、HRNet、Simple Baselines等模型的训练和测试
☆10Feb 23, 2022Updated 4 years ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆15Mar 13, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Mihaiii / backtrack_sampler
View on GitHub
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆150Jan 7, 2026Updated 3 months ago
shawnpang / startup-founder-skills
View on GitHub
AI agent skills for tech startup founders — fundraising, sales, product, recruiting, engineering, legal, ops, and growth. Works with Clau…
☆117Mar 16, 2026Updated last month
ZhangTeng2017 / SRCNN
View on GitHub
SRCNN论文复现
☆11Aug 2, 2018Updated 7 years ago
microsoft / EfficientLongSequenceModeling
View on GitHub
☆53Jan 19, 2023Updated 3 years ago
Tencent / TencentPretrain
View on GitHub
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
☆1,087Aug 4, 2024Updated last year
ConiferLabsWA / flan-ul2-dolly
View on GitHub
☆34Apr 23, 2023Updated 3 years ago
cdcepi / WNV-forecast-project-2023
View on GitHub
Data and forecast submission repository for the 2023 CDC West Nile virus Forecasting Challenge
☆11Oct 30, 2023Updated 2 years ago
bipashasen / INR-V-VideoGenerationSpace
View on GitHub
The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks
☆15Mar 31, 2023Updated 3 years ago
GanjinZero / math401-llm
View on GitHub
Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?
☆57Apr 17, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
IST-DASLab / gptq
View on GitHub
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
☆2,301Mar 27, 2024Updated 2 years ago
s-JoL / Open-Llama
View on GitHub
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆52May 17, 2023Updated 2 years ago
c-sky / xuantie-vector-demos
View on GitHub
☆10Sep 1, 2020Updated 5 years ago
radi-cho / botbots
View on GitHub
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…
☆164Apr 1, 2023Updated 3 years ago
Project-Splinter / MonoPortDataset
View on GitHub
☆13Jun 19, 2020Updated 5 years ago
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,084Jul 1, 2025Updated 10 months ago
practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆13Jun 21, 2023Updated 2 years ago