Plain pytorch implementation of LLaMA
☆187May 22, 2023Updated 2 years ago
Alternatives and similar repositories for vanilla-llama
Users that are interested in vanilla-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaMA: Open and Efficient Foundation Language Models☆2,785Nov 8, 2023Updated 2 years ago
- Inference code for LLaMA 2 models☆30Jul 7, 2024Updated last year
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- ☆59May 19, 2025Updated 11 months ago
- Yet Another LLaMA/ALPACA Discord Bot☆69Apr 15, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆33Apr 23, 2023Updated 3 years ago
- Instruct-tune LLaMA on consumer hardware☆18,945Jul 29, 2024Updated last year
- ☆404Mar 22, 2023Updated 3 years ago
- Quantized inference code for LLaMA models☆1,040Mar 17, 2023Updated 3 years ago
- A softeware for image based building modeling.☆15Nov 26, 2014Updated 11 years ago
- ☆456Oct 15, 2023Updated 2 years ago
- ☆10Nov 29, 2024Updated last year
- Open-source Self-Instruction Tuning Code LLM☆172Apr 26, 2023Updated 3 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆19Mar 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Prototype routines for GPU quantization written using PyTorch.☆21Apr 15, 2026Updated 3 weeks ago
- This is an implementation of FlowNet with tensorflow☆10Aug 5, 2017Updated 8 years ago
- 4 bits quantization of LLaMA using GPTQ☆3,072Jul 13, 2024Updated last year
- Inductive Knowledge Graph Reasoning for Multi-batch Emerging Entities, CIKM 2022☆17Dec 27, 2022Updated 3 years ago
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,606Aug 30, 2023Updated 2 years ago
- Code for a workshop hosted at the MLOps World Summit '22☆18Jun 14, 2022Updated 3 years ago
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub☆11Aug 8, 2021Updated 4 years ago
- 基于Pytorch实现的姿态估计,实现了Hourglass、HRNet、Simple Baselines等模型的训练和测试☆10Feb 23, 2022Updated 4 years ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Jan 7, 2026Updated 3 months ago
- AI agent skills for tech startup founders — fundraising, sales, product, recruiting, engineering, legal, ops, and growth. Works with Clau…☆117Mar 16, 2026Updated last month
- SRCNN论文复现☆11Aug 2, 2018Updated 7 years ago
- ☆53Jan 19, 2023Updated 3 years ago
- Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo☆1,087Aug 4, 2024Updated last year
- ☆34Apr 23, 2023Updated 3 years ago
- Data and forecast submission repository for the 2023 CDC West Nile virus Forecasting Challenge☆11Oct 30, 2023Updated 2 years ago
- The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks☆15Mar 31, 2023Updated 3 years ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Apr 17, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".☆2,301Mar 27, 2024Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆52May 17, 2023Updated 2 years ago
- ☆10Sep 1, 2020Updated 5 years ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆164Apr 1, 2023Updated 3 years ago
- ☆13Jun 19, 2020Updated 5 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,084Jul 1, 2025Updated 10 months ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago