RahulSChand / llama2.c-for-dummies
Step by step explanation/tutorial of llama2.c
☆226 · Oct 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for llama2.c-for-dummies
Users who are interested in llama2.c-for-dummies are comparing it to the libraries listed below
- Load and run Llama from safetensors files in C ☆15 · Oct 24, 2024 · Updated last year
- Inference Llama 2 in one file of pure C ☆19,162 · Aug 6, 2024 · Updated last year
- minimal C implementation of speculative decoding based on llama2.c ☆25 · Jul 15, 2024 · Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆37 · Oct 9, 2025 · Updated 4 months ago
- Efficient Finetuning for OpenAI GPT-OSS ☆23 · Oct 2, 2025 · Updated 4 months ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API ☆35 · Sep 15, 2023 · Updated 2 years ago
- llama INT4 cuda inference with AWQ ☆54 · Jan 20, 2025 · Updated last year
- Inference Llama 2 in one file of pure JavaScript(HTML) ☆36 · May 20, 2025 · Updated 8 months ago
- 2023 Korea University MatKor study group: Rust programming basics + building an interpreter ☆346 · Aug 10, 2023 · Updated 2 years ago
- Some microbenchmarks and design docs before commencement ☆12 · Feb 1, 2021 · Updated 5 years ago
- Inference Llama/Llama2/Llama3 Models in NumPy ☆21 · Nov 22, 2023 · Updated 2 years ago
- A text expander that fully supports Hangeul. ☆62 · Feb 8, 2026 · Updated last week
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs. ☆44 · Sep 7, 2025 · Updated 5 months ago
- Inference Llama 2 in one file of pure Python ☆426 · Nov 21, 2025 · Updated 2 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆17 · Feb 5, 2026 · Updated last week
- LLM inference in C/C++ ☆23 · Oct 4, 2024 · Updated last year
- Mixed precision training from scratch with Tensors and CUDA ☆28 · May 14, 2024 · Updated last year
- A tiny reinforcement learning codebase for continuous control, built on top of JAX. ☆15 · Mar 28, 2023 · Updated 2 years ago
- My friend was solving Baekjoon Online Judge problems, so I'm trying them too ☆10 · May 24, 2020 · Updated 5 years ago
- ☆15 · Apr 26, 2025 · Updated 9 months ago
- socat version 2 ☆10 · Aug 30, 2012 · Updated 13 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹 ☆24 · Aug 2, 2025 · Updated 6 months ago
- ☆11 · Sep 18, 2023 · Updated 2 years ago
- Simple HTTP serving for PyTorch 🚀 ☆10 · Oct 15, 2020 · Updated 5 years ago
- Inference Llama 2 in one file of pure 🔥 ☆2,117 · Feb 9, 2026 · Updated last week
- KoAlpaca: An open-source language model to understand Korean instructions ☆1,577 · Oct 25, 2024 · Updated last year
- ☆12 · Sep 1, 2023 · Updated 2 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ☆15 · Oct 16, 2023 · Updated 2 years ago
- goroutine extension for c++ ☆13 · Sep 15, 2016 · Updated 9 years ago
- Visualize neural networks using TikZ in Julia ☆15 · Jan 29, 2025 · Updated last year
- Telegram chatbot for ChatGPT that can be used personally ☆11 · Apr 18, 2023 · Updated 2 years ago
- Port of GGML to C# ☆13 · Jul 1, 2023 · Updated 2 years ago
- Convert PDF into images faster with serverless architecture ☆11 · Nov 20, 2022 · Updated 3 years ago
- Demonstration of a factory pattern where the types automatically register themselves ☆13 · Mar 13, 2019 · Updated 6 years ago
- Local inference for the Llama and Tongyi Qianwen (Qwen) large models, implemented with Apple Metal ☆10 · Apr 26, 2024 · Updated last year
- A collection of instruction data and scripts for machine translation. ☆20 · Sep 23, 2023 · Updated 2 years ago
- An AI aiming for the top grade (Grade 1) on the Korean-language section of the Korean CSAT ☆531 · Oct 6, 2024 · Updated last year
- C++ implementation of Qwen-LM ☆615 · Dec 6, 2024 · Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,087 · Jul 1, 2025 · Updated 7 months ago