Step by step explanation/tutorial of llama2.c
☆234Oct 9, 2023Updated 2 years ago
Alternatives and similar repositories for llama2.c-for-dummies
Users that are interested in llama2.c-for-dummies are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Inference Llama 2 in one file of pure C☆19,631Aug 6, 2024Updated last year
- Reinforcement Learning Algorithms☆11Sep 9, 2021Updated 4 years ago
- minimal C implementation of speculative decoding based on llama2.c☆30Jul 15, 2024Updated last year
- ☆14Mar 28, 2014Updated 12 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 8 months ago
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 8 months ago
- llama INT4 cuda inference with AWQ☆54Jan 20, 2025Updated last year
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs.☆44Sep 7, 2025Updated 9 months ago
- Inference Llama 2 in one file of pure JavaScript(HTML)☆36May 20, 2025Updated last year
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- 2023년 고려대학교 MatKor 스터디 - Rust 기초 프로그래밍 + 인터프리터 만들기☆344Aug 10, 2023Updated 2 years ago
- Fast and slim Javascript implementation of AES in ECB and CTR modes☆15May 13, 2025Updated last year
- ☆12Jan 7, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Inference Llama 2 in one file of pure Python☆424Nov 21, 2025Updated 6 months ago
- Optimizing the Deployment of Tiny Transformers on Low-Power MCUs☆37Sep 2, 2024Updated last year
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Nov 22, 2023Updated 2 years ago
- LLM as a Chatbot Service☆17Aug 28, 2023Updated 2 years ago
- Port of GGML to C#☆13Jul 1, 2023Updated 2 years ago
- KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model to understand Korean instructions)☆1,576Oct 25, 2024Updated last year
- Telegram chatbot for ChatGPT that can be used personally☆11Apr 18, 2023Updated 3 years ago
- ☆20Apr 26, 2026Updated last month
- xgboost go wrapper for c_api☆22Apr 18, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Mixed precision training from scratch with Tensors and CUDA☆30May 14, 2024Updated 2 years ago
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆75Oct 1, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure 🔥☆2,124Feb 9, 2026Updated 4 months ago
- Implementation for IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Jun 9, 2026Updated last week
- AWS SageMaker를 이용한 MLOps와 LLMOps☆30Aug 4, 2023Updated 2 years ago
- 수능 국어 1등급에 도전하는 AI☆531Apr 2, 2026Updated 2 months ago
- 2019 AI Robotics Korea 1st NLP Study session [DONE]☆10Oct 10, 2019Updated 6 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- ☆11Sep 18, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Nov 9, 2017Updated 8 years ago
- Zig wrapper around the RISC-V SBI specification☆19Apr 27, 2026Updated last month
- Korean SAT leader board☆168Nov 20, 2025Updated 6 months ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 2 months ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆45Feb 27, 2025Updated last year
- 一个用Apple Metal实现的Llama和通义千问大模型本地推理☆10Apr 26, 2024Updated 2 years ago
- ☆12Aug 19, 2023Updated 2 years ago