AmeyaWagh / llama2.cpp
Inference Llama 2 in C++
☆43Updated last year
Alternatives and similar repositories for llama2.cpp
Users who are interested in llama2.cpp are comparing it to the libraries listed below.
- Make triton easier☆50Updated last year
- Andrej Karpathy's micrograd implemented in C☆30Updated last year
- ☆55Updated last year
- A really tiny autograd engine☆99Updated 8 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last month
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- ☆101Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated 2 weeks ago
- LLM training in simple, raw C/CUDA☆112Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆93Updated 2 weeks ago
- Eh, simple and works.☆27Updated 2 years ago
- Rust Implementation of micrograd☆53Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated 2 years ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆64Updated 9 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆169Updated last year
- Very minimal (and stateless) agent framework☆44Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 4 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆118Updated last year
- ☆16Updated 8 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆72Updated 3 weeks ago
- Score LLM pretraining data with classifiers☆55Updated 2 years ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 10 months ago
- ☆19Updated last year
- A collection of reproducible inference engine benchmarks☆38Updated 9 months ago
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Scripts to create your own moe models using mlx☆90Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago