AmeyaWagh / llama2.cpp
Inference Llama 2 in C++
☆45 Updated 6 months ago
Related projects
Alternatives and complementary repositories for llama2.cpp
- Make Triton easier ☆41 Updated 4 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models ☆32 Updated 4 months ago
- ☆93 Updated 2 months ago
- Using multiple LLMs for ensemble forecasting ☆16 Updated 9 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆20 Updated 4 months ago
- ☆19 Updated 3 months ago
- Scripts to create your own MoE models using MLX ☆86 Updated 8 months ago
- This repository contains a fork of "language-models-trajectory-generators"; the goal is to test the same functionality with Mistral's LL… ☆19 Updated last month
- LLM training in simple, raw C/CUDA ☆12 Updated last month
- ☆51 Updated 6 months ago
- Fast approximate inference on a single GPU with sparsity-aware offloading ☆38 Updated 10 months ago
- Standalone command-line tool for compiling Triton kernels ☆15 Updated last month
- Minimal, clean code implementation of RAG with MLX using GGUF model weights ☆43 Updated 6 months ago
- Tools for merging pretrained large language models. ☆19 Updated 4 months ago
- ☆16 Updated last month
- Eh, simple and works. ☆27 Updated 11 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, in pure C. ☆21 Updated 4 months ago
- ☆40 Updated 3 weeks ago
- BH hackathon ☆14 Updated 7 months ago
- tiny_fnc_engine is a minimal Python library that provides a flexible engine for calling functions extracted from an LLM. ☆37 Updated last month
- ☆44 Updated 2 months ago
- An introduction to LLM sampling ☆18 Updated this week
- DPO, but faster 🚀 ☆20 Updated last week
- Alternative way to calculate self-attention ☆18 Updated 5 months ago
- Score LLM pretraining data with classifiers ☆55 Updated last year
- GPU benchmark ☆43 Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours ☆55 Updated 2 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆43 Updated this week