isaacperez / tinygpt
A tiny version of GPT fully implemented in Python with zero dependencies
☆63Updated 2 months ago
Alternatives and similar repositories for tinygpt:
Users that are interested in tinygpt are comparing it to the libraries listed below
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Updated last year
- A CLI to manage install and configure llama inference implemenation in multiple languages☆65Updated last year
- Generate ideal question-answers for testing RAG☆126Updated 2 weeks ago
- Tools for LLM agents.☆59Updated 2 months ago
- Mistral7B playing DOOM☆127Updated 7 months ago
- Ipython notebook copy of Andrej Karpathy's llama2.c☆23Updated last year
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆27Updated 2 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 9 months ago
- Visual inference exploration & experimentation playground☆87Updated 2 months ago
- Transformer GPU VRAM estimator☆50Updated 10 months ago
- 360M model running in the browser on WebGPU☆21Updated 6 months ago
- Chat strategies for LLMs☆92Updated 6 months ago
- A pure NumPy implementation of Mamba.☆219Updated 7 months ago
- a curated list of data for reasoning ai☆128Updated 6 months ago
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser…☆27Updated last year
- Structured Output Is All You Need!☆53Updated 11 months ago
- Autograd to GPT-2 completely from scratch☆110Updated 2 weeks ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Deepmark AI enables a unique testing environment for language models (LLM) assessment on task-specific metrics and on your own data so yo…☆105Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆277Updated last week
- A star for organising blocks and playing with transformers.☆23Updated 9 months ago
- ☆111Updated 2 weeks ago
- Analyze your image in seconds with AI☆60Updated 8 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆45Updated last week
- Embedding models from Jina AI☆58Updated last year
- Documentation for the Krixik Python client.☆38Updated 3 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆51Updated 6 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆217Updated 2 months ago