lutzroeder / models
A minimal version of GPT-2 in 175 lines of PyTorch code.
☆41Updated last week
Alternatives and similar repositories for models:
Users that are interested in models are comparing it to the libraries listed below
- alternative way to calculating self attention☆18Updated 11 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 8 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆28Updated 3 months ago
- An implementation of delta-iris in tinygrad☆72Updated 8 months ago
- ☆61Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 7 months ago
- LLM as a Chatbot Service☆16Updated last year
- Testing KAN-based text generation GPT models☆16Updated 11 months ago
- The package used to build the documentation of our Hugging Face repos☆110Updated last week
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated last year
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆23Updated 5 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- https://mlabonne.github.io/blog/☆36Updated last month
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆23Updated this week
- Rust Implementation of micrograd☆51Updated 9 months ago
- ☆24Updated last year
- A 7B parameter model for mathematical reasoning☆29Updated 2 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆16Updated 3 years ago
- a simplified version of Google's Gemma model to be used for learning☆24Updated last year
- ☆35Updated 2 years ago
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗☆70Updated 5 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Tensor library for Zig☆11Updated 5 months ago
- Verbosity control for AI agents☆62Updated 11 months ago
- "PyTorch in Rust"☆16Updated last year
- Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.☆90Updated last year