evintunador / FractalFormer
A GPT with self-similar nested properties
☆20Updated last year
Alternatives and similar repositories for FractalFormer:
Users that are interested in FractalFormer are comparing it to the libraries listed below
- ☆112Updated 4 months ago
- ☆66Updated 10 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 10 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- All the world is a play, we are but actors in it.☆49Updated this week
- Cerule - A Tiny Mighty Vision Model☆67Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 10 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆56Updated 2 months ago
- look how they massacred my boy☆63Updated 6 months ago
- ☆27Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 2 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- ☆28Updated last year
- Code for ExploreTom☆79Updated 4 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆23Updated 11 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- The next evolution of Agents☆48Updated this week
- Genertaes control vectors for use with llama.cpp in GGUF format.☆22Updated last month
- a simplified version of Google's Gemma model to be used for learning☆24Updated last year
- 1.58-bit LLaMa model☆81Updated last year
- ☆129Updated 8 months ago
- GPT-2 small trained on phi-like data☆66Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 9 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 9 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆33Updated 9 months ago
- ☆130Updated this week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago