evintunador / FractalFormerLinks

A GPT with self-similar nested properties

☆20

Alternatives and similar repositories for FractalFormer

Users that are interested in FractalFormer are comparing it to the libraries listed below

Sorting:

nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
QuixiAI / kraken
☆66Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
sshh12 / multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
☆185Updated last year
jadechip / nanoXLSTM
The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41Updated last year
QuixiAI / grokadamw
☆134Updated 11 months ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated last week
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50Updated this week
facebookresearch / ExploreToM
Code for ExploreTom
☆84Updated 3 weeks ago
evintunador / matryoshkaGPT
GPTs inside of GPTs like Russian nesting dolls
☆9Updated last year
the-crypt-keeper / LLooM
Experimental LLM Inference UX to aid in creative writing
☆116Updated 7 months ago
kerekovskik / autologic
autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…
☆60Updated last year
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 2 months ago
tairov / QStarLearning.mojo
☆111Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 4 months ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆62Updated 5 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173Updated last year
normal-computing / extended-mind-transformers
☆122Updated 11 months ago
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year
SkunkworksAI / CodeFusion
☆15Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 4 months ago
kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
☆126Updated last month
evintunador / minGemma
a simplified version of Google's Gemma model to be used for learning
☆26Updated last year
flawedmatrix / mamba-ssm
Implementation of mamba with rust
☆87Updated last year
holo-q / OpenQ
The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)
☆1Updated 7 months ago