olivkoch / nano-trmLinks
☆36Updated this week
Alternatives and similar repositories for nano-trm
Users that are interested in nano-trm are comparing it to the libraries listed below
Sorting:
- Simple high-throughput inference library☆149Updated 6 months ago
- ☆11Updated last year
- ☆40Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- Training Models Daily☆16Updated last year
- look how they massacred my boy☆63Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 8 months ago
- ☆62Updated 4 months ago
- webgpu autograd library☆33Updated 5 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 6 months ago
- Tokun to can tokens☆18Updated 5 months ago
- RWKV-7: Surpassing GPT☆100Updated last year
- ☆105Updated 3 months ago
- Latent Large Language Models☆19Updated last year
- new optimizer☆20Updated last year
- https://hf.co/hexgrad/Kokoro-82M☆14Updated 9 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 8 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- ☆43Updated last month
- Project code for training LLMs to write better unit tests + code☆21Updated 6 months ago
- smolLM with Entropix sampler on pytorch☆150Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆53Updated 2 months ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆66Updated last year
- Implementation of mamba with rust☆88Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 2 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- ☆136Updated last year
- ☆39Updated 6 months ago
- Collection of autoregressive model implementation☆86Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 6 months ago