evintunador / templateGPT
customizable template GPT code designed for easy novel architecture experimentation
☆26 Updated 7 months ago
Alternatives and similar repositories for templateGPT
Users who are interested in templateGPT are comparing it to the libraries listed below.
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆173 Updated 10 months ago
- A compact LLM pretrained in 9 days by using high quality data ☆333 Updated 7 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆327 Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆202 Updated last year
- ☆136 Updated last year
- rl from zero pretrain, can it be done? yes. ☆280 Updated last month
- Long context evaluation for large language models ☆224 Updated 8 months ago
- ☆126 Updated 10 months ago
- ☆135 Updated 7 months ago
- Async RL Training at Scale ☆770 Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆450 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆145 Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 Updated last year
- ☆138 Updated 2 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆743 Updated last week
- This is our own implementation of 'Layer Selective Rank Reduction' ☆239 Updated last year
- Reference implementation of Mistral AI 7B v0.1 model. ☆28 Updated last year
- Plotting (entropy, varentropy) for small LMs ☆98 Updated 5 months ago
- Training-Ready RL Environments + Evals ☆174 Updated this week
- Fast bare-bones BPE for modern tokenizer training ☆168 Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆275 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆242 Updated last year
- Exploring Applications of GRPO ☆248 Updated 2 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆926 Updated 2 weeks ago
- smolLM with Entropix sampler on pytorch ☆150 Updated last year
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget ☆161 Updated 3 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆355 Updated 11 months ago
- An Open Source Toolkit For LLM Distillation ☆779 Updated 4 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆113 Updated last year
- ☆111 Updated 2 months ago