customizable template GPT code designed for easy novel architecture experimentation
☆26 · Mar 19, 2025 · Updated last year
Alternatives and similar repositories for templateGPT
Users that are interested in templateGPT are comparing it to the libraries listed below.
- a WIP architecture designed to allow transformers to think in a manner without tokens ☆20 · Apr 12, 2024 · Updated 2 years ago
- code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper ☆39 · Nov 24, 2023 · Updated 2 years ago
- ☆43 · Nov 16, 2021 · Updated 4 years ago
- The official Languini Kitchen repository ☆14 · May 6, 2024 · Updated 2 years ago
- "Head-to-Tail How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?" (NAACL 2024) ☆19 · Jul 1, 2024 · Updated last year
- Our solution of the Kaggle Abstraction and Reasoning Challenge ☆23 · May 30, 2020 · Updated 5 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script ☆14 · Mar 6, 2026 · Updated 2 months ago
- Remove generated stories with stray unicode characters ☆12 · Jan 3, 2024 · Updated 2 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch ☆31 · May 11, 2026 · Updated 2 weeks ago
- ☆23 · May 21, 2025 · Updated last year
- A copy of the DirectX Headers from MinGW-64. ☆14 · Sep 7, 2023 · Updated 2 years ago
- Vite + Mantine + Vanilla extract template ☆12 · May 14, 2026 · Updated 2 weeks ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated 2 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs ☆14 · Apr 23, 2026 · Updated last month
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)! ☆13 · Oct 22, 2023 · Updated 2 years ago
- Download TikTok videos online with TikTok Video Downloader. Completely free. ☆13 · May 19, 2026 · Updated last week
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆31 · Nov 14, 2023 · Updated 2 years ago
- A simple web-app for generating glassmorphism UI effect! ☆12 · Aug 5, 2023 · Updated 2 years ago
- PyTorch Implementation of FractalNet ☆28 · Dec 15, 2018 · Updated 7 years ago
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated last year
- Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability". ☆39 · Apr 13, 2026 · Updated last month
- ☆34 · Feb 9, 2026 · Updated 3 months ago
- A clone of Twitter made using React, Firebase ☆13 · May 17, 2021 · Updated 5 years ago
- An application that brings together several anime streaming platforms ☆12 · Mar 1, 2025 · Updated last year
- goat the GOAT LLM CLI ☆23 · Nov 20, 2025 · Updated 6 months ago
- Deepseek-CoT ☆10 · Oct 6, 2024 · Updated last year
- a neural network trainer for weebs ☆14 · May 18, 2026 · Updated last week
- The GIF-to-Chatter app you didn't know you needed! ☆15 · Feb 12, 2022 · Updated 4 years ago
- Using Reinforcement Learning algorithms to teach the computer to beat Super Mario Bros ☆40 · Aug 6, 2024 · Updated last year
- The advanced TypeScript Discord Moderation & Utilities bot made for big public server(s). Fully written in TypeScript and discord.js. ☆10 · Aug 3, 2023 · Updated 2 years ago
- Ember is a hosted API/SDK that lets you shape AI model behavior by directly controlling a model's internal units of computation, or "feat… ☆51 · Jul 14, 2025 · Updated 10 months ago
- Simple Model Similarities Analysis ☆21 · Feb 3, 2024 · Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆31 · Mar 3, 2025 · Updated last year
- ☆21 · Jun 26, 2023 · Updated 2 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners" ☆21 · Jun 29, 2024 · Updated last year
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ… ☆42 · Jul 18, 2025 · Updated 10 months ago
- ☆17 · Nov 8, 2023 · Updated 2 years ago
- A simple and modular imageboard scraper written in python ☆15 · Apr 24, 2023 · Updated 3 years ago
- Train your Self Driving RC Car or Donkey Car project quickly for free using Google Colab. ☆17 · Jul 6, 2019 · Updated 6 years ago