fattorib/Little-GPT

GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!

☆20

Alternatives and similar repositories for Little-GPT

Users interested in Little-GPT are comparing it to the repositories listed below.

blookethaack / all-blooks
get all blooks
☆23 · Dec 11, 2023 · Updated 2 years ago
xjdr-alt / mla_blog_translation
☆13 · Jun 18, 2024 · Updated 2 years ago
horizonsdv1 / Minescraft2-Blooket-Cheats-not-mine-
Best Blooket Hacks
☆28 · Feb 10, 2023 · Updated 3 years ago
okarthikb / DPO
Implementation of Direct Preference Optimization
☆17 · Jul 17, 2023 · Updated 3 years ago
HarlynDN / WebCiteS
[ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
☆13 · Sep 11, 2024 · Updated last year
crowsonkb / dice-mc
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆33 · Jul 28, 2023 · Updated 2 years ago
ganprad / rentorbuy
A project that uses Zillow research data on Quandl, Prophet for time series forecasting, Altair for Vega-Lite charts and Folium for an cr…
☆12 · Dec 8, 2022 · Updated 3 years ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆47 · Aug 29, 2024 · Updated last year
facebookresearch / Opacus-lab
Research and experimental code related to Opacus, an open-source library for training PyTorch models with Differential Privacy
☆18 · Oct 9, 2024 · Updated last year
RiccardoBiosas / LeanGPT
Experiments with interactive theorem provers, LLMs and formal systems
☆23 · Jul 10, 2023 · Updated 3 years ago
Smash15195 / Traduction-Francaise-The-Walking-Dead-EP1-PC
French translation of The Walking Dead Episode 1 (Telltale)
☆27 · Dec 25, 2012 · Updated 13 years ago
cjlovering / interpretable-reinforcement-learning-using-attention
[NeurIPS, 2020 - Reproducibility Challenge]: [RE] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
☆13 · Apr 26, 2021 · Updated 5 years ago
dwyl / hits-nodejs
General purpose hits (page views) counter written in Node.js, backed by the filesystem. (MVP)
☆14 · Oct 7, 2022 · Updated 3 years ago
pacman100 / peft-codegen-25
☆23 · Jul 10, 2023 · Updated 3 years ago
CrimsonCrips / AlexsMobsInteraction
☆27 · Apr 21, 2026 · Updated 3 months ago
ionathanch / msc-thesis
LaTeX source for Sized Dependent Types via Extensional Type Theory
☆12 · Jan 28, 2026 · Updated 5 months ago
UoA-CARES-Student / TalkingFaceGeneration-with-Emotion
Talking Face Generation system
☆17 · Oct 16, 2023 · Updated 2 years ago
JuliaApproximation / QuasiArrays.jl
A package for representing quasi-arrays
☆11 · Jul 7, 2026 · Updated 2 weeks ago
vaguenebula / AlpacaDataReflect
An experiment to see if ChatGPT can improve the output of the Stanford Alpaca dataset
☆12 · Mar 29, 2023 · Updated 3 years ago
Njaecha / ObjImport
Mod for Koikatsu/Koikatsu Sunshine that lets you import .obj files into Charastudio.
☆11 · Sep 28, 2025 · Updated 9 months ago
balena-io-experimental / cellular-test
A sample app to debug and validate cellular modems on balena devices
☆13 · Jun 5, 2019 · Updated 7 years ago
hekike / ES6-Immutable-React
React 0.13 with ES6, Immutable.js and Flux, isomorphic as well
☆11 · Mar 10, 2015 · Updated 11 years ago
SungjoonPark / FactorRotation
Rotated Word Vector Representations and their Interpretability (EMNLP 2017)
☆18 · Jul 13, 2019 · Updated 7 years ago
aslanismailgit / HuggingFace-Transformers-Model-Docker-Container
HuggingFace Transformers Model Docker Container
☆18 · Aug 16, 2021 · Updated 4 years ago
ArticuNode / PreppieGit
A C# GitHub client with a WinForms UI
☆16 · Jul 21, 2018 · Updated 8 years ago
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers
☆139 · Mar 14, 2024 · Updated 2 years ago
bentherien / mu_learned_optimization
[Poster; ICLR 2026] [Oral; NeurIPS OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers
☆16 · Apr 15, 2026 · Updated 3 months ago
AleMorales / UnitfulDual.jl
Dual numbers compatible with physical units
☆11 · Dec 4, 2023 · Updated 2 years ago
michaelshiyu / kerNET
Modular learning for deep classifiers
☆23 · Feb 4, 2022 · Updated 4 years ago
Nicolas-BZRD / llm-distillation
☆11 · Feb 3, 2025 · Updated last year
PhantomWing / RusticDelight
☆26 · Jul 13, 2026 · Updated last week
coin-au-carre / datetimepp
C++ port of Python datetime
☆25 · Apr 9, 2024 · Updated 2 years ago
msobroza / compositional_code_learning
Reproduces the results of the paper "Compressing Word Embeddings via Deep Compositional Code Learning", accepted at ICLR 2018
☆24 · May 16, 2018 · Updated 8 years ago
Alexthw46 / Ars-Elemental
Add-on to Ars Nouveau, based on elemental stuff
☆27 · Jul 9, 2026 · Updated 2 weeks ago
g588928812 / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆11 · Jul 22, 2023 · Updated 3 years ago
VaclavSynacek / react-isomorphic-static-site-generator-boilerplate
Yet another isomorphic React boilerplate. This one does not require Node on the server.
☆10 · Apr 4, 2017 · Updated 9 years ago
britt-allen / classifying_reddit_posts
Leveraging NLP and supervised learning methods to classify posts scraped via Reddit's API
☆11 · Feb 5, 2019 · Updated 7 years ago
multimeric / RustLangRetweet
Rust bot that runs periodically on AWS Lambda and retweets any Tweets matching a query
☆17 · Mar 24, 2023 · Updated 3 years ago
charlesgery / viseagull
☆15 · Aug 29, 2021 · Updated 4 years ago