thepowerfuldeez / sample_efficient_gpt
Training framework aimed at exploring the frontier of sample efficiency in small language models
★ 81 · Updated last week
Alternatives and similar repositories for sample_efficient_gpt
Users interested in sample_efficient_gpt are comparing it to the repositories listed below.
- NanoGPT-speedrunning for the poor T4 enjoyers · ★ 73 · Updated 7 months ago
- Small Batch Size Training for Language Models · ★ 68 · Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… · ★ 120 · Updated 2 months ago
- Simple repository for training small reasoning models · ★ 47 · Updated 10 months ago
- Collection of autoregressive model implementations · ★ 85 · Updated 7 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" · ★ 85 · Updated 3 months ago
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) · ★ 108 · Updated 9 months ago
- LLM training in simple, raw C/CUDA · ★ 15 · Updated last year
- Simple GRPO scripts and configurations · ★ 59 · Updated 10 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources · ★ 148 · Updated 2 months ago
- Tiny re-implementation of MDM in the style of LLaDA and the nano-gpt speedrun · ★ 57 · Updated 9 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…" · ★ 62 · Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding · ★ 173 · Updated 11 months ago
- Jax-like function transformation engine, but micro: microjax · ★ 34 · Updated last year
- DeMo: Decoupled Momentum Optimization · ★ 197 · Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think · ★ 70 · Updated this week
- H-Net Dynamic Hierarchical Architecture · ★ 80 · Updated 3 months ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training · ★ 132 · Updated last year
- Compiling useful links, papers, benchmarks, ideas, etc. · ★ 45 · Updated 8 months ago
- RL from zero pretrain: can it be done? Yes. · ★ 282 · Updated 2 months ago
- Supporting code for the blog post on modular manifolds · ★ 105 · Updated 2 months ago
- Normalized Transformer (nGPT) · ★ 193 · Updated last year