Training framework with a goal to explore the frontier of sample efficiency of small language models
☆98Jan 25, 2026Updated last month
Alternatives and similar repositories for sample_efficient_gpt
Users that are interested in sample_efficient_gpt are comparing it to the libraries listed below
Sorting:
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 4 months ago
- ☆18Nov 25, 2022Updated 3 years ago
- Multi-group Gaussian process (MGGP)☆23Jul 24, 2024Updated last year
- ☆46May 20, 2025Updated 9 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- ☆37Aug 4, 2025Updated 6 months ago
- Educational WIP☆68Feb 16, 2026Updated 2 weeks ago
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- Jax Codebase for Evolutionary Strategies at the Hyperscale☆228Updated this week
- patches for huggingface transformers to save memory☆34Jun 2, 2025Updated 9 months ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 4 months ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- Primus-SaFE(Stability and Fault Endurance)☆52Updated this week
- Tree-structured recurrent switching linear dynamical systems☆38Jul 13, 2020Updated 5 years ago
- DPO, but faster 🚀☆48Dec 6, 2024Updated last year
- ☆13Sep 11, 2014Updated 11 years ago
- Source code repository for the AISTAT 2023 paper Transport Reversible Jump Proposals.☆10Mar 3, 2023Updated 3 years ago
- ☆10Oct 2, 2024Updated last year
- ☆10Sep 4, 2025Updated 5 months ago
- ☆11Jul 25, 2023Updated 2 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- ☆34Sep 22, 2025Updated 5 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆96Oct 30, 2024Updated last year
- nanobody melting temperature prediction using protein embeddings☆11Feb 24, 2025Updated last year
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47May 31, 2024Updated last year
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- Cinder support for Azure Kinect depth capture device.☆12Nov 20, 2023Updated 2 years ago
- Drishti | An Open mHealth sense-plan-act framework based on FHIR!☆11Oct 7, 2022Updated 3 years ago
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 2 years ago
- ☆13Nov 28, 2025Updated 3 months ago
- React 0.13 with ES6, Immutable.js and Flux, Isomorphic as well☆11Mar 10, 2015Updated 10 years ago
- ☆10Jun 14, 2024Updated last year
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and produc…☆10Dec 25, 2024Updated last year
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- Demonstrating technical elements in support of open source securitisation frameworks☆14Sep 5, 2024Updated last year
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year