Training framework with a goal to explore the frontier of sample efficiency of small language models
☆100Jan 25, 2026Updated 4 months ago
Alternatives and similar repositories for sample_efficient_gpt
Users that are interested in sample_efficient_gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- decontamination☆33Mar 4, 2026Updated 2 months ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆13Aug 9, 2022Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- nanobody melting temperature prediction using protein embeddings☆12Feb 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆10Nov 30, 2024Updated last year
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- ☆18Apr 9, 2025Updated last year
- Code for Spectral Norm of Convolutional Layers with Circular and Zero Paddings and Efficient Bound of Lipschitz Constant for Convolutiona…☆15Feb 2, 2024Updated 2 years ago
- ☆13Jul 2, 2024Updated last year
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and produc…☆10Dec 25, 2024Updated last year
- This is my GitHub main page that highlights a list of projects, experience and certifications in the field of data science, machine learn…☆10Apr 6, 2026Updated last month
- ☆49May 20, 2025Updated last year
- Spectral Sphere Optimizer☆118Mar 23, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ludic – an LLM-RL library for the era of experience☆63Jan 9, 2026Updated 4 months ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Feb 20, 2022Updated 4 years ago
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- ☆13May 30, 2024Updated last year
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- [ 👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead.☆23Dec 9, 2025Updated 5 months ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- prediction market indexer with semantic search☆37Jan 27, 2026Updated 3 months ago
- ☆40Mar 26, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- Generic build server☆65May 25, 2014Updated 12 years ago
- ☆138Mar 20, 2025Updated last year
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 3 years ago
- Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.☆15Dec 16, 2016Updated 9 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- minGPT in JAX☆49Jan 10, 2022Updated 4 years ago
- text classification using ELMO☆16Dec 8, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated 2 months ago
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆24Jun 8, 2025Updated 11 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆108May 6, 2026Updated 2 weeks ago
- ☆21Dec 9, 2025Updated 5 months ago
- Educational WIP☆70Feb 16, 2026Updated 3 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆35Mar 8, 2025Updated last year
- ☆27Apr 14, 2025Updated last year