fast trainer for educational purposes
☆26 · Apr 24, 2026 · Updated last week
Alternatives and similar repositories for mini_trainer
Users that are interested in mini_trainer are comparing it to the libraries listed below.
- Deep neural models for core NLP tasks ☆13 · Nov 9, 2017 · Updated 8 years ago
- A pytorch implementation of Deep Functional Map (FMNet). ☆15 · May 6, 2020 · Updated 5 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023) ☆24 · Oct 10, 2023 · Updated 2 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer ☆29 · Jul 25, 2023 · Updated 2 years ago
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts ☆29 · Feb 20, 2026 · Updated 2 months ago
- The official implementation of the paper "Large Scale Knowledge Washing" ☆10 · Jun 12, 2024 · Updated last year
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"… ☆17 · Jan 4, 2023 · Updated 3 years ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx… ☆12 · Oct 17, 2022 · Updated 3 years ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps ☆26 · Mar 27, 2026 · Updated last month
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks ☆33 · Sep 20, 2024 · Updated last year
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering ☆30 · Dec 2, 2022 · Updated 3 years ago
- ☆18 · Jan 17, 2024 · Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind ☆12 · May 29, 2023 · Updated 2 years ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning ☆27 · Jul 4, 2025 · Updated 9 months ago
- ☆11 · Dec 9, 2020 · Updated 5 years ago
- ☆12 · Jul 6, 2023 · Updated 2 years ago
- Detect-Then-Explain Framework for Text-to-SQL task ☆10 · Dec 6, 2023 · Updated 2 years ago
- Blog post ☆17 · Feb 16, 2024 · Updated 2 years ago
- ☆14 · Aug 18, 2022 · Updated 3 years ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆22 · Feb 23, 2026 · Updated 2 months ago
- An implementation of semi-supervised VAE for morphology reinflection. ☆26 · Jun 3, 2019 · Updated 6 years ago
- ☆12 · Dec 13, 2022 · Updated 3 years ago
- Collections of RLxLM experiments using minimal codes ☆14 · Feb 17, 2025 · Updated last year
- Provides a minimal implementation to extract FLAN datasets for further processing ☆11 · Feb 1, 2023 · Updated 3 years ago
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022) ☆16 · Feb 4, 2025 · Updated last year
- a starter-kit for jaynes, the cloud-agnostic launch library ☆17 · Apr 1, 2026 · Updated last month
- Logic Reinforcement Learning ☆21 · Oct 20, 2025 · Updated 6 months ago
- ☆13 · Feb 25, 2025 · Updated last year
- Russian dialog datasets parsers and crawlers. ☆15 · Sep 6, 2021 · Updated 4 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆20 · Jan 7, 2022 · Updated 4 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa ☆18 · Aug 30, 2019 · Updated 6 years ago
- [ICLR 2024] Dynamic Sparse Training with Structured Sparsity ☆22 · Apr 12, 2024 · Updated 2 years ago
- ☆54 · Aug 25, 2023 · Updated 2 years ago
- [ICLR 2023] This repository contains the official Pytorch implementation for the paper "Transformer-based model for symbolic regression v… ☆28 · Jul 2, 2025 · Updated 10 months ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models ☆34 · Jun 9, 2025 · Updated 10 months ago
- Coursework code for the Pattern Recognition course, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology (class of 2019) ☆21 · Nov 21, 2021 · Updated 4 years ago
- Collection of resources on plasticity loss in deep reinforcement learning ☆23 · Nov 12, 2024 · Updated last year
- A programming framework based on PyTorch for hybrid neural networks with automatic quantization ☆20 · Apr 11, 2024 · Updated 2 years ago
- ☆25 · Aug 19, 2024 · Updated last year