google-research / optformerLinks
☆226Updated this week
Alternatives and similar repositories for optformer
Users that are interested in optformer are comparing it to the libraries listed below
Sorting:
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆122Updated this week
- ☆63Updated 4 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆172Updated 2 years ago
- Evaluation of neuro-symbolic engines☆39Updated last year
- Tabular In-Context Learning☆84Updated 5 months ago
- ☆35Updated 8 months ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆100Updated 7 months ago
- Our maintained PFN repository. Come here to train SOTA PFNs.☆97Updated this week
- Gradient Boosting Reinforcement Learning (GBRL)☆118Updated 2 weeks ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆105Updated this week
- A MAD laboratory to improve AI architecture designs 🧪☆123Updated 7 months ago
- nanoGPT-like codebase for LLM training☆102Updated 2 months ago
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆153Updated 11 months ago
- Interpret text data using LLMs (scikit-learn compatible).☆169Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆149Updated last month
- Repository for code used in the xVal paper☆140Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 4 months ago
- Learning Universal Predictors☆78Updated last year
- Implementation of SOAR☆38Updated last week
- Discovering Data-driven Hypotheses in the Wild☆104Updated 2 months ago
- Extending Conformal Prediction to LLMs☆67Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆290Updated 11 months ago
- ☆174Updated 4 months ago
- Getting crystal-like representations with harmonic loss☆193Updated 4 months ago
- Cost aware hyperparameter tuning algorithm☆166Updated last year
- ☆81Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 3 years ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆176Updated 5 months ago
- Pre-trained Gaussian processes for Bayesian optimization☆94Updated 3 months ago