NVlabs / Forecasting-Model-Search
A system for automating selection and optimization of pre-trained models from the TAO Model Zoo
☆24Updated 9 months ago
Alternatives and similar repositories for Forecasting-Model-Search:
Users who are interested in Forecasting-Model-Search are comparing it to the libraries listed below:
- ☆52Updated 6 months ago
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- ☆59Updated 5 months ago
- ☆31Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 5 months ago
- Code for the paper "Function-Space Learning Rates"☆18Updated this week
- ☆31Updated 11 months ago
- Code accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆28Updated 4 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated last month
- Parallel Associative Scan for Language Models☆18Updated last year
- ☆79Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNs☆14Updated 2 months ago
- A JAX-like function transformation engine, but micro: microjax☆30Updated 5 months ago
- nanoGPT-like codebase for LLM training☆93Updated 2 weeks ago
- ☆30Updated 4 months ago
- ☆27Updated 9 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 3 months ago
- An implementation of the Llama architecture, to instruct and delight☆21Updated 3 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆71Updated 5 months ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆78Updated 11 months ago
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models"☆45Updated last year
- ☆32Updated 6 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 5 months ago
- ☆39Updated last year
- ☆53Updated last year