BobMcDear / flaim
Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.
☆19Updated last year
Alternatives and similar repositories for flaim
Users that are interested in flaim are comparing it to the libraries listed below
Sorting:
- Training hybrid models for dummies.☆21Updated 4 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆13Updated 9 months ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Personal solutions to the Triton Puzzles☆18Updated 9 months ago
- ☆18Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 3 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆28Updated 4 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆18Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 11 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- RWKV model implementation☆37Updated last year
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Updated 4 years ago
- ☆21Updated 5 months ago
- Because it's there.☆16Updated 7 months ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 4 years ago
- ☆27Updated last year
- ☆20Updated 3 years ago
- ☆13Updated 8 months ago
- GoldFinch and other hybrid transformer components☆10Updated this week
- 🧮 Algebraic Positional Encodings.☆13Updated 4 months ago
- Describe the format of image/text datasets☆11Updated 3 years ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 7 months ago
- ☆16Updated last year
- Implementation of N-Grammer in Flax☆17Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆15Updated 3 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆12Updated last month
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago