LiibanMo / scikit-jax
Your favourite classical machine learning algos on the GPU/TPU
☆20Updated 2 months ago
Alternatives and similar repositories for scikit-jax:
Users that are interested in scikit-jax are comparing it to the libraries listed below
- ☆22Updated 6 months ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 months ago
- Graph neural networks in JAX.☆67Updated 9 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated last year
- ☆47Updated 4 months ago
- ☆27Updated 8 months ago
- Simple Scalable Discrete Diffusion for text in PyTorch☆33Updated 6 months ago
- An implementation of ESM2 in Equinox+JAX☆25Updated last month
- Generative cellular automaton-like learning environments for RL.☆19Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 5 months ago
- Agent framework for constructing language model agents and training on constructive tasks.☆68Updated last week
- Code for minimum-entropy coupling.☆31Updated 9 months ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆52Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆76Updated 3 weeks ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 8 months ago
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster☆63Updated last month
- ☆38Updated 8 months ago
- ☆11Updated last month
- ☆79Updated 11 months ago
- Code associated to papers on superposition (in ML interpretability)☆28Updated 2 years ago
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- Official implementation of "BERTs are Generative In-Context Learners"☆26Updated 2 weeks ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 10 months ago
- ☆28Updated last month
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated 2 months ago
- ☆42Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- Because we don't want a jupyter notebook mess...☆62Updated 2 weeks ago
- Fine-grained, dynamic control of neural network topology in JAX.☆21Updated last year