conceptofmind / PaLM-flaxLinks
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆14Updated 3 years ago
Alternatives and similar repositories for PaLM-flax
Users that are interested in PaLM-flax are comparing it to the libraries listed below
Sorting:
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 3 years ago
- Submissions for AI and Efficiency SOTA's☆57Updated 5 years ago
- 👑 Pytorch code for the Nero optimiser.☆20Updated 3 years ago
- ☆26Updated 3 years ago
- Learned Hyperparameter Optimizers☆60Updated 4 years ago
- 3rd party dependencies for DALI project☆10Updated last week
- ☆62Updated 3 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- Python Research Framework☆106Updated 3 years ago
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datas…☆24Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆17Updated 3 years ago
- Hugging Face and Pyserini interoperability☆19Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆74Updated 4 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆45Updated 2 years ago
- ☆20Updated 2 years ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆54Updated 2 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Updated last year
- RWKV model implementation☆38Updated 2 years ago
- Train very large language models in Jax.☆209Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Fine-grained, dynamic control of neural network topology in JAX.☆21Updated 2 years ago
- First-order logic theorem prover supporting unification with approximate vector similarity☆13Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆42Updated 2 years ago
- ☆14Updated 2 years ago
- JAX implementations of RWKV☆19Updated 2 years ago