conceptofmind / PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆13Updated 2 years ago
Alternatives and similar repositories for PaLM-flax
Users that are interested in PaLM-flax are comparing it to the libraries listed below
Sorting:
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 2 years ago
- ☆39Updated 2 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 3 years ago
- Hugging Face and Pyserini interoperability☆20Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Latent Large Language Models☆18Updated 8 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- RWKV model implementation☆37Updated last year
- Hugging Face's Zapier Integration 🤗⚡️☆48Updated 2 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- Rust bindings for CTranslate2☆14Updated last year
- ☆15Updated 2 years ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- ☆59Updated 3 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 3 years ago
- GoldFinch and other hybrid transformer components☆10Updated this week
- Utilities for Training Very Large Models☆58Updated 7 months ago
- A JAX nn library☆21Updated 2 months ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆53Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- ☆18Updated last year
- One stop shop for all things carp☆59Updated 2 years ago
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆55Updated last month
- ☆26Updated 2 years ago
- The first AI artist☆32Updated 2 years ago
- Python Research Framework☆106Updated 2 years ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- Implementation of Metaformer, but in an autoregressive manner☆24Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year