conceptofmind / PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆14 Updated 3 years ago
Alternatives and similar repositories for PaLM-flax
Users who are interested in PaLM-flax are comparing it to the libraries listed below.
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆189 Updated 3 years ago
- Submissions for AI and Efficiency SOTA's ☆56 Updated 5 years ago
- Learned Hyperparameter Optimizers ☆59 Updated 4 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 3 years ago
- Implementation of a holodeck, written in Pytorch ☆18 Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆74 Updated 6 months ago
- 👑 Pytorch code for the Nero optimiser. ☆20 Updated 3 years ago
- ☆40 Updated 3 years ago
- Hugging Face and Pyserini interoperability ☆19 Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆43 Updated 2 years ago
- Python Research Framework ☆107 Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 3 years ago
- Train very large language models in Jax. ☆210 Updated 2 years ago
- A GPT, made only of MLPs, in Jax ☆58 Updated 4 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 Updated 2 years ago
- One stop shop for all things carp ☆59 Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 4 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs ☆86 Updated last year
- ☆26 Updated 3 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆46 Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated 2 years ago
- RWKV model implementation ☆38 Updated 2 years ago
- JAX implementations of RWKV ☆19 Updated 2 years ago
- ☆63 Updated 3 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks ☆60 Updated 3 years ago
- ☆26 Updated 2 years ago
- ☆31 Updated last month
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datas… ☆24 Updated 2 years ago
- Code base for internal reward models and PPO training ☆24 Updated 2 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors` ☆47 Updated last year