conceptofmind / PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆13 Updated 3 years ago
Alternatives and similar repositories for PaLM-flax
Users who are interested in PaLM-flax are comparing it to the libraries listed below:
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆187 Updated 3 years ago
- Submissions for AI and Efficiency SOTA's ☆56 Updated 5 years ago
- Learned Hyperparameter Optimizers ☆59 Updated 4 years ago
- Hugging Face and Pyserini interoperability ☆20 Updated 2 years ago
- A simple way to manage and store the data related to all your research papers! ☆18 Updated 2 years ago
- One stop shop for all things carp ☆59 Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆71 Updated 2 weeks ago
- Python Research Framework ☆106 Updated 2 years ago
- ☆20 Updated last year
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datas… ☆24 Updated last year
- ☆76 Updated last week
- Simple Autogpt with tree of thoughts ☆14 Updated 2 years ago
- ☆13 Updated 2 years ago
- ☆61 Updated 3 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆16 Updated 3 years ago
- Train very large language models in Jax. ☆204 Updated last year
- 👑 Pytorch code for the Nero optimiser. ☆20 Updated 2 years ago
- ☆61 Updated last week
- ☆34 Updated 2 years ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 Updated 2 years ago
- Scripts to parse arxiv documents for NLP tasks ☆18 Updated 2 years ago
- Implementation of Stable Diffusion from scratch [WORK IN PROGRESS] ☆22 Updated 2 years ago
- ☆14 Updated last year
- Fine-grained, dynamic control of neural network topology in JAX. ☆21 Updated last year
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h… ☆53 Updated 2 years ago
- RWKV model implementation ☆38 Updated 2 years ago
- ☆26 Updated 2 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors` ☆45 Updated last year