conceptofmind / PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆14Updated 3 years ago
Alternatives and similar repositories for PaLM-flax
Users who are interested in PaLM-flax are comparing it to the libraries listed below.
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 3 years ago
- Submissions for AI and Efficiency SOTA's☆57Updated 5 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- ☆40Updated 2 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- ☆26Updated 3 years ago
- Learned Hyperparameter Optimizers☆59Updated 4 years ago
- RWKV model implementation☆38Updated 2 years ago
- Python Research Framework☆106Updated 2 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 3 years ago
- 3rd party dependencies for DALI project☆10Updated this week
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆31Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- 👑 Pytorch code for the Nero optimiser.☆20Updated 3 years ago
- ☆15Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆18Updated 2 years ago
- ☆21Updated 2 years ago
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Updated 3 months ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15Updated last year
- Fine-grained, dynamic control of neural network topology in JAX.☆21Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 4 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks☆60Updated 2 years ago
- Alphazero on GPU thanks to CUDA.jl☆32Updated 4 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- ☆52Updated last year
- ☆70Updated last year
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- I clearly unravel how I came to invent the supermanifold hypothesis in deep learning, (a part of a system called 'thought curvature') in …☆20Updated 2 years ago
- Hugging Face and Pyserini interoperability☆19Updated 2 years ago