conceptofmind / PaLM-flax
Implementation of the SOTA Transformer architecture from PaLM - Scaling Language Modeling with Pathways in JAX/Flax
☆13Updated 2 years ago
Alternatives and similar repositories for PaLM-flax:
Users that are interested in PaLM-flax are comparing it to the libraries listed below
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- JAX implementations of RWKV☆19Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆186Updated 2 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 3 months ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 2 years ago
- Training hybrid models for dummies.☆20Updated last month
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Minimum Description Length probing for neural network representations☆18Updated 3 weeks ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated last week
- RWKV model implementation☆37Updated last year
- One stop shop for all things carp☆59Updated 2 years ago
- Latent Large Language Models☆17Updated 5 months ago
- Implementation of Metaformer, but in an autoregressive manner☆23Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 3 months ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- ☆26Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- ☆44Updated 8 months ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- ☆65Updated 2 years ago
- Python Research Framework☆106Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆18Updated 4 months ago
- ☆58Updated 2 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆15Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆18Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆15Updated 3 years ago