yiyixuxu / n-grammer-flax
Implementation of N-Grammer in Flax
☆17Updated 2 years ago
Alternatives and similar repositories for n-grammer-flax:
Users that are interested in n-grammer-flax are comparing it to the libraries listed below
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- ☆31Updated this week
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 6 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated 11 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆49Updated last year
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆36Updated last year
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 4 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Updated 4 years ago
- ☆11Updated 3 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 4 months ago
- A sample pattern for running CI tests on Modal☆16Updated 6 months ago
- Load any clip model with a standardized interface☆21Updated 11 months ago
- PyTorch implementation of GLOM☆21Updated 3 years ago
- Standalone pre-training recipe with JAX+Flax☆31Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆32Updated 10 months ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆32Updated 9 months ago
- Code for scaling Transformers☆26Updated 4 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Jax like function transformation engine but micro, microjax☆30Updated 5 months ago