lightonai / lairgpt
Inference code in Pytorch for GPT-like models, such as PAGnol, a family of models with up to 1.5B parameters, trained on datasets in French.
☆20Updated last year
Related projects: ⓘ
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆22Updated 3 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated last year
- Standalone pre-training recipe with JAX+Flax☆31Updated last year
- A framework for implementing equivariant DL☆10Updated 3 years ago
- Python client for the LightOn Muse API☆14Updated 2 years ago
- ☆18Updated 2 years ago
- ☆31Updated 2 years ago
- ☆11Updated 4 years ago
- ✨🌲 Hierarchical extreme multiclass and multi-label classification.☆16Updated last year
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆84Updated 2 years ago
- ☆42Updated 3 years ago
- A biologically inspired method to create sparse, binary word vectors☆36Updated 2 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆70Updated 3 years ago
- Code for scaling Transformers☆26Updated 3 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- Code to perform Model-Free Episodic Control using Aurora OPUs☆17Updated 4 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Updated 2 years ago
- Implementation of N-Grammer in Flax☆16Updated last year
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆50Updated 2 years ago
- A list of resources dedicated to compositionality☆14Updated 5 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- Conformational exploration SARS-CoV-2 (coronavirus responsible for COVID-19)☆16Updated 2 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime.☆47Updated 3 years ago
- ☆108Updated last year
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆146Updated 3 years ago
- A JAX implementation of stochastic addition.☆12Updated 2 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆21Updated 4 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Updated 3 years ago
- Python implementation of supervised PCA, supervised random projections, and their kernel counterparts.☆20Updated 4 years ago