EleutherAGI / summarisation

The intermediate goal of the project is to train a GPT-like architecture to summarise Reddit posts from human preferences. OpenAI has already done this, so it provides a good benchmark to compare against. We will use this intermediate step to lay the groundwork needed for on-the-fly learning using implicit models.

☆12

Alternatives and similar repositories for summarisation

Users that are interested in summarisation are comparing it to the libraries listed below

lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25Updated 4 years ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Updated 4 years ago
antofuller / configaformers
A Python library for highly configurable transformers, easing model architecture search and experimentation.
☆49Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆59Updated 2 years ago
rajammanabrolu / Q-BERT
Agents that build knowledge graphs and explore textual worlds by asking questions
☆79Updated last year
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
EleutherAI / exploring-contrastive-topology
☆15Updated 3 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago
Sea-Snell / CALM-Dialogue
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Updated 2 years ago
EleutherAI / equivariance
A framework for implementing equivariant DL
☆10Updated 4 years ago
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
PAL-ML / PEARL_v1
☆30Updated 3 years ago
ekinakyurek / google-research
Google Research
☆46Updated 2 years ago
frankaging / Reason-SCAN
ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…
☆20Updated 3 years ago
allenai / dream
☆24Updated 11 months ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆19Updated 2 years ago
AranKomat / Metroplex
☆21Updated 2 years ago
allenai / interscript
The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.
☆11Updated 3 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49Updated 3 years ago
NohTow / PPL-MCTS
Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22
☆66Updated 2 years ago
crowsonkb / dice-mc
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆31Updated 2 years ago
HazyResearch / ludwig-benchmarking-toolkit
Ludwig benchmark
☆20Updated 3 years ago
peterbhase / SLAG-Belief-Updating
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28Updated 3 years ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
google-deepmind / affordances_option_models
☆23Updated 3 years ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27Updated 2 years ago
UKPLab / on-emergence
Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"
☆33Updated 6 months ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago