booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆763Updated 6 months ago
Alternatives and similar repositories for recurrent-memory-transformer
Users that are interested in recurrent-memory-transformer are comparing it to the libraries listed below
Sorting:
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,630Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,129Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated 2 years ago
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,032Updated 9 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,060Updated last year
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆930Updated last month
- ☆458Updated last year
- A school for camelids☆1,210Updated 2 years ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆792Updated last year
- ☆1,030Updated last year
- ☆589Updated last year
- Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks☆604Updated 2 years ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆810Updated 10 months ago
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,459Updated 11 months ago
- ☆357Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated last year
- Official repo for MM-REACT☆948Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆587Updated last year
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆386Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA☆629Updated last year
- ☆444Updated 2 years ago
- Salesforce open-source LLMs with 8k sequence length.☆717Updated 3 months ago
- ☆405Updated 2 years ago
- Alpaca dataset from Stanford, cleaned and curated☆1,553Updated 2 years ago
- Tune any FALCON in 4-bit☆467Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆493Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆422Updated last year
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da…☆488Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,457Updated last year
- SkyAGI: Emerging human-behavior simulation capability in LLM☆780Updated last year