CarperAI / OpenELMLinks
Evolution Through Large Models
β732Updated last year
Alternatives and similar repositories for OpenELM
Users that are interested in OpenELM are comparing it to the libraries listed below
Sorting:
- Code for Parsel π - generate complex programs with language modelsβ432Updated 2 years ago
- β546Updated last year
- β415Updated last year
- β865Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.β308Updated 2 years ago
- β1,048Updated last year
- Ask Me Anything language model promptingβ546Updated 2 years ago
- Minimal library to train LLMs on TPU in JAX with pjit().β298Updated last year
- A repository for research on medium sized language models.β514Updated 4 months ago
- Inference code for Persimmon-8Bβ413Updated 2 years ago
- Reflexion: an autonomous agent with dynamic memory and self-reflectionβ388Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathwaysβ823Updated 2 years ago
- Dromedary: towards helpful, ethical and reliable LLMs.β1,142Updated last month
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diveβ¦β965Updated 11 months ago
- Ongoing research training transformer models at scaleβ392Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"β1,062Updated last year
- PaL: Program-Aided Language Models (ICML 2023)β511Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.β770Updated 11 months ago
- An open-source implementation of Google's PaLM modelsβ820Updated last year
- β304Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.β825Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agentsβ555Updated last year
- Salesforce open-source LLMs with 8k sequence length.β722Updated 8 months ago
- Language Modeling with the H3 State Space Modelβ518Updated 2 years ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retrainingβ722Updated last year
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinkingβ269Updated last year
- A method to fix GPT-3 after deployment with user feedback, without re-training.β330Updated 2 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neurβ¦β548Updated 8 months ago
- β666Updated 11 months ago
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration caβ¦β1,505Updated 2 months ago