Rallio67 / language-model-agentsLinks

Experiments with generating opensource language model assistants

☆97

Alternatives and similar repositories for language-model-agents

Users that are interested in language-model-agents are comparing it to the libraries listed below

Sorting:

kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
google-research-datasets / presto
A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
☆114Updated 2 years ago
huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆178Updated 2 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
shayne-longpre / a-pretrainers-guide
☆72Updated 2 years ago
EleutherAI / semantic-memorization
☆44Updated 8 months ago
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆185Updated 3 weeks ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆310Updated 2 years ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
LAION-AI / blade2blade
Adversarial Training and SFT for Bot Safety Models
☆40Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
☆32Updated 2 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 3 years ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆82Updated 3 years ago
CarperAI / InstructGPT
For experiments involving instruct gpt. Currently used for documenting open research questions.
☆71Updated 2 years ago
zphang / minimal-gpt-neox-20b
☆130Updated 3 years ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
CarperAI / decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆25Updated 2 years ago
McGill-NLP / length-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆136Updated last year
Langboat / mengzi-retrieval-lm
An experimental implementation of the retrieval-enhanced language model
☆75Updated 2 years ago
zphang / minimal-opt
☆67Updated 2 years ago
huggingface / bloom-jax-inference
☆67Updated 3 years ago
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆199Updated 11 months ago
facebookresearch / NPM
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
☆157Updated 2 years ago
radi-cho / RSTOD
Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.
☆17Updated 2 years ago