huggingface / transformers_bloom_parallelLinks

Techniques used to run BLOOM at inference in parallel

☆37

Alternatives and similar repositories for transformers_bloom_parallel

Users that are interested in transformers_bloom_parallel are comparing it to the libraries listed below

Sorting:

huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆178Updated 2 years ago
huggingface / bloom-jax-inference
☆66Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
shayne-longpre / a-pretrainers-guide
☆72Updated 2 years ago
leogao2 / lm_dataformat
☆78Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
Langboat / mengzi-retrieval-lm
An experimental implementation of the retrieval-enhanced language model
☆75Updated 2 years ago
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆209Updated last year
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
EleutherAI / stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
☆85Updated 2 years ago
sgugger / torchdynamo-tests
☆19Updated 3 years ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆108Updated 2 years ago
bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆127Updated last year
allenai / bff
☆38Updated last year
bigscience-workshop / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆105Updated 2 years ago
SeanNaren / min-LLM
Minimal code to train a Large Language Model (LLM).
☆172Updated 3 years ago
EleutherAI / openwebtext2
☆92Updated 3 years ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
zphang / minimal-opt
☆67Updated 3 years ago
google-research / t5x_retrieval
☆101Updated 2 years ago
google-research / longt5
☆184Updated 2 years ago
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆78Updated last year
Dahoas / reward-modeling
☆98Updated 2 years ago
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆189Updated 5 months ago
EleutherAI / semantic-memorization
☆44Updated last year
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated 5 months ago
google-research-datasets / presto
A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
☆115Updated 2 years ago
bigscience-workshop / data_tooling
Tools for managing datasets for governance and training.
☆87Updated 2 weeks ago
jaymody / speculative-sampling
Simple implementation of Speculative Sampling in NumPy for GPT-2.
☆98Updated 2 years ago