A text generation method that returns a generator, streaming out each token in real time during inference; built on Hugging Face Transformers.
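The idea can be sketched in plain Python: instead of waiting for the full sequence, the generation loop `yield`s each token as soon as it is sampled. This is a conceptual sketch only, not the library's actual API; `stream_generate` and `toy_step` are hypothetical stand-ins for a real model's forward pass.

```python
def stream_generate(model_step, prompt_ids, max_new_tokens=8, eos_id=0):
    """Yield one token id at a time instead of returning the full sequence.

    model_step is a stand-in for a single forward pass that returns the
    next token id given the ids generated so far.
    """
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        next_id = model_step(ids)   # one forward pass -> next token id
        if next_id == eos_id:       # stop at end-of-sequence
            break
        ids.append(next_id)
        yield next_id               # caller sees the token immediately

# Toy "model": emits increasing ids, then EOS once it reaches 5.
def toy_step(ids):
    return ids[-1] + 1 if ids[-1] < 5 else 0

tokens = list(stream_generate(toy_step, [1]))
print(tokens)  # [2, 3, 4, 5]
```

In the real library the same pattern sits on top of a Transformers model's sampling loop, so a chat UI can render tokens as they arrive rather than after generation completes.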
☆96 · Updated Mar 11, 2024
Alternatives and similar repositories for transformers-stream-generator
Users interested in transformers-stream-generator are comparing it to the libraries listed below.
- ☆13 · Updated Aug 23, 2024
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet… ☆18 · Updated Aug 28, 2024
- AIGC evals ☆10 · Updated Dec 2, 2023
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation ☆16 · Updated Oct 14, 2022
- Accelerate generating vectors by using ONNX models ☆18 · Updated Jan 23, 2024
- A library for data streaming and augmentation ☆21 · Updated May 5, 2025
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆40 · Updated Nov 11, 2024
- ☆25 · Updated Jul 12, 2017
- ⚡ Boost inference speed of GPT models in Transformers with ONNX Runtime ☆52 · Updated Aug 20, 2023
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md ☆25 · Updated Dec 22, 2022
- ☆90 · Updated Jul 4, 2024
- A WeChat auto-chat bot driven by mouse and keyboard automation ☆13 · Updated Nov 26, 2024
- ☆65 · Updated Apr 27, 2024
- Transformer-related optimization, including BERT and GPT ☆6,397 · Updated Mar 27, 2024
- A terminal dashboard for Pipecat ☆41 · Updated this week
- ☆12 · Updated Apr 24, 2024
- A CLI tool to convert JSON Resume schema to RenderCV schema ☆20 · Updated Mar 11, 2025
- Minimal code implementing Agent skills dispatch, with support for per-user personalized profiles ☆30 · Updated this week
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs ☆24 · Updated Sep 21, 2025
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification ☆28 · Updated Apr 15, 2025
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆141 · Updated Dec 6, 2024
- Just a bunch of benchmark logs for different LLMs ☆119 · Updated Jul 28, 2024
- Large language model fine-tuning for BLOOM, OPT, GPT, GPT-2, LLaMA, LLaMA-2, CPM-Ant, and more ☆100 · Updated Apr 24, 2024
- Chinese CLIP models with SOTA performance ☆60 · Updated Aug 28, 2023
- A tiny server to run local inference on MLX models in the style of OpenAI ☆13 · Updated Jan 31, 2024
- A pure-C++ cross-platform LLM acceleration library with Python bindings, supporting ChatGLM-6B, LLaMA, Baichuan, and MOSS base models on x86 / ARM ☆12 · Updated Jan 30, 2026
- Accelerating GOT-OCRv2 with vLLM ☆11 · Updated Nov 15, 2024
- [NeurIPS 2025] Official source code for the paper "L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models" ☆24 · Updated Oct 29, 2025
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm ☆5,034 · Updated Apr 11, 2025
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed ☆2,101 · Updated Jun 30, 2025
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding ☆1,322 · Updated Mar 6, 2025
- ☆15 · Updated Jun 12, 2023
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆11 · Updated Sep 4, 2025
- Cairo graphics library in Xcode for iOS ☆13 · Updated Sep 30, 2016
- ☆16 · Updated Mar 6, 2026
- Backup your Docker volumes ☆17 · Updated May 16, 2023
- Reverse-engineered ChatGPT API ☆10 · Updated Feb 14, 2023
- Train your own object detection model with ATSS! Super-detailed tutorial, with a downloadable PDF guide ☆10 · Updated Jul 28, 2020
- Gated pretrained Transformer model for robust denoised sequence-to-sequence modelling ☆10 · Updated May 29, 2021