hpcaitech / Titans

A collection of models built with ColossalAI

☆32

Alternatives and similar repositories for Titans

Users who are interested in Titans are comparing it to the libraries listed below.

hpcaitech / PaLM-colossalai
Scalable PaLM implementation in PyTorch
☆190 Updated 2 years ago
hpcaitech / CachedEmbedding
A memory-efficient DLRM training solution using ColossalAI
☆105 Updated 2 years ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117 Updated 2 years ago
FreedomIntelligence / FastLLM
Fast LLM training codebase with dynamic strategy selection [DeepSpeed + Megatron + FlashAttention + CudaFusionKernel + Compiler]
☆40 Updated last year
bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆120 Updated last year
EleutherAI / stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
☆83 Updated last year
xlang-ai / batch-prompting
[EMNLP 2023 Industry Track] A simple prompting approach that enables LLMs to run inference in batches.
☆74 Updated last year
AI21Labs / Parallel-Context-Windows
☆104 Updated 2 years ago
THUDM / Multilingual-GLM
The multilingual variant of GLM, a general language model trained with an autoregressive blank-infilling objective
☆62 Updated 2 years ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆105 Updated 9 months ago
dropreg / efficient_alpaca
The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca
☆97 Updated 2 years ago
thunlp / Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
☆107 Updated 3 months ago
mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Hugging Face Transformers
☆14 Updated 2 years ago
liutiedong / goat
A fine-tuned LLaMA that is good at arithmetic tasks
☆178 Updated last year
THUDM / ChatGLM-Math
☆82 Updated last year
jina-ai / textbook
Distill ChatGPT's coding ability into a small model (1B)
☆30 Updated last year
Glaciohound / LM-Infinite
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆148 Updated 4 months ago
THU-KEG / ChatLog
⏳ ChatLog: Recording and Analysing ChatGPT Across Time
☆100 Updated last year
umass-ml4ed / mathGPT
A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.
☆40 Updated 2 years ago
onesuper / HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…
☆53 Updated 2 years ago
hpcaitech / ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
☆340 Updated 2 years ago
lxe / llama-tune
LLaMA tuning with the Stanford Alpaca dataset using DeepSpeed and Transformers
☆51 Updated 2 years ago
thomfoster / minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
☆85 Updated 2 years ago
Lightning-Universe / lightning-ColossalAI
Large-scale distributed model training strategy with Colossal-AI and Lightning AI
☆57 Updated last year
MayDomine / Burst-Attention
Distributed IO-aware Attention algorithm
☆20 Updated 10 months ago
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆43 Updated 9 months ago
kemingy / vllm-env
Set up the environment for vLLM users
☆16 Updated last year
huggingface / transformers_bloom_parallel
Techniques used to run BLOOM at inference in parallel
☆37 Updated 2 years ago
CodeGeeX / codegeex-fastertransformer
FasterTransformer for the CodeGeeX model
☆63 Updated 2 years ago
LAION-AI / blade2blade
Adversarial Training and SFT for Bot Safety Models
☆40 Updated 2 years ago