hpcaitech / GPT-Demo

GPT Demo with hybrid distributed training

☆10

Alternatives and similar repositories for GPT-Demo

Users that are interested in GPT-Demo are comparing it to the libraries listed below

Sorting:

kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 3 weeks ago
Agora-Lab-AI / Orca
An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"
☆43Updated 7 months ago
apple / ml-toad
☆14Updated 8 months ago
aiwaves-cn / Dive-into-LLMs
The official github repo for the open online courses: "Dive into LLMs".
☆10Updated last year
choosewhatulike / case2code
☆15Updated last month
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆20Updated last year
sunyt32 / torchscale
Transformers at any scale
☆41Updated last year
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆34Updated last year
psunlpgroup / ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆29Updated 9 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated last week
allenai / sso
Repository for Skill Set Optimization
☆12Updated 9 months ago
Tencent / Tencent-Hunyuan-7B
☆18Updated 3 months ago
thunlp / APB
☆27Updated 2 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated last year
mrcabbage972 / simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers
☆14Updated 2 years ago
cxa-unique / Simplified-TinyBERT
ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval
☆17Updated 4 years ago
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆23Updated 3 weeks ago
camenduru / UniControl-colab
☆13Updated last year
microsoft / ARXGEN
Scripts to parse arxiv documents for NLP tasks
☆18Updated last year
modelscope / mcp-central
Collection of model-centric MCP servers
☆14Updated last week
NL2Code / CodeM
☆44Updated 11 months ago
bigai-nlco / CREAM
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆17Updated 7 months ago
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated 11 months ago
OpenLLMAI / OpenLLMDE
OpenLLMDE: An open source data engineering framework for LLMs
☆17Updated last year
ctlllll / reward_collapse
☆27Updated last year
likenneth / persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
☆17Updated last year
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated 9 months ago
haotian-liu / transformers_llava
☆13Updated 2 years ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆20Updated 2 weeks ago