hpcaitech / GPT-Demo
GPT Demo with hybrid distributed training
☆10Updated 2 years ago
Alternatives and similar repositories for GPT-Demo:
Users that are interested in GPT-Demo are comparing it to the libraries listed below
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated this week
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆33Updated last year
- Self-Controlled Memory System for LLMs☆44Updated 9 months ago
- OpenLLMDE: An open source data engineering framework for LLMs☆17Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 3 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- ☆13Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated last year
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)☆9Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated 10 months ago
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- 中文原生等级化代码能力测试基准☆12Updated 9 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated 10 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- The official fork of THoR Chain-of-Thought framework, enhanced and adapted for Emotion Cause Analysis (ECAC-2024)☆10Updated 4 months ago
- distill chatGPT coding ability into small model (1b)☆26Updated last year
- ☆44Updated 7 months ago
- Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"☆13Updated 3 months ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Finetune CPM-1☆24Updated 3 years ago
- Transformers at any scale☆41Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆58Updated last year
- Scripts to parse arxiv documents for NLP tasks☆17Updated last year
- Downloads 2020 English Wikipedia articles as plaintext☆22Updated last year