☆24 · Nov 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for ColossalAI-Pytorch-lightning
Users that are interested in ColossalAI-Pytorch-lightning are comparing it to the libraries listed below.
- Train 🤗 transformers with DeepSpeed: ZeRO-2, ZeRO-3 ☆23 · May 20, 2021 · Updated 5 years ago
- Convenient Text-to-Text Training for Transformers ☆18 · Dec 10, 2021 · Updated 4 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI ☆56 · Jun 20, 2026 · Updated last week
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 3 years ago
- A collection of models built with ColossalAI ☆33 · Nov 22, 2022 · Updated 3 years ago
- Utilizing RBERT model structure for KLUE Relation Extraction task ☆15 · Nov 15, 2022 · Updated 3 years ago
- A Pytorch-Lightning Implementation of Transformer Network ☆11 · Oct 22, 2020 · Updated 5 years ago
- ☆39 · Mar 25, 2024 · Updated 2 years ago
- Convert pretrained RoBerta models to various long-document transformer models ☆11 · Apr 5, 2022 · Updated 4 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation ☆11 · May 27, 2022 · Updated 4 years ago
- Performance benchmarking with ColossalAI ☆39 · Jul 6, 2022 · Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆35 · Oct 15, 2025 · Updated 8 months ago
- A deep learning library implemented in NumPy. (Supports automatic differentiation) ☆15 · May 4, 2021 · Updated 5 years ago
- 🤗 Sample code for training an LM with minimal setup ☆59 · May 23, 2023 · Updated 3 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context) ☆22 · Jun 16, 2021 · Updated 5 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021) ☆13 · Jun 2, 2021 · Updated 5 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆15 · May 3, 2023 · Updated 3 years ago
- Simple Bert Implementation (TensorFlow 2.0) ☆13 · Aug 9, 2019 · Updated 6 years ago
- Scalable PaLM implementation of PyTorch ☆190 · Dec 19, 2022 · Updated 3 years ago
- annotated-transformer-kr ☆15 · May 16, 2019 · Updated 7 years ago
- ELECTRA-based Korean conversational language model ☆53 · Aug 4, 2021 · Updated 4 years ago
- KoGPT2 on Huggingface Transformers ☆33 · May 4, 2021 · Updated 5 years ago
- Data consisting only of dialogue example sentences extracted from a dictionary ☆16 · Apr 24, 2023 · Updated 3 years ago
- High Performance Grouped GEMM in PyTorch ☆30 · May 10, 2022 · Updated 4 years ago
- ☆39 · Jul 25, 2024 · Updated last year
- Sky Computing: Accelerating Geo-distributed Computing in Federated Learning ☆90 · Nov 22, 2022 · Updated 3 years ago
- Code for the paper "A Structural Model for Contextual Code Changes" ☆32 · Oct 25, 2023 · Updated 2 years ago
- bpe based korean t5 model for text-to-text unified framework ☆63 · Apr 17, 2024 · Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets ☆130 · Nov 12, 2022 · Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ? ☆19 · Jan 31, 2025 · Updated last year
- Example of fine-tuning kogpt with oslo. ☆23 · Aug 26, 2022 · Updated 3 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC) ☆25 · Apr 11, 2022 · Updated 4 years ago
- Deploy KoGPT with Triton Inference Server ☆14 · Nov 18, 2022 · Updated 3 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM… ☆16 · Dec 9, 2022 · Updated 3 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆48 · Jul 25, 2023 · Updated 2 years ago
- Training code repo of the paper "DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning" ☆11 · May 18, 2021 · Updated 5 years ago
- Extracts an error-correction corpus from Wikipedia edit history data. ☆12 · Apr 6, 2022 · Updated 4 years ago
- A category classification model based on natural language processing of product names, using transformer blocks ☆10 · Dec 5, 2022 · Updated 3 years ago