β24Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for ColossalAI-Pytorch-lightning
Users that are interested in ColossalAI-Pytorch-lightning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train π€transformers with DeepSpeed: ZeRO-2, ZeRO-3β23May 20, 2021Updated 4 years ago
- Convenient Text-to-Text Training for Transformersβ19Dec 10, 2021Updated 4 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AIβ56Sep 1, 2023Updated 2 years ago
- Korean Named Entity Corpusβ25May 12, 2023Updated 2 years ago
- A collection of models built with ColossalAIβ32Nov 22, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- βοΈ Utilizing RBERT model structure for KLUE Relation Extraction taskβ15Nov 15, 2022Updated 3 years ago
- A Pytorch-Lightning Implementation of Transformer Networkβ11Oct 22, 2020Updated 5 years ago
- β39Mar 25, 2024Updated 2 years ago
- Convert pretrained RoBerta models to various long-document transformer modelsβ11Apr 5, 2022Updated 4 years ago
- GPT Demo with hybrid distributed trainingβ10Dec 1, 2022Updated 3 years ago
- Performance benchmarking with ColossalAIβ39Jul 6, 2022Updated 3 years ago
- A utility for storing and reading files for Korean LM training πΎβ35Oct 15, 2025Updated 5 months ago
- Elixir: Train a Large Language Model on a Small GPU Clusterβ15Jun 8, 2023Updated 2 years ago
- π€ μ΅μνμ μΈν μΌλ‘ LMμ νμ΅νκΈ° μν μνμ½λβ59May 23, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)β22Jun 16, 2021Updated 4 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)β13Jun 2, 2021Updated 4 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findingsβ15May 3, 2023Updated 2 years ago
- Simple Bert Implementation (TensorFlow 2.0)β13Aug 9, 2019Updated 6 years ago
- 'κ·Έλ΄ λ―ν' νκ΅μ΄ μ΄λ¦ λλ€ μμ±κΈ°β19Oct 21, 2019Updated 6 years ago
- Scalable PaLM implementation of PyTorchβ190Dec 19, 2022Updated 3 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetesβ20May 30, 2023Updated 2 years ago
- Korean Commonsense Knowledge Graphβ15Dec 23, 2022Updated 3 years ago
- ELECTRAκΈ°λ° νκ΅μ΄ λν체 μΈμ΄λͺ¨λΈβ53Aug 4, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- KoGPT2 on Huggingface Transformersβ33May 4, 2021Updated 4 years ago
- μ¬μ μμ λν μλ¬Έλ§ μΆμΆν λ°μ΄ν°β16Apr 24, 2023Updated 2 years ago
- High Performance Grouped GEMM in PyTorchβ30May 10, 2022Updated 3 years ago
- β39Jul 25, 2024Updated last year
- Code for the paper "A Structural Model for Contextual Code Changes"β32Oct 25, 2023Updated 2 years ago
- bpe based korean t5 model for text-to-text unified frameworkβ63Apr 17, 2024Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasetsβ130Nov 12, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?β18Jan 31, 2025Updated last year
- δΈθ―εθε€θ―ζ¬ηΏ»θ―转述θ―ζγθ―ζδ» ιδΊη¨δΊη§η ζε¦ζ΄»ε¨γζζ¬θδ½ζε½εθθ γβ11Jul 26, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- kogptλ₯Ό osloλ‘ νμΈνλνλ μμ .β23Aug 26, 2022Updated 3 years ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)β25Apr 11, 2022Updated 3 years ago
- Deploy KoGPT with Triton Inference Serverβ14Nov 18, 2022Updated 3 years ago
- This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EMβ¦β16Dec 9, 2022Updated 3 years ago
- Depict GPU memory footprint during DNN training of PyTorchβ11Nov 17, 2022Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β47Jul 25, 2023Updated 2 years ago
- I hope to this list will contribute good influence in Korean online services.β63Feb 10, 2019Updated 7 years ago