SeanNaren / min-LLM
Minimal code to train a Large Language Model (LLM).
☆170 · Jul 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for min-LLM
Users that are interested in min-LLM are comparing it to the libraries listed below
- Triton Server Component for lightning.ai ☆14 · Feb 15, 2023 · Updated 2 years ago
- Simple repository contribution statistics ☆15 · Jan 20, 2026 · Updated 3 weeks ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- Smoothly deprecate and redirect Python functions/classes with smart warnings and auto-routing—keep your codebase clean while maintaining … ☆53 · Updated this week
- Plug-and-Play Document Modules for Pre-trained Models ☆25 · May 28, 2023 · Updated 2 years ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆113 · May 11, 2023 · Updated 2 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes ☆12 · Jun 24, 2021 · Updated 4 years ago
- ☆15 · Aug 3, 2021 · Updated 4 years ago
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 2 years ago
- ☆10 · May 22, 2023 · Updated 2 years ago
- Code for embedding and retrieval research. ☆16 · Oct 24, 2023 · Updated 2 years ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆41 · Sep 29, 2024 · Updated last year
- A sample pattern for running CI tests on Modal ☆19 · Apr 12, 2025 · Updated 10 months ago
- NSMC, KorSTS ... fine-tunings ☆18 · Feb 23, 2022 · Updated 3 years ago
- ☆18 · Mar 10, 2023 · Updated 2 years ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning). ☆17 · Jul 27, 2023 · Updated 2 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023) ☆17 · Jan 23, 2024 · Updated 2 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction" ☆10 · Oct 20, 2022 · Updated 3 years ago
- starting point for Kaggle attempt :] ☆18 · Jan 20, 2026 · Updated 3 weeks ago
- Blue House (Cheong Wa Dae) national petition data archive ☆15 · Aug 29, 2020 · Updated 5 years ago
- Convenient Text-to-Text Training for Transformers ☆19 · Dec 10, 2021 · Updated 4 years ago
- Placeholder for the opensource Grid AI components ☆45 · Jun 6, 2022 · Updated 3 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3 ☆18 · Jul 8, 2021 · Updated 4 years ago
- ↔️ T5 Machine Translation from English to Korean ☆18 · Aug 11, 2022 · Updated 3 years ago
- Minimalistic large language model 3D-parallelism training ☆2,544 · Dec 11, 2025 · Updated 2 months ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments". ☆20 · Oct 17, 2023 · Updated 2 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection ☆34 · Dec 16, 2021 · Updated 4 years ago
- ☆20 · Nov 23, 2022 · Updated 3 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI ☆56 · Sep 1, 2023 · Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment ☆791 · Apr 24, 2023 · Updated 2 years ago
- OSLO: Open Source framework for Large-scale model Optimization ☆309 · Aug 25, 2022 · Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆828 · Nov 9, 2022 · Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆35 · Oct 15, 2025 · Updated 3 months ago
- The data and code for EmailSum ☆63 · Aug 1, 2021 · Updated 4 years ago
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning. ☆16 · May 30, 2025 · Updated 8 months ago
- Accelerate PyTorch models with ONNX Runtime ☆368 · Feb 5, 2026 · Updated last week
- ☆75 · Sep 1, 2022 · Updated 3 years ago
- Google's Conceptual Captions Dataset translated into Korean ☆23 · Aug 28, 2022 · Updated 3 years ago