Minimal code to train a Large Language Model (LLM).
☆174Jul 22, 2022Updated 3 years ago
Alternatives and similar repositories for min-LLM
Users that are interested in min-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton Server Component for lightning.ai☆14Feb 15, 2023Updated 3 years ago
- Simple repository contribution statistics☆15Jun 2, 2026Updated 3 weeks ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Smoothly deprecate and redirect Python functions/classes with smart warnings and auto-routing—keep your codebase clean while maintaining …☆57Updated this week
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 4 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆112May 11, 2023Updated 3 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- Korean Named Entity Corpus☆25May 12, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Placeholder for the opensource Grid AI components☆44Mar 27, 2026Updated 3 months ago
- starting point for Kaggle attempt :]☆18May 14, 2026Updated last month
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 5 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- ☆18Mar 10, 2023Updated 3 years ago
- NSMC, KorSTS ... fine-tunings☆18Feb 23, 2022Updated 4 years ago
- ☆20Nov 23, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Slicing a PyTorch Tensor Into Parallel Shards☆300Jun 7, 2025Updated last year
- Accelerate PyTorch models with ONNX Runtime☆368Feb 5, 2026Updated 4 months ago