soketlabs / coomLinks
A training framework for large-scale language models based on Megatron-Core, the COOM Training Framework is designed to efficiently handle extensive model training inspired by Deepseek's HAI-LLM optimizations.
☆21Updated last month
Alternatives and similar repositories for coom
Users that are interested in coom are comparing it to the libraries listed below
Sorting:
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆191Updated 3 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆331Updated last week
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 5 months ago
- ☆155Updated 9 months ago
- everything i know about cuda and triton☆13Updated 7 months ago
- rl from zero pretrain, can it be done? yes.☆261Updated 2 weeks ago
- ☆28Updated 10 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆32Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆124Updated 4 months ago
- ☆222Updated 2 months ago
- ☆46Updated 5 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆67Updated 3 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆111Updated 10 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆227Updated 8 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆651Updated this week
- ☆214Updated 6 months ago
- Notes from the Latent Space paper club. Follow along or start your own!☆238Updated last year
- ☆68Updated 3 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in pytorch☆30Updated 2 months ago
- ☆98Updated 3 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research☆290Updated last week
- An extension of the nanoGPT repository for training small MOE models.☆181Updated 5 months ago
- ☆92Updated 10 months ago
- Simple UI for debugging correlations of text embeddings☆290Updated 3 months ago
- ☆44Updated 3 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆813Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated 11 months ago
- Simple Transformer in Jax☆140Updated last year