Ongoing research training transformer models at scale
☆38Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Aug 27, 2023Updated 2 years ago
- Bridging Large Language Models with Scala 3 Functions☆11Aug 31, 2024Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Sep 22, 2024Updated last year
- Smithy4s client directly using Fetch APIs, without bringing http4s/cats, to dramatically reduce bundle size☆13Jul 7, 2024Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Scala interfaces to huggingface transformers and tokenizers☆13Updated this week
- ☆868Dec 8, 2023Updated 2 years ago
- Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation☆73Sep 18, 2023Updated 2 years ago
- ☆17Apr 7, 2025Updated 11 months ago
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆15Jan 25, 2022Updated 4 years ago
- AI-based Trading assistant based on Binance data and machine learning algorithms.☆17Jan 29, 2024Updated 2 years ago
- ☆22Jun 15, 2023Updated 2 years ago
- A SQLite-backed event and work engine that stays consistent across retries, restarts, and failures.☆41Mar 23, 2026Updated last week
- A collection of interesting papers on Diffusion Models☆17Dec 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆104Dec 12, 2023Updated 2 years ago
- Query your typescript codebase using GPT☆11May 29, 2023Updated 2 years ago
- A code sample that shows how to use 🦜️🔗langchain, 🦙llama_index and a hosted LLM endpoint to do a standard chat or Q&A about a pdf doc…☆19Oct 24, 2023Updated 2 years ago
- ☆14Jan 10, 2025Updated last year
- The OpenAI Function Calling Toolkit is a powerful tool that simplifies and organizes the process of invoking OpenAI functions in your Nod…☆16Jun 29, 2023Updated 2 years ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆44Nov 19, 2025Updated 4 months ago
- An example of streaming ChatGPT via the OpenAI v4.0 node SDK.☆16Sep 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- RNN based design of a Controller for a Multi-Input Multi-Output (MIMO) System.☆21Oct 7, 2018Updated 7 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- A self-hosted, real-time web UI + db for exploring your OpenAI API requests / responses.☆18Apr 27, 2023Updated 2 years ago
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frame…☆19Jun 4, 2023Updated 2 years ago
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- This is the starter code for an example of storing a github repo in a vector store and chatting with it as a knowledge base☆17Jun 22, 2023Updated 2 years ago
- Photorealistic Minecraft-like game using NVIDIA RTX in Rust☆15May 1, 2021Updated 4 years ago
- ☆14Sep 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- mcp server for just☆47Dec 3, 2025Updated 4 months ago
- A library to simulate quantum computations☆12Dec 30, 2023Updated 2 years ago
- Resources backing the Feast fraud tutorial on GCP☆14May 31, 2022Updated 3 years ago
- A curated list of research papers, datasets, open-source codes, conferences, workshops related to AI for fashion and e-commerce.☆15Mar 30, 2020Updated 6 years ago
- ☆15Dec 4, 2024Updated last year
- ☆10Aug 7, 2023Updated 2 years ago
- About my PC setup and my scripts to automate workstation and server setup after a fresh OS install.☆16Dec 18, 2025Updated 3 months ago