Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- 5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model)☆13Oct 23, 2019Updated 6 years ago
- SAFE Drive: access SAFE Network using the file system of Windows, Mac OS and Linux☆14Dec 9, 2022Updated 3 years ago
- ☆14Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for paper: Weakly- and Semi-supervised Evidence Extraction☆15Apr 12, 2021Updated 5 years ago
- Protobuf messages in a bottle☆10Feb 14, 2025Updated last year
- Modular Monoliths in Elixir☆12Mar 17, 2026Updated 3 weeks ago
- Research code and scripts used in the paper Semantic Role Labeling as Syntactic Dependency Parsing.☆15Jun 12, 2023Updated 2 years ago
- Simple example of using binary websocket messages within Phoenix☆13Feb 26, 2021Updated 5 years ago
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities☆19May 27, 2025Updated 10 months ago
- Maximum mean discrepancy comparisons for single cell profiling experiments☆20Feb 9, 2022Updated 4 years ago
- ☆13Apr 11, 2022Updated 4 years ago
- Repository for the implementation and evaluation of DD-GloVe, a train-time debiasing algorithm to learn GloVe word embeddings by leveragi…☆13May 29, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Elixir-based Event Source server-side implementation using Phoenix Pubsub☆18Nov 25, 2020Updated 5 years ago
- Example microservice developed with Phoenix Framework☆13Mar 14, 2017Updated 9 years ago
- Authentication via Passkey (WebAuthn/FIDO2) for Gleam☆15Dec 10, 2024Updated last year
- A set of pre-trained machine-learning models that predict (im-)politeness scores in texts.☆19Jan 2, 2025Updated last year
- Testing the performance of CNN and BERT embeddings on GLUE tasks☆15Mar 24, 2023Updated 3 years ago
- 💡Light Bulb is a tool to help you label, train, test and deploy machine learning models without any coding.☆25Feb 15, 2023Updated 3 years ago
- Detecting bursty terms in computer science☆10Feb 2, 2022Updated 4 years ago
- Inference API server with echo and gRPC to triton server (golang)☆13Nov 16, 2022Updated 3 years ago
- Android Videokit - basic FFMPEG build for Android with x264 and libtheora support.☆22Jun 23, 2012Updated 13 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A proof of concept on creating & logging in with a passkey in ReactJS + Typescript.☆13Jan 25, 2023Updated 3 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- connect-go improved compression☆23Jan 22, 2026Updated 2 months ago
- Implementation of OpenPGP Message Format as desrcibed in RFC4880☆14Jul 30, 2025Updated 8 months ago
- VRAFT is a framework written in C++ that implements RAFT protocol and SEDA architecture. Based on VRAFT, distributed software can be deve…☆11Sep 24, 2024Updated last year
- Collection of brief notes from 592 lectures (started in 2014)☆13Aug 9, 2023Updated 2 years ago
- 《智能投顾》读书笔记☆12May 23, 2019Updated 6 years ago
- Product Quantization k-Nearest Neighbors☆21Jun 24, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Halide backend for ONNX☆12Nov 5, 2019Updated 6 years ago
- Most simplest Python solution for WebRTC streaming of a video file. Capable of doing play, pause & seeking operations☆15Dec 9, 2024Updated last year
- An Interactive Tool for Natural Language Processing on Clinical Text☆23Aug 20, 2021Updated 4 years ago
- Usage of Siamese Recurrent Neural network architectures for semantic textual similarity☆22Mar 5, 2019Updated 7 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- Conway's Game of Life using experimental Scala.js WebAssembly backend☆15Apr 24, 2025Updated 11 months ago
- Lists of VPN providers (automatically updated; maintainer: @janderedev)☆14Jan 29, 2023Updated 3 years ago