Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below
Sorting:
- SAFE Drive: access SAFE Network using the file system of Windows, Mac OS and Linux☆14Dec 9, 2022Updated 3 years ago
- A network service that allows credit card payment for Sia storage.☆12May 11, 2025Updated 9 months ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- Compile-time string encryption and import obfuscation for Windows PE32(+) binaries☆16Jan 18, 2026Updated last month
- ☆17Sep 10, 2025Updated 5 months ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Different bangla datasets for sentiment analysis on bangla text☆10Nov 26, 2022Updated 3 years ago
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- ChatGPT connected to the web to have no more restrictions and be able to summarize the latest informations after 2021☆10Mar 3, 2023Updated 3 years ago
- This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Mo…☆12Jul 6, 2024Updated last year
- Machine learning algorithms implements with jax for machine learning in production in large scale dataset.☆14Updated this week
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- 《智能投顾》读书笔记☆12May 23, 2019Updated 6 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.☆11Updated this week
- ☆12Aug 30, 2022Updated 3 years ago
- Lightweight static website generator: low-ceremony generic file processor with proven javascript tools.☆13Feb 14, 2026Updated 2 weeks ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- 🧬 an evolving design philosophy (masquerading as a color scheme)☆11Dec 8, 2025Updated 2 months ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities.☆15May 12, 2022Updated 3 years ago
- Fake NEWS detector using LIAR dataset.☆11Aug 19, 2019Updated 6 years ago
- Collection of iPython notebooks with some quick demos☆11May 25, 2017Updated 8 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Code and data for the Walert large language model-based chatbot☆12Aug 14, 2025Updated 6 months ago
- Containerfile for the Vanilla OS Desktop+Nvidia image.☆16Feb 5, 2026Updated last month
- Rhyme with AI☆45Jun 18, 2020Updated 5 years ago
- 💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.☆43Mar 4, 2023Updated 3 years ago
- A UI designer for constructing AI applications with OpenSearch☆16Feb 26, 2026Updated last week
- Tarjan's implementation of the Chu-Liu-Edmonds algorithm for finding min/max spanning trees of dense graphs.☆11Apr 19, 2015Updated 10 years ago
- ☆13May 26, 2021Updated 4 years ago
- Via Text Density Simple Web Crawler With Go☆13Mar 19, 2023Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- parse_mediawiki_dump clone☆11Mar 22, 2025Updated 11 months ago
- ☆14May 6, 2018Updated 7 years ago
- Building applications with DeepSeek R1 model☆12Feb 15, 2025Updated last year
- Simulated user for TREC 2016-2017 Dynamic Domain track☆10Dec 27, 2017Updated 8 years ago
- Fair Benchmarks☆10Mar 14, 2019Updated 6 years ago
- prevent XSS attacks by sanitizing html (this is different then escaping!)☆22Oct 14, 2023Updated 2 years ago
- ☆10Jul 11, 2023Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago