Ongoing research training transformer language models at scale, including: BERT
☆16Apr 25, 2019Updated 7 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"☆44Sep 3, 2019Updated 6 years ago
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.☆10Aug 23, 2021Updated 4 years ago
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago
- ☆14Jul 10, 2021Updated 4 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code and data for SCED sentence cloze dataset☆12Dec 8, 2022Updated 3 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆142May 6, 2026Updated 2 weeks ago
- ☆14Jun 18, 2023Updated 2 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Oct 29, 2019Updated 6 years ago
- ☆10Jan 5, 2015Updated 11 years ago
- Pointer Networks Implementation in Keras☆11Aug 17, 2017Updated 8 years ago
- Taranis NG is an OSINT gathering and analysis tool for CSIRT teams and organisations. It allows team-to-team collaboration, and contains …☆10Oct 17, 2023Updated 2 years ago
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated 2 years ago
- upload a manim script and generate an animation☆11Mar 10, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Distributed game of life (inspired by Torben Hoffman's solution found at https://github.com/lehoff/egol)☆11Aug 13, 2015Updated 10 years ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data☆14Apr 24, 2022Updated 4 years ago
- A curated list of GPT agents for cybersecurity☆12Oct 2, 2024Updated last year
- Implementation of "Adversarial Text Generation without Reinforcement Learning"☆12Mar 13, 2019Updated 7 years ago
- A collection of handy tools such as adding Key & BPM to your music library☆16Mar 8, 2023Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85May 10, 2022Updated 4 years ago
- A Quick Thought implemented by pytorch.☆13May 19, 2019Updated 7 years ago
- The Shape of Data: Intrinsic Distance for Comparing Data Distributions☆12Sep 25, 2019Updated 6 years ago
- ☆15Mar 12, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Apr 5, 2025Updated last year
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Sep 25, 2018Updated 7 years ago
- open book question answering☆15Dec 8, 2022Updated 3 years ago
- A project to translate the Voynich Manuscript into English☆11Jun 30, 2023Updated 2 years ago
- promptflowx is a simple and powerful tool for building prompt-driven workflows.☆14Jul 31, 2024Updated last year
- Dump Linux keyrings☆24Jul 15, 2024Updated last year
- ☆11Oct 19, 2023Updated 2 years ago
- yyuu.github.io☆15Mar 3, 2016Updated 10 years ago
- Experiments for recognising textual entailment☆14Oct 12, 2012Updated 13 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Jul 8, 2024Updated last year
- ☆13Mar 13, 2025Updated last year
- A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning☆89Nov 19, 2019Updated 6 years ago
- JavaScript JSON schema validation with TypeScript type inference.☆14Jul 28, 2024Updated last year
- Making Lattice SensAI work properly on tinyVision products☆12Nov 22, 2022Updated 3 years ago
- Using a reasoning LLM to learn a prompt from data☆25May 5, 2025Updated last year
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago