Train large COMET models (T5-3B/GPT2-XL) with limited GPU memory (on 11GB GPUs like the 1080/2080) using DeepSpeed.
☆14 Jan 23, 2022 Updated 4 years ago
Alternatives and similar repositories for comet-deepspeed
Users that are interested in comet-deepspeed are comparing it to the libraries listed below
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere… ☆11 Oct 16, 2022 Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula… ☆11 Oct 18, 2022 Updated 3 years ago
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction ☆11 Oct 19, 2020 Updated 5 years ago
- Source code for the paper 'Complex Hyperbolic Knowledge Graph Embeddings with Fast Fourier Transform'. ☆12 Nov 9, 2022 Updated 3 years ago
- SP-10K is a large-scale human-annotated selectional preference set. Five selectional preference relations are included. ☆12 May 6, 2020 Updated 5 years ago
- Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching" ☆23 Nov 13, 2022 Updated 3 years ago
- Code and data for the paper Acquiring and Modelling Abstract Commonsense Knowledge via Conceptualization ☆23 Nov 21, 2022 Updated 3 years ago
- A fast and neat API for Conceptualization of Probase ☆17 Oct 28, 2019 Updated 6 years ago
- WinoWhy provides human-annotated reasons for answering WSC questions. ☆18 May 13, 2020 Updated 5 years ago
- Code for the ACL2023 paper: CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning (https://aclant… ☆11 May 9, 2023 Updated 2 years ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap… ☆13 Oct 30, 2024 Updated last year
- Data on verb transitivity in English and script to extract transitivity information from Google's syntactic ngrams corpus ☆11 Oct 1, 2018 Updated 7 years ago
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec… ☆30 Nov 14, 2023 Updated 2 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023) ☆11 Feb 15, 2024 Updated 2 years ago
- ☆14 Aug 10, 2023 Updated 2 years ago
- Source Code for ICML 2022 paper "Boosting Graph Structure Learning with Dummy Nodes" ☆20 Apr 24, 2023 Updated 2 years ago
- Dataset & Code for Com2Sense Benchmark ☆13 Sep 8, 2021 Updated 4 years ago
- IJCNN 2021: Inductive Learning on Commonsense Knowledge Graph Completion (Deprecated) ☆15 Nov 13, 2023 Updated 2 years ago
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021) ☆20 May 3, 2023 Updated 2 years ago
- A web application for playing 20 Questions to crowdsource common sense. 🤖 ☆16 Sep 29, 2022 Updated 3 years ago
- Official implementation of EMNLP 2022 paper "Generate, Discriminate, and Contrast: A Semi-Supervised Sentence Representation Learning Fram… ☆23 Nov 27, 2022 Updated 3 years ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects" ☆20 May 2, 2025 Updated 10 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 Aug 18, 2024 Updated last year
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP … ☆31 Jan 6, 2023 Updated 3 years ago
- A Python Commonsense Knowledge Inference Toolkit ☆63 Dec 13, 2023 Updated 2 years ago
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021) ☆30 Feb 19, 2021 Updated 5 years ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering ☆28 Jan 24, 2026 Updated last month
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆34 Nov 21, 2021 Updated 4 years ago
- Codes and Datasets for the ACL2023 Findings Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsen… ☆39 Mar 3, 2025 Updated last year
- Arabic News Stance Corpus ☆11 Feb 5, 2021 Updated 5 years ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 May 30, 2025 Updated 9 months ago
- Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk" ☆79 Jul 19, 2021 Updated 4 years ago
- ☆75 Jul 2, 2021 Updated 4 years ago
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation ☆93 Dec 18, 2023 Updated 2 years ago
- [ICLR 2022 spotlight] GreaseLM: Graph REASoning Enhanced Language Models for Question Answering ☆240 Apr 23, 2025 Updated 10 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder ☆10 Mar 16, 2023 Updated 2 years ago
- ☆10 May 1, 2025 Updated 10 months ago
- Longformer Encoder Decoder model for the legal domain, trained for long document abstractive summarization task. ☆10 Feb 26, 2021 Updated 5 years ago
- A framework for evaluating Machine Translation models. ☆12 May 26, 2025 Updated 9 months ago