sdmhans / arxiv_dataset_extraction
A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv
☆15Updated 4 years ago
Alternatives and similar repositories for arxiv_dataset_extraction:
Users that are interested in arxiv_dataset_extraction are comparing it to the libraries listed below
- Probing task; contextual embeddings -> textual definitions (EMNLP19)☆11Updated 3 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 4 years ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"☆18Updated 2 years ago
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- ☆15Updated 3 years ago
- ☆26Updated 5 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Updated 2 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- Do Neural Language Representations Learn Physical Commonsense?☆22Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆49Updated last year
- Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)☆31Updated last year
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Updated last year
- ☆13Updated last year
- ☆20Updated 3 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆20Updated last week
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Updated 3 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆13Updated 3 years ago
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Updated 4 months ago
- Codes for the WWW2021 paper: DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge (https://arxiv.org/abs/2101.0…☆43Updated 2 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated last year
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"☆41Updated 4 years ago
- Source Code for "Teaching Machine Comprehension with Compositional Explanations" (Findings of EMNLP 2020)☆11Updated 4 years ago
- ☆38Updated 4 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆30Updated 2 years ago
- Selections from ACL 2020☆8Updated 2 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Updated 2 years ago
- ☆49Updated last year