sdmhans / arxiv_dataset_extraction

A simple script for extracting plain text from the arXiv dataset: https://www.kaggle.com/Cornell-University/arxiv

☆15

Alternatives and similar repositories for arxiv_dataset_extraction:

Users who are interested in arxiv_dataset_extraction are comparing it to the libraries listed below.

MiuLab / GenDef
Probing task; contextual embeddings -> textual definitions (EMNLP19)
☆11 Updated 3 years ago
MurtyShikhar / ExpBERT
Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"
☆29 Updated 4 years ago
CLAW-Lab / ToM
Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"
☆18 Updated 2 years ago
allenai / unifew
Unifew: Unified Fewshot Learning Model
☆18 Updated 3 years ago
wellecks / symbolic_generalization
Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)
☆14 Updated 3 years ago
nuaa-nlp / Multimodality
☆15 Updated 3 years ago
UKPLab / refresh2018-predicting-trends-from-arxiv
☆26 Updated 5 years ago
Yifan-Gao / open_retrieval_conversational_machine_reading
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13 Updated 2 years ago
ruiqi-zhong / Meta-tuning
EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
☆50 Updated 3 years ago
mbforbes / physical-commonsense
Do Neural Language Representations Learn Physical Commonsense?
☆22 Updated 3 years ago
sunlab-osu / ReasonBERT
Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021
☆29 Updated 2 years ago
alontalmor / LeapOfThought
☆49 Updated last year
yuchenlin / OpenCSR
Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)
☆31 Updated last year
allenai / faithful-nmn
Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks
☆13 Updated last year
GChrysostomou / ood_faith
☆13 Updated last year
JamesHujy / ELV
☆20 Updated 3 years ago
pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆20 Updated last week
facebookresearch / UNLU
Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)
☆36 Updated 3 years ago
JunjieHu / amber
Explicit Alignment Objectives for Multilingual Bidirectional Encoders
☆13 Updated 3 years ago
keyurfaldu / AIgrads
This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…
☆25 Updated 4 months ago
HKUST-KnowComp / DISCOS-commonsense
Codes for the WWW2021 paper: DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge (https://arxiv.org/abs/2101.0…
☆43 Updated 2 years ago
ShaojieJiang / tldr
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10 Updated last year
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆27 Updated last year
yandex-research / graph-glove
PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"
☆41 Updated 4 years ago
INK-USC / mrc-explanation
Source Code for "Teaching Machine Comprehension with Compositional Explanations" (Findings of EMNLP 2020)
☆11 Updated 4 years ago
castorini / transformers-arithmetic
☆38 Updated 4 years ago
Alibaba-NLP / MuVER
[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
☆30 Updated 2 years ago
juand-r / ACL-2020
Selections from ACL 2020
☆8 Updated 2 years ago
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14 Updated 2 years ago
qkaren / unsup_gen_for_cms_reasoning
☆49 Updated last year