staeiou / arxiv_archive
A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019
☆28Updated 5 years ago
Alternatives and similar repositories for arxiv_archive:
Users that are interested in arxiv_archive are comparing it to the libraries listed below
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- ☆12Updated 4 years ago
- ☆54Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 2 months ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences☆21Updated 4 years ago
- Sentence transformers models for SpaCy☆107Updated last year
- This directory gathers the tools developed by the Data Sourcing Working Group☆31Updated 3 years ago
- A corpus of comments tagged for multiple attributes of unhealthiness.☆34Updated 3 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Analysis of gutenberg dataset☆43Updated 6 years ago
- The ntentional blog - a machine learning journey☆23Updated 2 years ago
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.☆21Updated 4 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated last year
- A thin wrapper around the DBpedia Spotlight HTTP API☆25Updated 7 years ago
- sequence tagging with spaCy and crfsuite☆19Updated last year
- jiant-dev☆28Updated 4 years ago
- This is the second part of the Deep Learning Course for the Master in High-Performance Computing (SISSA/ICTP).)☆33Updated 4 years ago
- ☆17Updated last year
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)☆56Updated 6 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- A toolkit for social media information extraction using multi-task learning and active learning☆19Updated 2 years ago