vespa-engine / cord-19
Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.
☆38Updated last week
Alternatives and similar repositories for cord-19:
Users that are interested in cord-19 are comparing it to the libraries listed below
- [archived]☆18Updated 3 years ago
- Open Access PDF harvester☆40Updated last year
- Burglary prediction for mortals☆10Updated 11 months ago
- Various Jupyter notebooks about Common Crawl data☆52Updated last month
- Vespa application making an index of the CORD-19 dataset.☆39Updated 3 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- ☆30Updated 2 years ago
- ☆14Updated 5 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆91Updated 3 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆60Updated last year
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆13Updated 2 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 4 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 2 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- A simple semantic search engine for scientific papers.☆28Updated last year
- An Interactive Tool for Natural Language Processing on Clinical Text☆22Updated 3 years ago
- ☆26Updated 6 years ago
- Automatically exported from code.google.com/p/nyt-salience☆22Updated 9 years ago
- Dataset and code for three Web crawling-related papers from SIGIR-2019, NeurIPS-2019. and ICML-2020.☆40Updated 3 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated this week
- A large scale feature extraction tool for text-based machine learning☆32Updated 2 years ago
- List of online / computer-based annotation tools☆18Updated 8 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- A convolutional neural network model for relation extraction.☆13Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 3 years ago