vespa-engine / cord-19Links
Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.
☆38Updated this week
Alternatives and similar repositories for cord-19
Users that are interested in cord-19 are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- List of online / computer-based annotation tools☆18Updated 8 years ago
- [archived]☆18Updated 3 years ago
- Various Jupyter notebooks about Common Crawl data☆54Updated 2 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated 10 months ago
- Easily identify and label sentence intervals using various taggers.☆16Updated 8 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆85Updated 4 years ago
- Automatically exported from code.google.com/p/wiki-links☆42Updated 9 years ago
- ☆26Updated 6 years ago
- Dataset and code for three Web crawling-related papers from SIGIR-2019, NeurIPS-2019. and ICML-2020.☆40Updated 5 months ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 5 months ago
- Topic Inference with Zeroshot models☆61Updated last year
- A web application tagging and retrieval of arguments in text☆29Updated 2 years ago
- ☆55Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Disambiguating biomedical and clinical concepts with word embeddings☆14Updated 7 years ago
- ☆30Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆91Updated 3 years ago
- Vespa application making an index of the CORD-19 dataset.☆39Updated 4 months ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- spaCy entry points for Curated Transformers☆31Updated last week
- A browser extension providing Open Access bibliographical services☆17Updated 2 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- No Teacher BART distillation experiment for NLI tasks☆27Updated 4 years ago
- Open Access PDF harvester☆40Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- Expose a Top2Vec model with a REST API.☆90Updated 2 years ago
- ☆70Updated 2 years ago