vespa-engine / cord-19
Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.
☆37Updated last week
Alternatives and similar repositories for cord-19:
Users that are interested in cord-19 are comparing it to the libraries listed below
- ☆30Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆33Updated 11 months ago
- Easily identify and label sentence intervals using various taggers.☆16Updated 8 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 3 years ago
- ☆70Updated 2 years ago
- ☆26Updated 6 years ago
- Disambiguating biomedical and clinical concepts with word embeddings☆14Updated 6 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Various Jupyter notebooks about Common Crawl data☆51Updated 3 weeks ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- Open Access PDF harvester☆39Updated 10 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- [archived]☆18Updated 3 years ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- ☆9Updated 6 years ago
- The ntentional blog - a machine learning journey☆23Updated 2 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆16Updated 2 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 2 months ago
- A python library to generate highly realistic typos (fuzz-testing)☆11Updated 6 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Universal Dependencies (v1.0) for the GENIA 1.0 Treebank, along with additional raw abstracts and metadata.☆22Updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 5 months ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- A web application tagging and retrieval of arguments in text☆29Updated last year