vespa-engine / cord-19
Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.
☆37Updated 2 weeks ago
Related projects: ⓘ
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated last year
- Various Jupyter notebooks about Common Crawl data☆44Updated 2 years ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆30Updated 5 months ago
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆12Updated 5 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- ☆29Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆87Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- Finds linguistic patterns effortlessly☆31Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆36Updated 5 years ago
- Vespa application making an index of the CORD-19 dataset.☆39Updated 2 weeks ago
- [archived]☆18Updated 3 years ago
- Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning☆13Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated this week
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆61Updated 6 months ago
- An Interactive Tool for Natural Language Processing on Clinical Text☆22Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆54Updated 4 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 7 months ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆27Updated 3 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆57Updated 4 months ago
- No Teacher BART distillation experiment for NLI tasks☆25Updated 4 years ago
- COVID-19 Open Research Dataset (CORD-19) Analysis☆56Updated last year
- ☆11Updated 2 years ago
- The ntentional blog - a machine learning journey☆23Updated last year
- GitHub repositories and users recommendations by embeddings☆16Updated last year
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆31Updated 4 months ago