networkdynamics / seldonite
A News Article Collection Library
☆22Updated 2 years ago
Alternatives and similar repositories for seldonite:
Users that are interested in seldonite are comparing it to the libraries listed below
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 7 months ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆19Updated 5 years ago
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- ☆54Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 7 months ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆21Updated 4 years ago
- Blazing fast fuzzy text search for Python.☆44Updated 2 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆16Updated 8 months ago
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Updated last year
- LLM plugin for embeddings using sentence-transformers☆55Updated 2 weeks ago
- ☆67Updated last year
- spaCy entry points for Curated Transformers☆29Updated 6 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- Source code for the website geminibyexample.com which provides simple Python code examples for the Gemini SDK☆19Updated last week
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- examples and guides to using Nomic Atlas☆31Updated this week
- An integration of Qdrant ANN vector database backend with txtai☆24Updated 8 months ago
- Robust and fast topic models with sentence-transformers.☆48Updated this week
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated 11 months ago
- Various Jupyter notebooks about Common Crawl data☆52Updated last week
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 5 months ago