networkdynamics / seldonite
A News Article Collection Library
☆22Updated last year
Alternatives and similar repositories for seldonite:
Users that are interested in seldonite are comparing it to the libraries listed below
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated last year
- ☆54Updated last year
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Updated 6 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆59Updated 9 months ago
- LLM plugin for embeddings using sentence-transformers☆48Updated last week
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- Building a Job Dataset☆21Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆30Updated 5 months ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆18Updated 4 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆67Updated 11 months ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated last year
- examples and guides to using Nomic Atlas☆27Updated 2 weeks ago
- The News Landscape Toolkit (NELA)☆15Updated 4 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆66Updated 6 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- create workflows with LLMs☆52Updated 6 months ago