networkdynamics / seldonite
A News Article Collection Library
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for seldonite
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 2 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆44Updated 3 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆20Updated 8 months ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆32Updated last year
- Finds linguistic patterns effortlessly☆33Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- ☆53Updated 10 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆55Updated 6 months ago
- Small python package to measure OCR quality and other related metrics.☆21Updated 9 months ago
- examples and guides to using Nomic Atlas☆27Updated 2 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆28Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆19Updated last month
- LLM plugin for embeddings using sentence-transformers☆43Updated 9 months ago
- Python package for extractive NLP using the OpenAI API☆14Updated 2 months ago
- Streamlit app for recommending eval functions using prompt diffs☆25Updated 10 months ago
- A conda-smithy repository for spacy.☆14Updated 2 weeks ago
- ☆68Updated 8 months ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- spaCy entry points for Curated Transformers☆25Updated last month
- Summarize. is a Streamlit application that performs automatic text summarization using both extractive and abstractive models.☆15Updated 3 years ago
- Tools to construct and process webgraphs from Common Crawl data☆80Updated this week
- ☆11Updated 4 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆57Updated 6 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆11Updated 3 months ago