networkdynamics / seldonite
A News Article Collection Library
☆22Updated last year
Alternatives and similar repositories for seldonite:
Users that are interested in seldonite are comparing it to the libraries listed below
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 6 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- LLM plugin for embeddings using sentence-transformers☆52Updated last month
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- LLM plugin for clustering embeddings☆72Updated last year
- Pytorch implementation of a BiLSTM model for the Wikification project.☆19Updated 4 years ago
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- spaCy entry points for Curated Transformers☆27Updated 5 months ago
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- Finds linguistic patterns effortlessly☆35Updated last year
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- 🌸 Train floret vectors☆18Updated last year
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆67Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆23Updated last year
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- TextReducer - A Tool for Summarization and Information Extraction☆87Updated 10 months ago
- ☆54Updated last year
- Python package for extractive NLP using the OpenAI API☆17Updated 6 months ago
- ChatBot App built using LangChain and Lightning AI☆18Updated 2 years ago
- Various Jupyter notebooks about Common Crawl data☆51Updated last month
- Demo example of consumer goods categorization☆26Updated last year
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 4 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆78Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- Analysis of gutenberg dataset☆43Updated 6 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated 6 months ago