networkdynamics / seldonite
A News Article Collection Library
☆22Updated 2 years ago
Alternatives and similar repositories for seldonite:
Users that are interested in seldonite are comparing it to the libraries listed below
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Pytorch implementation of a BiLSTM model for the Wikification project.☆19Updated 5 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- An integration of Qdrant ANN vector database backend with txtai☆24Updated 8 months ago
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 8 months ago
- ☆67Updated last year
- ☆23Updated last year
- 🌸 Train floret vectors☆18Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 8 months ago
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆16Updated last month
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- ☆54Updated last year
- spaCy extension for Visual Studio Code☆30Updated last month
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- LLM plugin for embeddings using sentence-transformers☆59Updated last week
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- A classifier that distinguishes political from non-political news articles.☆30Updated last year
- Various Jupyter notebooks about Common Crawl data☆52Updated last month
- 🔢 Work with static vector models☆28Updated 2 weeks ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Package to help with scientific literature research☆25Updated 2 years ago
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated this week
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Updated last year
- Plug-and-play document processing pipelines. No training. Batteries included.☆57Updated last week
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- The News Landscape Toolkit (NELA)☆15Updated 4 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- ☆30Updated 2 years ago