OlehOnyshchak / pyWikiMM
Collects a multimodal dataset of Wikipedia articles and their images
☆15 Updated last year
Related projects
Alternatives and complementary repositories for pyWikiMM
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆32 Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆31 Updated 3 years ago
- Discourse Analysis Tool Suite ☆17 Updated this week
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch… ☆9 Updated 4 years ago
- A Python package to get useful information from documents using TopicRank Algorithm. ☆16 Updated last year
- The News Landscape Toolkit (NELA) ☆15 Updated 4 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT) ☆22 Updated last year
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping. ☆33 Updated last year
- A collection of code, data and information related to our audit of TikTok. ☆17 Updated last month
- PyTorch implementation of a BiLSTM model for the Wikification project. ☆17 Updated 4 years ago
- ☆11 Updated 6 months ago
- This repository contains different algorithms that are used to build taxonomy from text corpus. ☆8 Updated 3 years ago
- Tool for the Automatic Assessment of Lexical Diversity ☆11 Updated 3 years ago
- ☆12 Updated last year
- ☁️ A network analysis software platform for analyzing Dutch and European court decisions. ☆16 Updated last year
- ☆15 Updated 3 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper. ☆12 Updated last year
- StAtutory Reasoning Assessment ☆11 Updated last year
- Semantic Parser Localizer (SPL) code repository ☆9 Updated 3 years ago
- ☆14 Updated 2 years ago
- An NLP research and data collection platform. ☆17 Updated 8 months ago
- Interpretable feature construction from taxonomies for text classification ☆18 Updated 2 years ago
- Quote identification, attribution and resolution. ☆11 Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... Fast!! ☆17 Updated this week
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod… ☆19 Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆55 Updated 6 months ago
- Code for DELSumm, an unsupervised summarization algorithm for legal case judgements. ☆25 Updated 2 years ago
- Open-source, knowledge-grounded conversational AI system ☆13 Updated 3 months ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus ☆14 Updated last year
- TopicGPT allows integrating the benefits of LLMs into Topic Modelling ☆21 Updated 5 months ago