OlehOnyshchak / pyWikiMMLinks
Collects a multimodal dataset of Wikipedia articles and their images
☆16Updated 2 years ago
Alternatives and similar repositories for pyWikiMM
Users that are interested in pyWikiMM are comparing it to the libraries listed below
Sorting:
- Daily TV News Summary using GPT☆23Updated 4 months ago
- Download okCupid users public data automatically☆10Updated 3 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆63Updated last year
- Boolean text search in Python☆46Updated 3 months ago
- ☆27Updated 2 years ago
- Tools to construct and process Common Crawl webgraphs☆99Updated last week
- ☆14Updated 3 years ago
- Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides p…☆34Updated 5 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Simple pdf to text with python using PDFtk and PyPDF2☆21Updated 2 years ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.☆48Updated 3 years ago
- Discourse Analysis Tool Suite☆36Updated this week
- A.I. Every Day☆28Updated last year
- Code for constructing TLDR corpus from Reddit dataset☆26Updated 3 years ago
- Adversarial Training on Transformer Networks to discover check-worthy factual claims☆80Updated last year
- Allows you to edit videos automatically using Motion Detection☆32Updated 4 years ago
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆27Updated 4 years ago
- Synthetic QA generation for long documents.☆16Updated 3 years ago
- ☆56Updated 2 years ago
- ☆43Updated 2 years ago
- GenieNLP: A versatile codebase for any NLP task☆88Updated last year
- A database of movie scripts from several sources☆177Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated 2 years ago
- ☆17Updated 4 years ago
- A tool to easily scrape youtube data using the Google API☆12Updated 6 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated last year
- A language detection software☆58Updated 7 years ago
- ☆33Updated 2 years ago
- Extracts iframes or keyframes from a video file, through the command line or from inside python.☆17Updated 2 years ago