daveshap / PlainTextWikipediaLinks
Convert Wikipedia database dumps into plaintext files
☆326Updated 4 years ago
Alternatives and similar repositories for PlainTextWikipedia
Users that are interested in PlainTextWikipedia are comparing it to the libraries listed below
Sorting:
- Nearly a thousand bash and python scripts I've written over the years.☆123Updated 9 months ago
- Download subreddit comments☆97Updated 3 years ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more☆221Updated last year
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆243Updated 2 years ago
- 📊 Semantic search for headlines and story text☆360Updated 2 years ago
- Conversational text Analysis using various NLP techniques☆183Updated 2 years ago
- GPT Takes the Bar Exam☆142Updated 2 years ago
- Chat interface to gpt-j. Runs in Google Colab.☆59Updated 2 years ago
- The world's largest social media toxicity dataset.☆187Updated 3 years ago
- A Reddit bot that generates new context-aware comments using Markov chains trained from a set of given users or subreddits comments histo…☆73Updated 4 years ago
- Python code for building a GPT-3 based technical blog post optimizer.☆84Updated 3 years ago
- A client for OpenAI's GPT-3 API for ad hoc testing of prompt without using the web interface.☆89Updated 5 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆125Updated last year
- An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000…☆58Updated 3 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 4 years ago
- 🧠 AI memory assistant – remember everything you read☆302Updated 2 years ago
- Question Generation - Question Answering for Automatic Flashcards☆66Updated 3 years ago
- Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.☆226Updated 5 years ago
- Offline Internet Archive project☆299Updated last year
- Streaming WARC/ARC library for fast web archive IO☆434Updated 10 months ago
- A Flask webapp & Python scripts for predicting reddit users' political leaning, using their comment history.☆64Updated 2 years ago
- The AI Knowledge Editor☆185Updated 3 years ago
- mp4grep is a CLI for transcribing and searching audio/video files☆288Updated 2 years ago
- Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides p…☆34Updated 5 years ago
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Updated 2 years ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆280Updated 2 years ago
- A tool to automatically turn any Wikipedia article into a video☆57Updated 3 years ago
- Multi-angle c(q)uestion answering☆456Updated 3 years ago
- Neural Search☆333Updated last year
- Scraper for downloading the entire ebooks repository of project Gutenberg☆152Updated this week