daveshap / PlainTextWikipedia
Convert Wikipedia database dumps into plaintext files
☆311 Updated 3 years ago
Alternatives and similar repositories for PlainTextWikipedia:
Users who are interested in PlainTextWikipedia compare it to the repositories listed below.
- A Reddit bot that generates new context-aware comments using Markov chains trained from a set of given users or subreddits comments histo… ☆73 Updated 3 years ago
- Sick of that "Save as PDF" link on Wikipedia? Why not just have Python do it for you? ☆28 Updated 4 years ago
- The world's largest profanity list. ☆212 Updated 9 months ago
- Conversational text Analysis using various NLP techniques ☆179 Updated last year
- Download subreddit comments ☆93 Updated 2 years ago
- Vector search dictionary definitions ☆44 Updated 2 years ago
- Python code for building a GPT-3 based technical blog post optimizer. ☆84 Updated 2 years ago
- The world's largest social media toxicity dataset. ☆178 Updated 2 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated last year
- Nearly a thousand bash and python scripts I've written over the years. ☆120 Updated this week
- experiment to generate novel-length fiction from a single story premise ☆30 Updated 2 years ago
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖 ☆166 Updated 2 months ago
- Reddit script to archive user's saved Reddit posts and comments ☆32 Updated 4 years ago
- GPT-3 Explorer ☆207 Updated 4 years ago
- ☆32 Updated last year
- 🧠 AI memory assistant – remember everything you read ☆298 Updated 2 years ago
- TextReducer - A Tool for Summarization and Information Extraction ☆87 Updated 8 months ago
- ☆205 Updated 11 months ago
- a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sen… ☆225 Updated 2 years ago
- Chat interface to gpt-j. Runs in Google Colab. ☆55 Updated last year
- This is a Reddit bot based on OpenAI's GPT-2 117M model ☆102 Updated 5 years ago
- A source of knowledge for all things LLM. ☆53 Updated last year
- GPT Takes the Bar Exam ☆141 Updated 2 years ago
- A tool to automatically turn any Wikipedia article into a video ☆56 Updated 2 years ago
- Python notebook to run OpenAI's Whisper model with speaker identification ☆80 Updated 2 years ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo ☆271 Updated last year
- Python script to generate SVG crosswords from a text file ☆124 Updated 5 years ago
- Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation. ☆223 Updated 4 years ago
- Random programs for Reddit ☆18 Updated 4 years ago
- This AI Does Not Exist: generate realistic descriptions of made-up machine learning models. ☆147 Updated 2 years ago