daveshap / PlainTextWikipedia
Convert Wikipedia database dumps into plaintext files
☆326 Updated 4 years ago
Alternatives and similar repositories for PlainTextWikipedia
Users who are interested in PlainTextWikipedia are comparing it to the libraries listed below.
- Nearly a thousand Bash and Python scripts I've written over the years. ☆124 Updated 11 months ago
- Download subreddit comments ☆95 Updated 3 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆244 Updated 2 years ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more ☆224 Updated 2 years ago
- A Flask webapp & Python scripts for predicting Reddit users' political leaning, using their comment history. ☆63 Updated 2 years ago
- Conversational text analysis using various NLP techniques ☆182 Updated 2 years ago
- Chat interface to GPT-J. Runs in Google Colab. ☆59 Updated 2 years ago
- An ongoing dataset consisting of hashtags, n-gram counts and other misc NLP things for COVID-19 analysis, stemming from over 100 000 000… ☆58 Updated 3 years ago
- A client for OpenAI's GPT-3 API for ad hoc testing of prompts without using the web interface. ☆89 Updated 5 years ago
- GPT Takes the Bar Exam ☆142 Updated 2 years ago
- The world's largest social media toxicity dataset. ☆187 Updated 3 years ago
- 📊 Semantic search for headlines and story text ☆359 Updated 2 years ago
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖 ☆180 Updated 5 months ago
- Library of Alexandria (LoA in short) is a project that aims to collect and archive documents from the internet. ☆128 Updated last year
- The subreddit archiver ☆177 Updated 2 years ago
- Concise answers to search queries using Google and GPT-3. Includes citations. ☆82 Updated 3 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts ☆53 Updated 4 years ago
- Python code for building a GPT-3 based technical blog post optimizer. ☆85 Updated 3 years ago
- ☆44 Updated 4 years ago
- Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation. ☆226 Updated 5 years ago
- A GPT-J API to use with Python 3 to generate text, blogs, code, and more ☆204 Updated 3 years ago
- A Chrome Extension that promotes politically diverse news reading with Artificial Intelligence! ☆100 Updated 4 years ago
- ☆79 Updated 7 years ago
- This AI Does Not Exist: generate realistic descriptions of made-up machine learning models. ☆147 Updated 3 years ago
- Self-hosted GPT playground ☆112 Updated last year
- Neural Search ☆334 Updated last year
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆155 Updated this week
- Traversing links to find the deep source of information ☆69 Updated 2 years ago
- Cleaning tool for web scraped text ☆38 Updated 2 years ago
- GPT2Explorer brings an OpenAI GPT-2 language model playground that runs locally on standard Windows computers. ☆28 Updated 3 years ago