daveshap / PlainTextWikipediaLinks
Convert Wikipedia database dumps into plaintext files
☆327Updated 4 years ago
Alternatives and similar repositories for PlainTextWikipedia
Users that are interested in PlainTextWikipedia are comparing it to the libraries listed below
Sorting:
- Nearly a thousand bash and python scripts I've written over the years.☆124Updated last year
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆245Updated 2 years ago
- Conversational text Analysis using various NLP techniques☆182Updated 2 years ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more☆224Updated 2 years ago
- 📊 Semantic search for headlines and story text☆359Updated 2 years ago
- The world's largest social media toxicity dataset.☆189Updated 3 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆127Updated last year
- An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000…☆59Updated 3 years ago
- Python code for building a GPT-3 based technical blog post optimizer.☆85Updated 3 years ago
- The subreddit archiver☆178Updated 2 years ago
- A tool to automatically turn any Wikipedia article into a video☆57Updated 3 years ago
- Tools to construct and process Common Crawl webgraphs☆105Updated last week
- GPT Takes the Bar Exam☆142Updated 3 years ago
- Offline Internet Archive project☆311Updated last year
- Use GPT-3 to process human conversations and extract context, identify information that would be useful, and suggest data sources to get …☆29Updated 4 years ago
- ☆44Updated 4 years ago
- ☆64Updated 2 years ago
- 🧠 AI memory assistant – remember everything you read☆303Updated 3 years ago
- Streaming WARC/ARC library for fast web archive IO☆446Updated last year
- Multi-angle c(q)uestion answering☆457Updated 3 years ago
- The AI Knowledge Editor☆184Updated 3 years ago
- The Python script for downloading new mp3 from RSS given channels☆141Updated 10 months ago
- A Flask webapp & Python scripts for predicting reddit users' political leaning, using their comment history.☆63Updated 2 years ago
- Labelling platform for text using weak supervision.☆260Updated 3 years ago
- 🔎 Semantic search for developers☆541Updated 2 years ago
- This AI Does Not Exist: generate realistic descriptions of made-up machine learning models.☆147Updated 3 years ago
- Set of scripts and notebooks used to produce results visible in RecipeNLG paper☆622Updated 3 years ago
- Neural Search☆334Updated last year
- Python script to generate SVG crosswords from a text file☆126Updated 6 years ago
- Espial is an engine for automated organization and discovery of personal knowledge☆177Updated 2 weeks ago