AndreiRegiani/wikipedia-crawler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AndreiRegiani/wikipedia-crawler)

AndreiRegiani / wikipedia-crawler

Extracts plain-text from Wikipedia articles, ideal to perform linguistic analysis on a specific topic

☆43

Alternatives and similar repositories for wikipedia-crawler

Users that are interested in wikipedia-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndreiRegiani / falcon-mongo-template
View on GitHub
Project template for REST API with Falcon, MongoDB and PyPy
☆19Mar 25, 2025Updated last year
AndreiRegiani / falcon-jsonify
View on GitHub
Falcon middleware to serialize/deserialize JSON with built-in request validator
☆38Mar 26, 2026Updated 3 months ago
luizdepra / r8
View on GitHub
A simple CHIP8 interpreter made with Rust.
☆11Apr 23, 2026Updated 2 months ago
CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003
View on GitHub
Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
☆12May 11, 2021Updated 5 years ago
fmsouza / ionic-mangiare
View on GitHub
Tinder for food match
☆11Jan 13, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JoaoLages / RATransformers
View on GitHub
RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
☆42Dec 14, 2022Updated 3 years ago
CLAW-Lab / ToM
View on GitHub
Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"
☆18May 18, 2022Updated 4 years ago
nrimsky / InfluenceFunctions
View on GitHub
Implementation of Influence Function approximations for differently sized ML models, using PyTorch
☆18Sep 15, 2023Updated 2 years ago
ThaisRobba / browserify-phaser
View on GitHub
Testing and building Phaser projects with Browserify and Beefy
☆14May 3, 2015Updated 11 years ago
ricbit / rgzip
View on GitHub
Gzip decoder in Rust
☆10May 14, 2017Updated 9 years ago
durango / express-4-boilerplate
View on GitHub
Boilerplate for Express 4 (MVC), Google+, and SequelizeJS
☆15May 4, 2014Updated 12 years ago
TinTeam / SN-50
View on GitHub
Still Under Development SN-50 is a free and open source fantasy computer for building, playing and sharing resources-limited games.
☆26Apr 23, 2026Updated 2 months ago
johnscillieri / netwatch
View on GitHub
Display and label a live table of hosts in your network
☆14Nov 25, 2016Updated 9 years ago
aside-ufba / guide-automator
View on GitHub
Automated User Guide Generation with Markdown
☆13Sep 20, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
virex-84 / VoskIdentification
View on GitHub
Тестовый пример задействования модели для идентификации голоса с помощью библиотеки распознавания речи "Vosk" (Воск): https://alphacephei…
☆12Aug 14, 2023Updated 2 years ago
stratoserp / stratoserp
View on GitHub
Modules for the Stratos ERP project
☆12May 15, 2023Updated 3 years ago
marhs / pokerai
View on GitHub
Simple bot for Texas Hold'em. Uses a montecarlo approach and it's extensible.
☆12Mar 27, 2015Updated 11 years ago
codelibs / elasticsearch-langfield
View on GitHub
This plugin provides a useful feature for multi-language
☆14Jul 15, 2022Updated 4 years ago
rethinkdb / example-rabbitmq
View on GitHub
☆18Feb 18, 2016Updated 10 years ago
blinktrade / iofiber
View on GitHub
☆15Jul 8, 2020Updated 6 years ago
zhijing-jin / bleu
View on GitHub
A Handy Python wrapper for common NLP evaluation scripts like BLEU.
☆14Feb 10, 2020Updated 6 years ago
holepunchto / bare-fs
View on GitHub
Native file system operations for Bare
☆27Jul 7, 2026Updated 2 weeks ago
alviano / python
View on GitHub
My collection of Python tools!
☆11Jan 27, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
plesk / plesk-ext-sdk
View on GitHub
Toolkit for development extensions for Plesk
☆14Jan 10, 2026Updated 6 months ago
h3rald / minline
View on GitHub
A line editing library in pure Nim.
☆19Jun 16, 2026Updated last month
roboflow / star-track
View on GitHub
⭐ Star-Track is a user-friendly utility for tracking GitHub repository statistics
☆18Updated this week
holepunchto / blind-peer
View on GitHub
Peer that is blind
☆15Updated this week
patik / kind
View on GitHub
Precise type-checker for JavaScript
☆11Oct 23, 2025Updated 8 months ago
eugen1j / aioscrapy
View on GitHub
Python asynchronous library for web scrapping
☆12Aug 24, 2021Updated 4 years ago
ShenggaoZhu / midict
View on GitHub
MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value d…
☆14May 19, 2016Updated 10 years ago
coolaj86 / node-walk
View on GitHub
A semi-port of python's os.walk
☆12Mar 26, 2018Updated 8 years ago
graphistry / opencl-test
View on GitHub
Infrastructure setup.
☆10Jul 27, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
donkeycode / frontendlogger
View on GitHub
☆12May 31, 2016Updated 10 years ago
mlewand / rtf-parse
View on GitHub
A simplified RTF parser.
☆11Oct 31, 2019Updated 6 years ago
duonginspace / AudioNovelty
View on GitHub
Audio Novelty Detection
☆14Nov 20, 2018Updated 7 years ago
AssemblyAI / kaldi-asr-tutorial
View on GitHub
Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI
☆13May 20, 2023Updated 3 years ago
mahnunchik / lucene-query-parser
View on GitHub
Lucene Query Parser for Javascript created using PEG.js.
☆24May 14, 2017Updated 9 years ago
rhasspy / phonetisaurus-pypi
View on GitHub
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12Mar 11, 2021Updated 5 years ago
webignition / robots-txt-file
View on GitHub
Models a robots.txt file
☆18Jan 28, 2026Updated 5 months ago