Tool for extracting plain text from wikipedia data
☆31Mar 14, 2016Updated 9 years ago
Alternatives and similar repositories for wikipedia
Users that are interested in wikipedia are comparing it to the libraries listed below
Sorting:
- Crawling and analyzing data on Wikipedia☆17Mar 8, 2024Updated 2 years ago
- Grav GitHub Plugin☆13Dec 15, 2020Updated 5 years ago
- Scrape financial data of cities, EPCI, departments and regions☆17Aug 18, 2017Updated 8 years ago
- ☆29Mar 16, 2015Updated 10 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese☆10Dec 9, 2013Updated 12 years ago
- A content-filtering bypass system developed specifically to allow access to trans-related resources on public networks (libraries, school…☆27Nov 15, 2014Updated 11 years ago
- Extract JSON front matter from strings and files☆20Jan 24, 2015Updated 11 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Dec 22, 2025Updated 2 months ago
- Redis tcp map for postfix☆12Jun 28, 2024Updated last year
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743☆12Aug 4, 2018Updated 7 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling☆10Mar 27, 2014Updated 11 years ago
- Madek main web interface☆21Updated this week
- Ruby library - Fill out PDF form with FDF/XFDF via pdftk☆16Sep 28, 2021Updated 4 years ago
- ☆12Mar 12, 2025Updated 11 months ago
- Browser based post correction tool for Alto XML files☆14Sep 20, 2013Updated 12 years ago
- Python client for the legifrance.gouv.fr website☆11Apr 29, 2021Updated 4 years ago
- Simple HTTP redirector for tmpnb nodes☆12Sep 20, 2017Updated 8 years ago
- CockroachDB adaptor for Ecto 3.x☆10Nov 25, 2021Updated 4 years ago
- A tiny chat app written in Meteor for my 12Devs article.☆15Jun 20, 2013Updated 12 years ago
- Semantic dependency relationship extractor untuk bahasa Indonesia... termasuk bahasa gaul dan alay ;) (terinspirasi oleh OpenCog RelEx)☆10Oct 2, 2015Updated 10 years ago
- Declarative unit testing for Answer Set Programming projects☆12Mar 4, 2018Updated 8 years ago
- wavelet-based positive peak detection for 1-d data☆14May 26, 2011Updated 14 years ago
- A scraper that uses the twitter API to download pertinent data to text files☆10Jun 7, 2017Updated 8 years ago
- Term List Matching Plugin for ElasticSearch☆26Jan 20, 2014Updated 12 years ago
- This gem provides a way to connect to aws redshift using the ruby-pg☆12Jun 28, 2023Updated 2 years ago
- A simple web framework based on asyncio.☆25Sep 25, 2016Updated 9 years ago
- The secure, transparent, auditable, reliable electronic voting system☆14Oct 6, 2016Updated 9 years ago
- minimalist gmail cli client☆64Oct 16, 2014Updated 11 years ago
- TAUS Dynamic Quality Framework API☆12Sep 17, 2020Updated 5 years ago
- Create PDF with responsive layout☆11Apr 21, 2015Updated 10 years ago
- Experimental Redis plugin for Vim☆13Jun 8, 2013Updated 12 years ago
- Matlab based document image analysis and classification system, that makes heavy use of contextual and language cues to decode image glyp…☆12Nov 7, 2011Updated 14 years ago
- Build worker component of Kochiku☆68Dec 17, 2018Updated 7 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- Solarized style for Qt Creator's syntax highlighter☆31Aug 22, 2016Updated 9 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆58Jul 11, 2013Updated 12 years ago
- A duplicate data detector engine PoC based on Elasticsearch.☆20Apr 3, 2015Updated 10 years ago
- A clone of Windows Security Center mainly useful for demonstrating Windows APIs for accessing Firewall/AntiVirus/AntiSpyware/Windows Upda…☆12May 10, 2010Updated 15 years ago
- [DEPRECATED] Use https://github.com/runtimejs/runtime-cli☆11Jul 18, 2015Updated 10 years ago