a python package for cleaning Gutenberg books and dataset
☆34May 2, 2025Updated 10 months ago
Alternatives and similar repositories for gutenberg_cleaner
Users that are interested in gutenberg_cleaner are comparing it to the libraries listed below
Sorting:
- WordMaster is an intelligent word-info proider which can do the work for you about finding anything about a word.☆24Apr 25, 2019Updated 6 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- Submission for the Programming Task for the Precog Recruitment Process (II)☆14Jul 29, 2016Updated 9 years ago
- [ONGOING] ACM ICPC Handbook for Algorithms and Data Structures☆24Oct 25, 2020Updated 5 years ago
- ☆31Mar 14, 2017Updated 8 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Jul 5, 2019Updated 6 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- repository for 2018 Fall Stats 131 class at UCLA☆14Mar 1, 2019Updated 7 years ago
- ULMFiT + Siamese Network for Sentence Vectors☆33Oct 18, 2018Updated 7 years ago
- This is the Javascript Code, it helps you to find you visited your Facebook Profile.☆12Sep 13, 2018Updated 7 years ago
- Enables mobile UI if on desktop, or disables it on mobile.☆10May 7, 2022Updated 3 years ago
- ☆13Feb 8, 2024Updated 2 years ago
- Free programming language books☆10Jun 4, 2020Updated 5 years ago
- Codes for the paper "Towards Sub-Word Level Compositions for Sentiment Analysis of Hi-En Code Mixed Text "☆35Jan 11, 2017Updated 9 years ago
- Music Genre Classification by Lyrics using a Hierarchical Attention Network☆32Dec 20, 2017Updated 8 years ago
- Files for the SACon 2018 "Learning RESTful Microservices from the Ground Up"☆15Mar 11, 2018Updated 7 years ago
- Manipulate and traverse tree-like structures in TypeScript.☆11Mar 5, 2021Updated 5 years ago
- ☆16Aug 6, 2023Updated 2 years ago
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- ☆12Mar 8, 2024Updated last year
- stoplists for African languages generated from the ASP corpus☆14Jan 16, 2016Updated 10 years ago
- ☆10Aug 1, 2018Updated 7 years ago
- ☆12Jan 30, 2023Updated 3 years ago
- A semantic role labeling system for the Sumerian language. A Google Summer of Code '18 initiative.☆15Feb 10, 2023Updated 3 years ago
- Clojure-style anonymous function literal for Elisp☆12Jan 21, 2015Updated 11 years ago
- Python and R scripts for visualising and analysing baby sleep patterns.☆12May 17, 2017Updated 8 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 6 months ago
- an experimental implementation of Burrow's delta in Python 3☆12Jun 6, 2017Updated 8 years ago
- Ruby code to access Microsoft's Ngram data☆20Apr 12, 2012Updated 13 years ago
- This project aims to use Inkscape (open source software) to recreate a lot of blazons in SVG format -- Scalable Vector Graphics -- to all…☆14Jun 16, 2015Updated 10 years ago
- A POSIX emoji formatter☆11May 22, 2016Updated 9 years ago
- ☆12Mar 4, 2025Updated last year
- M5Stack Face Bluetooth KeyBoard HID Script KeyBoard☆10Nov 30, 2020Updated 5 years ago
- Hierarchically Regularized Entropy Balancing☆12Sep 20, 2025Updated 5 months ago
- 南京大学2016年《数据新闻》课程☆10Jun 16, 2017Updated 8 years ago
- Plymouth themes taken from Hackers (1995)☆11Oct 7, 2018Updated 7 years ago
- Homestuck mod for Dwarf Fortress☆11Mar 14, 2023Updated 2 years ago
- ☆14Mar 9, 2023Updated 2 years ago