Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.
☆62Jul 1, 2021Updated 4 years ago
Alternatives and similar repositories for exquisite-corpus
Users that are interested in exquisite-corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Access a database of word frequencies, in various natural languages.☆1,663Jan 4, 2025Updated last year
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- JavaScript port of SymSpell for Node.js☆13Sep 30, 2022Updated 3 years ago
- Tools for indexing gzip files to support random-like access.☆28Mar 15, 2021Updated 5 years ago
- Ini file library for Python☆31Jun 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- lachesis automates the segmentation of a transcript into closed captions☆35Jan 26, 2017Updated 9 years ago
- ☆27Oct 14, 2022Updated 3 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Jun 9, 2016Updated 9 years ago
- The Open Multilingual Wordnet☆73May 6, 2024Updated 2 years ago
- enable rapid iteration and development of complex data pipelines☆30Mar 9, 2025Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆76Apr 1, 2025Updated last year
- A library for fetching and reading Tatoeba's weekly exports☆24Feb 5, 2026Updated 3 months ago
- Various test fonts (OpenType, OpenType with TrueType GX variation extensions, Multiple Master) for testing implementations of font format…☆11Jun 25, 2025Updated 11 months ago
- Object Resource Stream and CDXJ Drafts☆15Nov 28, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Irony - .NET Language Implementation Kit. Forked/Mirrored from CodePlex to Add Gtk# Explorer☆11Jan 21, 2016Updated 10 years ago
- ☆45May 14, 2026Updated 2 weeks ago
- openFrameworks addon for drawing fonts using signed distance functions (SDF)☆13Jul 16, 2018Updated 7 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- An implementation of Defeasible Logic in Python☆15Sep 2, 2018Updated 7 years ago
- Pretty-print markdown☆32Feb 5, 2013Updated 13 years ago
- ThoughtTreasure commonsense knowledge base and architecture for natural language processing☆81Jul 31, 2015Updated 10 years ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- Stanford CoreNLP Extensions: Fork to provide the ability to capture Multi-Word Expressions☆10Jun 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- BitTorrent Tracker☆31Oct 22, 2009Updated 16 years ago
- Tools for working with types where a subset of values has a total order, like e.g. floats without NaN☆13Nov 7, 2025Updated 6 months ago
- Launch a Google search for exceptions from Python apps☆28Jan 19, 2015Updated 11 years ago
- rtpmidi package from the Scenic project: https://github.com/sat-metalab/scenic☆10Oct 27, 2015Updated 10 years ago
- Natural Language Processing tools☆12Jan 26, 2017Updated 9 years ago
- An application to manage webhooks☆20Jul 1, 2015Updated 10 years ago
- DEPRECATED Export Members of a Facebook Group to a CSV☆13Jun 30, 2020Updated 5 years ago
- Javascript tokenizer for english sentences☆14Oct 15, 2015Updated 10 years ago
- RDF Community Discussions. Ask anything here!☆13Apr 11, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An example Slack Outgoing Webhook for Buildkite☆16Jun 2, 2015Updated 10 years ago
- Data Driven Journalism Handbook☆23Sep 23, 2012Updated 13 years ago
- Mechanics functions with end-to-end support for deep learning developers, written in Ivy.☆14Aug 28, 2023Updated 2 years ago
- A simple Glyphs App plugin to find words that contain the selected glyphs.☆11Sep 18, 2024Updated last year
- A bunch of Fontlab Macros☆20Apr 20, 2017Updated 9 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Sep 15, 2021Updated 4 years ago
- iOS & watchOS speech-to-text app with AI voice keyboard, on-device RAG, and chat with your notes - powered by Apple Foundation Models, Wh…☆68Updated this week