A tool for text normalisation via character-level machine translation
☆13Jun 12, 2020Updated 5 years ago
Alternatives and similar repositories for csmtiser
Users that are interested in csmtiser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tool for automatic spelling normalization☆21Jan 18, 2021Updated 5 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 3 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Apr 23, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12May 18, 2017Updated 8 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- Github mirror of MediaWiki extension WikibaseQualityConstraints - our actual code is hosted with Gerrit (please see https://www.mediawiki…☆14Updated this week
- TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS☆16Sep 26, 2017Updated 8 years ago
- Tentative way towards a shared API for prosopographical data based on the factoid model (Bradley/Short 2005)☆24Aug 25, 2022Updated 3 years ago
- Schema for modelling parliamentary debates☆22May 23, 2022Updated 3 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆47Mar 11, 2026Updated 2 weeks ago
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Jan 13, 2016Updated 10 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆25May 27, 2021Updated 4 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- computer tools for thai language☆23Sep 27, 2017Updated 8 years ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin…☆13Sep 17, 2021Updated 4 years ago
- Erlangen CRM - An OWL implementation of the CIDOC Conceptual Reference Model☆43Sep 20, 2024Updated last year
- ☆13Feb 26, 2023Updated 3 years ago
- A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute…☆25Jun 16, 2025Updated 9 months ago
- In this small project we will predict the email that in which folder it will go in spam or primary.☆11Jul 5, 2016Updated 9 years ago
- Automated Twitter bots, run by the artificial artificial intelligence of Amazon Mechanical Turk.☆32Dec 23, 2010Updated 15 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo…☆27Nov 11, 2025Updated 4 months ago
- The code for NeurIPS 2020 paper: Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion.☆10Oct 26, 2020Updated 5 years ago
- OWL-ontologies for Humanities, developed in the NIE-INE project (National Infrastructure for Editions)☆20Mar 16, 2021Updated 5 years ago
- A repository of legal NLP research papers.☆12Jan 3, 2020Updated 6 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Sep 3, 2013Updated 12 years ago
- Weighted Training for Cross-Task Learning☆15Feb 12, 2023Updated 3 years ago
- a smart filter script for all qmail lovers☆17Aug 6, 2014Updated 11 years ago
- TurkGate: Grouping and Access Tools for External surveys (for use with Amazon Mechanical Turk)☆27Oct 27, 2015Updated 10 years ago
- Pre-processing DBpedia datasets to load into Dgraph☆13Mar 6, 2022Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Generic Environment for Context-Aware Correction of Orthography☆22Sep 7, 2022Updated 3 years ago
- A curated list of Angular 2 libraries☆24Jan 29, 2017Updated 9 years ago
- SHACL Community Group (Post-REC activitities)☆36Jan 27, 2025Updated last year
- 古漢語常用字典☆13Sep 1, 2016Updated 9 years ago
- Arabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words☆19Dec 16, 2021Updated 4 years ago
- TweetBERT: A Pretrained Language Representation Model for Twitter Text Analysis☆15Jun 1, 2022Updated 3 years ago
- ☆15Sep 5, 2016Updated 9 years ago