A tool for text normalisation via character-level machine translation
☆13Jun 12, 2020Updated 6 years ago
Alternatives and similar repositories for csmtiser
Users that are interested in csmtiser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tool for automatic spelling normalization☆22Jan 18, 2021Updated 5 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Feb 12, 2026Updated 4 months ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 3 years ago
- ASR transcription and SLU annotation web interface for call logs collected at UFAL-DSG.☆11Dec 8, 2014Updated 11 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Apr 23, 2024Updated 2 years ago
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12May 18, 2017Updated 9 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- Github mirror of MediaWiki extension WikibaseQualityConstraints - our actual code is hosted with Gerrit (please see https://www.mediawiki…☆14Updated this week
- TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS☆16Sep 26, 2017Updated 8 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆49Mar 11, 2026Updated 3 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆25Oct 27, 2023Updated 2 years ago
- A web-based environment to graphically model TOSCA topologies.☆17May 14, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Jan 13, 2016Updated 10 years ago
- A makeshift python program which relies on nltk and Stanford Core NLP models to expand common contractions in the english language.☆10Nov 8, 2017Updated 8 years ago
- ☆25May 27, 2021Updated 5 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin…☆13Sep 17, 2021Updated 4 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute…☆25Jun 16, 2025Updated last year
- Automated Twitter bots, run by the artificial artificial intelligence of Amazon Mechanical Turk.☆32Dec 23, 2010Updated 15 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo…☆27May 11, 2026Updated last month
- OWL-ontologies for Humanities, developed in the NIE-INE project (National Infrastructure for Editions)☆20Mar 16, 2021Updated 5 years ago
- The code for NeurIPS 2020 paper: Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion.☆10Oct 26, 2020Updated 5 years ago
- Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)☆74Apr 22, 2021Updated 5 years ago
- A repository of legal NLP research papers.☆12Jan 3, 2020Updated 6 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Sep 3, 2013Updated 12 years ago
- Docker container for MediaWiki☆30Apr 8, 2021Updated 5 years ago
- Vocabseditor is a web-based tool for collaborative work on controlled vocabularies development☆24Sep 4, 2025Updated 9 months ago
- a smart filter script for all qmail lovers☆17Aug 6, 2014Updated 11 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Reading Group @ DMG☆11Nov 15, 2018Updated 7 years ago
- Benchmark datasets for sentiment analysis☆12May 18, 2020Updated 6 years ago
- Generic Environment for Context-Aware Correction of Orthography☆23Sep 7, 2022Updated 3 years ago
- Computer Vision, 1st Project : Shape from Shading☆12Feb 24, 2014Updated 12 years ago
- SHACL Community Group (Post-REC activitities)☆37Jan 27, 2025Updated last year
- Arabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words☆19Dec 16, 2021Updated 4 years ago
- ☆15Sep 5, 2016Updated 9 years ago