A tool for text normalisation via character-level machine translation
☆13Jun 12, 2020Updated 5 years ago
Alternatives and similar repositories for csmtiser
Users that are interested in csmtiser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Feb 12, 2026Updated 2 months ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 3 years ago
- ASR transcription and SLU annotation web interface for call logs collected at UFAL-DSG.☆11Dec 8, 2014Updated 11 years ago
- Upcoming ACL 2020 paper☆26May 8, 2020Updated 6 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12May 18, 2017Updated 8 years ago
- Github mirror of MediaWiki extension WikibaseQualityConstraints - our actual code is hosted with Gerrit (please see https://www.mediawiki…☆14Updated this week
- Tentative way towards a shared API for prosopographical data based on the factoid model (Bradley/Short 2005)☆24Aug 25, 2022Updated 3 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆49Mar 11, 2026Updated last month
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- minimal examples of brat annotation visualizations☆17Jan 21, 2015Updated 11 years ago
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Jan 13, 2016Updated 10 years ago
- A makeshift python program which relies on nltk and Stanford Core NLP models to expand common contractions in the english language.☆10Nov 8, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin…☆13Sep 17, 2021Updated 4 years ago
- Erlangen CRM - An OWL implementation of the CIDOC Conceptual Reference Model☆44Sep 20, 2024Updated last year
- ☆13Feb 26, 2023Updated 3 years ago
- A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute…☆25Jun 16, 2025Updated 10 months ago
- ☆13Jul 26, 2023Updated 2 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo…☆27Apr 28, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)☆74Apr 22, 2021Updated 5 years ago
- A repository of legal NLP research papers.☆12Jan 3, 2020Updated 6 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Sep 3, 2013Updated 12 years ago
- Weighted Training for Cross-Task Learning☆15Feb 12, 2023Updated 3 years ago
- Docker container for MediaWiki☆31Apr 8, 2021Updated 5 years ago
- Vocabseditor is a web-based tool for collaborative work on controlled vocabularies development☆25Sep 4, 2025Updated 8 months ago
- TurkGate: Grouping and Access Tools for External surveys (for use with Amazon Mechanical Turk)☆27Oct 27, 2015Updated 10 years ago
- Pre-processing DBpedia datasets to load into Dgraph☆13Mar 6, 2022Updated 4 years ago
- Benchmark datasets for sentiment analysis☆12May 18, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generic Environment for Context-Aware Correction of Orthography☆22Sep 7, 2022Updated 3 years ago
- Computer Vision, 1st Project : Shape from Shading☆12Feb 24, 2014Updated 12 years ago
- A curated list of Angular 2 libraries☆24Jan 29, 2017Updated 9 years ago
- SHACL Community Group (Post-REC activitities)☆37Jan 27, 2025Updated last year
- 古漢語常用字典☆14Sep 1, 2016Updated 9 years ago
- Arabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words☆19Dec 16, 2021Updated 4 years ago
- TweetBERT: A Pretrained Language Representation Model for Twitter Text Analysis☆15Jun 1, 2022Updated 3 years ago