A simple neural truecaser written in pytorch and allennlp.
β33Jun 17, 2024Updated last year
Alternatives and similar repositories for pytorch-truecaser
Users that are interested in pytorch-truecaser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Language independent truecaser in Python.β160Oct 17, 2021Updated 4 years ago
- πΈ GlotWeb: Web Indexing for Minority Languages (WWW 2026)β17Feb 27, 2026Updated last month
- A natural language processing tool for automatically detecting quotations in text.β15Feb 26, 2022Updated 4 years ago
- Automatic Detection of Potentially Idiomatic Expressionsβ12Feb 19, 2021Updated 5 years ago
- A database of number names for 186 languages, locales, and scriptsβ67Mar 3, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- β10Jun 8, 2024Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"β28Oct 3, 2021Updated 4 years ago
- C++ implementation of Alessandro Moschitti's Tree Kernel algorithm, from "Making Tree Kernels Practical for Natural Language Learning"β12Oct 10, 2019Updated 6 years ago
- Convert words to numbersβ21Apr 13, 2022Updated 4 years ago
- A simple and humble image captioning application, based on a neural network built with Kerasβ10Sep 23, 2022Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flairβ24Oct 29, 2021Updated 4 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fiβ¦β12Sep 17, 2024Updated last year
- Combining encoder-based language modelsβ11Nov 11, 2021Updated 4 years ago
- β13Apr 16, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"β14Aug 19, 2022Updated 3 years ago
- English-French MT dialogue datasetβ17Apr 29, 2022Updated 3 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 5 years ago
- steps to perform text-based speaker diarization with kaldi toolkitβ12Nov 2, 2018Updated 7 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paperβ14Aug 9, 2021Updated 4 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.β11Jan 11, 2020Updated 6 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.β11Nov 3, 2020Updated 5 years ago
- β14Dec 3, 2019Updated 6 years ago
- Evaluate language models using multiple choice itemsβ13Mar 6, 2026Updated last month
- Deploy open-source AI quickly and easily - Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ΓrΓ nlα»Μwα»Μ is a utility library for analysis & (pre)processing of YorΓΉbΓ‘ text β https://pypi.org/project/iranlowoβ19Dec 10, 2022Updated 3 years ago
- bin filesβ13Jan 30, 2025Updated last year
- saved models for spleeter (tf and tfjs)β16Jan 30, 2020Updated 6 years ago
- MFAQ: a Multilingual FAQ Datasetβ18Sep 17, 2023Updated 2 years ago
- Recommender System using Apache Sparkβ16Oct 3, 2017Updated 8 years ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials.β12Aug 25, 2021Updated 4 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.β14Jun 27, 2023Updated 2 years ago
- A web interface to understand language-specific BERT-modelsβ18Apr 16, 2024Updated last year
- A Mechanical Turk Interface (amti) π€β56Jan 11, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-coreβ15Jun 19, 2023Updated 2 years ago
- Scripts for training Kaldi for German speech recognition (ASR).β27Feb 11, 2021Updated 5 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inferenceβ61Dec 8, 2022Updated 3 years ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021β36May 8, 2021Updated 4 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)β14Oct 8, 2020Updated 5 years ago
- Phonetically-Oriented Word Error Rateβ36May 4, 2019Updated 6 years ago
- Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluaβ¦β15Aug 13, 2020Updated 5 years ago