A simple neural truecaser written in PyTorch and AllenNLP.
☆33 · Jun 17, 2024 · Updated last year
Alternatives and similar repositories for pytorch-truecaser
Users who are interested in pytorch-truecaser compare it to the libraries listed below.
- Language independent truecaser in Python. ☆160 · Oct 17, 2021 · Updated 4 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026) ☆17 · Updated this week
- A natural language processing tool for automatically detecting quotations in text. ☆15 · Feb 26, 2022 · Updated 4 years ago
- ☆10 · Feb 2, 2021 · Updated 5 years ago
- A database of number names for 186 languages, locales, and scripts ☆67 · Mar 3, 2023 · Updated 3 years ago
- A simple and humble image captioning application, based on a neural network built with Keras ☆10 · Sep 23, 2022 · Updated 3 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data. ☆11 · Nov 3, 2020 · Updated 5 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- ☆13 · Jan 14, 2025 · Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- ☆10 · Jun 8, 2024 · Updated last year
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. ☆11 · Jan 11, 2020 · Updated 6 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi… ☆12 · Sep 17, 2024 · Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 · Oct 3, 2021 · Updated 4 years ago
- A web application tagging and retrieval of arguments in text ☆30 · May 1, 2023 · Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 · Oct 29, 2021 · Updated 4 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- Source code for the Apple reproduction ☆33 · Apr 23, 2021 · Updated 4 years ago
- Pure C# port of the Pocketsphinx keyword spotter ☆13 · Jan 19, 2020 · Updated 6 years ago
- Evaluate language models using multiple choice items ☆13 · Updated this week
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 · Aug 19, 2022 · Updated 3 years ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021 ☆36 · May 8, 2021 · Updated 4 years ago
- Util code, issues, discussions ☆29 · Aug 31, 2018 · Updated 7 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ☆29 · Sep 28, 2018 · Updated 7 years ago
- a ducttape workflow for neural machine translation ☆14 · Mar 23, 2021 · Updated 4 years ago
- C++ implementation of Alessandro Moschitti's Tree Kernel algorithm, from "Making Tree Kernels Practical for Natural Language Learning" ☆12 · Oct 10, 2019 · Updated 6 years ago
- ☆13 · Apr 16, 2021 · Updated 4 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆14 · Jun 27, 2023 · Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper ☆14 · Aug 9, 2021 · Updated 4 years ago
- Recommender System using Apache Spark ☆16 · Oct 3, 2017 · Updated 8 years ago
- ☆14 · Dec 3, 2019 · Updated 6 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- Free Dutch voice dataset ☆12 · Jan 28, 2021 · Updated 5 years ago
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁 ☆16 · Jun 29, 2021 · Updated 4 years ago
- Feature extraction for accented-speech or pathological speech ☆17 · Apr 2, 2019 · Updated 6 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- saved models for spleeter (tf and tfjs) ☆16 · Jan 30, 2020 · Updated 6 years ago