Giuseppe-Della-Corte / IESTAC
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
☆11 Updated 4 years ago
Alternatives and similar repositories for IESTAC
Users who are interested in IESTAC are comparing it to the repositories listed below
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆317 Updated last month
- Various utilities for processing the data. ☆211 Updated this week
- A minimal, pure Python library to interface with CoNLL-U format files. ☆151 Updated this week
- Universal Dependencies online documentation ☆288 Updated this week
- English data ☆213 Updated this week
- A multilingual parallel corpus created from translations of the Bible. ☆185 Updated 3 months ago
- ☆64 Updated 2 weeks ago
- CONLL-U to Pandas DataFrame ☆31 Updated 7 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… ☆223 Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆82 Updated last year
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆25 Updated last year
- Appraise code used as part of WMT21 human evaluation campaign ☆28 Updated 3 weeks ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆197 Updated 4 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆58 Updated 3 weeks ago
- Bitextor generates translation memories from multilingual websites ☆295 Updated 9 months ago
- ☆24 Updated 4 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆316 Updated 3 years ago
- ☆45 Updated 3 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices ☆79 Updated last year
- ☆23 Updated 5 years ago
- Doing things with embeddings ☆66 Updated 2 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data. ☆48 Updated 6 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆362 Updated 2 years ago
- List of research and engineering of NLP for American Native/Indigenous Languages. ☆92 Updated 4 years ago
- Automatically exported from code.google.com/p/universal-pos-tags ☆130 Updated 3 years ago
- Sentence aligner ☆116 Updated 4 years ago
- Python framework for processing Universal Dependencies data ☆57 Updated 3 weeks ago
- ☆74 Updated 2 weeks ago
- analyze text with empath ☆336 Updated 8 years ago
- Transform TMX to text ☆27 Updated 2 years ago