jeongyun0609 / TranSplat
☆11Updated last month
Related projects ⓘ
Alternatives and complementary repositories for TranSplat
- Russian Drama Corpus (in TEI-P5)☆9Updated 5 years ago
- Extract data from German Wiktionary XML files.☆24Updated 3 months ago
- A Python library and application enabling translation via the DeepL translator available at deepl.com.☆61Updated 5 years ago
- State-of-the-art count-based word embeddings for low-resource languages with a special focus on historical languages.☆11Updated last month
- Downloadable database of german verbs and conjugations as found on wiktionary.org☆25Updated 2 years ago
- A collection of word lists in machine readable, web-native (.yml and .json) format☆20Updated last year
- Work in progress transmit from Google Code☆1,109Updated 6 years ago
- Offline database of synonyms/thesaurus☆189Updated 9 months ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆146Updated 7 months ago
- Simhash and near-duplicate detection☆410Updated last year
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,159Updated 3 months ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated 8 months ago
- How to share your git hooks and config with your team members and put them under version control☆24Updated 5 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆282Updated last year
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Updated 4 years ago
- Summarizes news articles☆1,169Updated 3 years ago
- Open German WordNet☆88Updated 8 months ago
- A small program to detect gibberish using a Markov Chain☆597Updated 8 months ago
- This python script uses the Pocket API (http://getpocket.com/developer/) to back up all of your data, separated by tags and fixing link n…☆20Updated 11 years ago
- a CSV of every english word, part of speech, and definition. as well as a web scraping script that generates that data for you☆98Updated last year
- Universal Dependencies online documentation☆272Updated this week
- Weighted Levenshtein library☆105Updated last year
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Mult…☆178Updated last year
- Jupyter notebooks for course "Computational Morphology with HFST".☆15Updated 2 years ago
- Compact Language Detector 2☆843Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- Bitextor generates translation memories from multilingual websites☆290Updated this week
- A dataset of popular forenames and surnames by country☆19Updated last year
- LingPy: Python library for quantitative tasks in historical linguistics☆124Updated 10 months ago
- Compute Sentence Embeddings Fast!☆618Updated last year
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆17Updated 5 months ago