zaghloul404 / englishidioms
An efficient Python package for detecting and identifying English idiomatic expressions and phrases within sentences.
☆19Updated last year
Alternatives and similar repositories for englishidioms:
Users that are interested in englishidioms are comparing it to the libraries listed below
- A modern, interlingual wordnet interface for Python☆244Updated this week
- Open Language Profiles — English profile datasets from CEFR-J☆122Updated 5 years ago
- Repository for CEFR-SP corpus and sentence level assessment☆40Updated 7 months ago
- A Python Wiktionary Parser☆358Updated 2 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆99Updated last week
- ☆14Updated 11 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆64Updated this week
- Gather modern English word frequencies from all enwiki articles.☆212Updated last year
- Break long English Sentence into simple sentences☆14Updated last year
- University of Colorado VerbNet☆104Updated 11 months ago
- The Open English WordNet☆540Updated this week
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆93Updated 11 months ago
- Wiktionary dump file parser and multilingual data extractor☆891Updated this week
- A multilingual parallel corpus created from translations of the Bible.☆179Updated this week
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- The Open Source Dictionary☆547Updated last month
- A list of vocabulary lists☆21Updated 4 years ago
- Measure the readability of a given text using surface characteristics☆78Updated 2 months ago
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆52Updated 4 months ago
- Multilingual sentence alignment using sentence embeddings☆116Updated 5 months ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆45Updated 2 years ago
- Syllabification and stress detection for Spanish☆10Updated 6 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆361Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆106Updated 6 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆67Updated 3 months ago
- Sentence aligner☆112Updated 3 years ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆21Updated last year
- Lexical database for ~70k English words with morphological variables☆42Updated 3 years ago
- Offline database of synonyms/thesaurus☆195Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆244Updated 2 years ago