cyb3rk0tik / pyfrancLinks
Text language detection basic on trigrams.
☆14Updated last year
Alternatives and similar repositories for pyfranc
Users that are interested in pyfranc are comparing it to the libraries listed below
Sorting:
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 4 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 3 years ago
- Match celebrity users with their respective tweets by making use of Semantic Textual Similarity on over 900+ celebrity users' 2.5 million…☆13Updated last year
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 5 months ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Updated 2 years ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch…☆9Updated 4 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆80Updated 10 months ago
- ☆14Updated 2 years ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.☆35Updated last year
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- Fast Neural Machine Translation in C++ - development repository☆19Updated last year
- code and data used to build a training dataset for dragnet models☆10Updated 4 years ago
- A simple semantic search engine for scientific papers.☆28Updated last year
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 5 months ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 4 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated 2 years ago
- Hugging Face and Pyserini interoperability☆20Updated 2 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Semantic Parser Localizer (SPL) code repository☆9Updated 4 years ago
- Text classification automl☆21Updated 3 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Exploring NLP weak supervision approaches to train text classification models. The project is also a prototype for a semi-automated text …☆22Updated last year
- Extract knowledge from raw text☆13Updated 3 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2☆24Updated 4 years ago
- LEMON: Explainable Entity Matching☆18Updated 3 years ago
- Library for computing Deterministic Acyclic Finite State Automata (DAFSA)☆27Updated 2 years ago