qburst / common-crawl-malayalam
Useful tools to extract Malayalam text from the Common Crawl datasets
☆27Updated 2 months ago
Alternatives and similar repositories for common-crawl-malayalam:
Users who are interested in common-crawl-malayalam are comparing it to the libraries listed below:
- Semantically distinct key phrase extraction using Hilbert hashes.☆48Updated 3 years ago
- Topic inference with zero-shot models☆61Updated last year
- Natural Language Generation for Gramex applications.☆24Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆70Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- A machine learning tool for creating training datasets quickly and easily using a smart Chrome extension☆14Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- NeatText: a simple NLP package for cleaning textual data and text preprocessing☆71Updated last year
- NLP tool to extract emotional phrases from tweets 🤩☆40Updated 3 years ago
- Use Google's state-of-the-art T5 pre-trained model to create human-like summaries☆25Updated 3 years ago
- A News Article Collection Library☆22Updated last year
- Automatically check for mismatches between code and comments using AI and ML☆53Updated 3 years ago
- Neural Machine Translation for South African Languages☆38Updated 2 years ago
- Conversational text analysis using various NLP techniques☆181Updated last year
- Neural Elastic Inference and Search☆19Updated 5 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- Efficient and easy to use transliteration for Indian languages☆51Updated 4 years ago
- ReconNER: debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated 2 months ago
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for efficient document classification☆29Updated last month