qburst / common-crawl-malayalam
Useful tools to extract Malayalam text from the Common Crawl datasets
★28 · Updated 5 months ago
Alternatives and similar repositories for common-crawl-malayalam
Users who are interested in common-crawl-malayalam compare it to the libraries listed below.
- A machine learning tool for creating training datasets quickly and easily using a smart Chrome extension ★14 · Updated 2 years ago
- NLP tool to extract emotional phrases from tweets ★40 · Updated 3 years ago
- NeatText: a simple NLP package for cleaning textual data and text preprocessing ★72 · Updated last year
- Semantically distinct key phrase extraction using Hilbert hashes. ★49 · Updated 3 years ago
- Classify a job description (or noisy job title) into an ONET job title ★19 · Updated 8 years ago
- ★28 · Updated 4 years ago
- spaCy match and replace, maintaining conjugation ★35 · Updated 2 years ago
- Natural Language Generation for Gramex applications. ★24 · Updated 2 years ago
- A curated list of awesome ML frameworks & libraries for text data ★16 · Updated 2 years ago
- End-to-end machine learning; "no code" required! ★12 · Updated 4 years ago
- ★57 · Updated 2 years ago
- ★11 · Updated 5 years ago
- State of the art open-source translation for Indic languages. ★5 · Updated 4 years ago
- Efficient and easy to use transliteration for Indian languages ★51 · Updated 4 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ★36 · Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ★91 · Updated 3 years ago
- ★22 · Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ★15 · Updated 2 years ago
- Expose a Top2Vec model with a REST API. ★90 · Updated 2 years ago
- Streamlit-based web app for AI text generation with GPT-2 models from the HuggingFace Model Hub, using the Python library aitextgen ★27 · Updated 4 years ago
- Tooling to play around with multilingual machine translation for Indian languages. ★22 · Updated 3 years ago
- Text classification AutoML ★21 · Updated 3 years ago
- ★17 · Updated 4 years ago
- Topic inference with zero-shot models ★61 · Updated last year
- Experimentation on Google's Gemma model ★16 · Updated last year
- Alternate implementation for zero-shot text classification: instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ★36 · Updated 3 years ago
- Finds linguistic patterns effortlessly ★36 · Updated last year
- TorchServe + Streamlit for easily serving your HuggingFace NER models ★33 · Updated 2 years ago
- A Python library for creating adversarial splits ★13 · Updated 2 years ago
- Language detection using spaCy and fastText ★55 · Updated last year