qburst / common-crawl-malayalam
Useful tools to extract Malayalam text from the Common Crawl datasets
☆28 Updated 7 months ago
Alternatives and similar repositories for common-crawl-malayalam
Users who are interested in common-crawl-malayalam are comparing it to the libraries listed below
- Semantically distinct key phrase extraction using Hilbert hashes. ☆50 Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆38 Updated 6 years ago
- Topic Inference with Zeroshot models ☆61 Updated 2 years ago
- Automatically check mismatch between code and comments using AI and ML ☆53 Updated 4 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Lightweight intelligent searching of elasticsearch data ☆40 Updated 4 years ago
- ☆14 Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated 2 years ago
- NeatText: a simple NLP package for cleaning textual data and text preprocessing ☆72 Updated last year
- ☆44 Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 Updated last year
- A comprehensive tool for linguistic analysis of communities ☆49 Updated 3 years ago
- Text classification automl ☆21 Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆71 Updated 10 months ago
- ☆55 Updated last year
- ☆22 Updated 3 years ago
- Loan Risk Prediction Neural Network and API ☆17 Updated 4 years ago
- ☆28 Updated 4 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray ☆43 Updated last year
- Framework for building and maintaining self-updating prompts for LLMs ☆64 Updated last year
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- This project lets you automatically extract glossary words and their definitions from a given piece of text using NLP techniques ☆29 Updated 4 years ago
- Domain-specific BERT representation for Named Entity Recognition of lab protocol ☆29 Updated 4 years ago
- Easy PDF to text to spaCy text extraction in Python. ☆39 Updated 9 months ago
- ☆43 Updated 2 years ago
- ☆57 Updated 3 years ago
- A curated list of awesome ML frameworks & libraries for text data ☆16 Updated 2 years ago
- BERT Probe: A Python package for probing attention-based robustness to character- and word-based adversarial evaluation. Also, with recipe… ☆18 Updated 3 years ago
- NLP tool to extract emotional phrases from tweets 🤩 ☆40 Updated 3 years ago