qburst / common-crawl-malayalam
Useful tools to extract malayalam text from the Common Crawl Datasets
☆27Updated 2 years ago
Related projects: ⓘ
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- ☆17Updated last year
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆67Updated last month
- Example Flask project to use Spacy on AWS Lambda and get the models from an S3 bucket☆12Updated last year
- No Teacher BART distillation experiment for NLI tasks☆25Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- Information extraction from English and German texts based on predicate logic☆133Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆61Updated 6 months ago
- ☆28Updated 4 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆87Updated 2 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆71Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆114Updated 5 months ago
- Summarizer in python with Spacy and Universal Sentence Encoder build on Flask framework☆20Updated last year
- Automatically check mismatch between code and comments using AI and ML☆54Updated 3 years ago
- ☆65Updated 2 years ago
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- ☆16Updated last year
- A personal knowledge base that I can dump information to and help me learn☆24Updated 3 months ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- ☆11Updated 3 years ago
- Text classification automl☆21Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- ☆11Updated 4 years ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆42Updated 3 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆66Updated 9 months ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆57Updated 2 years ago