charlesdedampierre / BunkaTopics
🗺️ Data Cleaning and Textual Data Visualization 🗺️
★168, updated 10 months ago
Alternatives and similar repositories for BunkaTopics:
Users that are interested in BunkaTopics are comparing it to the libraries listed below
- A BERT-based application for reusable text classification at scale (★38, updated last year)
- Robust and fast topic models with sentence-transformers. (★48, updated last week)
- Notebooks for training universal 0-shot classifiers on many different tasks (★124, updated 4 months ago)
- (★67, updated last year)
- Generalist and Lightweight Model for Text Classification (★123, updated this week)
- HDBSCAN Tuning for BERTopic Models (★45, updated last year)
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) (★206, updated 3 weeks ago)
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. (★108, updated 11 months ago)
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction (★71, updated 9 months ago)
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes (★199, updated last week)
- SpanMarker for Named Entity Recognition (★427, updated 3 months ago)
- A spaCy wrapper for GliNER (★114, updated 3 months ago)
- Tools for interactive visual exploration of semantic embeddings. (★32, updated 8 months ago)
- (★87, updated 11 months ago)
- Blazing fast topic modelling for short texts. (★31, updated 3 weeks ago)
- Powerful topic model visualization in Python (★119, updated last month)
- Late Interaction Models Training & Retrieval (★288, updated 3 weeks ago)
- Use Hugging Face text and token classification pipelines directly in spaCy (★63, updated last year)
- A very simple news crawler with a funny name (★378, updated this week)
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. (★79, updated last year)
- Easily embed, cluster and semantically label text datasets (★530, updated last year)
- Efficient few-shot learning with cross-encoders. (★51, updated last year)
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data (★68, updated last year)
- (★360, updated last year)
- SpaCy wrapper for ConceptNet (★92, updated last year)
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ (★66, updated 6 months ago)
- Nesta's Skills Extractor Library (★133, updated 6 months ago)
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling. (★59, updated 9 months ago)
- Reference-Free automatic summarization evaluation with potential hallucination detection (★100, updated last year)
- Template Haystack Search Application with Streamlit (★27, updated 3 months ago)