charlesdedampierre / BunkaTopics
πΊοΈ Data Cleaning and Textual Data Visualization πΊοΈ
β163Updated 8 months ago
Alternatives and similar repositories for BunkaTopics:
Users that are interested in BunkaTopics are comparing it to the libraries listed below
- A BERT-based application for reusable text classification at scaleβ37Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)β184Updated last month
- β67Updated 11 months ago
- Nesta's Skills Extractor Libraryβ126Updated 3 months ago
- A spaCy wrapper for GliNERβ107Updated 3 weeks ago
- SpanMarker for Named Entity Recognitionβ417Updated last month
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023β83Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ185Updated 4 months ago
- Package to extract connotation framesβ83Updated last year
- Robust and fast topic models with sentence-transformers.β42Updated this week
- Let's build better datasets, together!β252Updated 2 months ago
- Generalist and Lightweight Model for Text Classificationβ65Updated this week
- Notebooks for training universal 0-shot classifiers on many different tasksβ119Updated last month
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ63Updated last year
- β84Updated 9 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ66Updated 6 months ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)β385Updated 4 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β76Updated last year
- HDBSCAN Tuning for BERTopic Modelsβ43Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β104Updated 9 months ago
- Tools for interactive visual exploration of semantic embeddings.β30Updated 5 months ago
- Easily embed, cluster and semantically label text datasetsβ508Updated 10 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modellingβ81Updated 7 months ago
- β54Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β147Updated 4 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ106Updated last week
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated 11 months ago
- β31Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β64Updated 3 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.β55Updated 6 months ago