mim-solutions / bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
☆141Updated 10 months ago
Alternatives and similar repositories for bert_for_longer_texts
Users that are interested in bert_for_longer_texts are comparing it to the libraries listed below
Sorting:
- Efficient Attention for Long Sequence Processing☆94Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 6 months ago
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- ☆161Updated 10 months ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆155Updated last year
- ☆360Updated last year
- A Framework for Textual Entailment based Zero Shot text classification☆152Updated last year
- Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama, and Mistral for Disaster Tweets Analysis with Lora☆51Updated last year
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆84Updated last year
- SpanMarker for Named Entity Recognition☆429Updated 4 months ago
- A Python library for calculating a large variety of metrics from text☆337Updated 5 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆75Updated last year
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- Clustering sentence embeddings to extract message intent☆173Updated 3 years ago
- Guideline following Large Language Model for Information Extraction☆371Updated 6 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆388Updated 2 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms☆147Updated last year
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated last year
- Coreference Resolution☆76Updated 4 years ago
- Building NER and RE components using HuggingFace Transformers☆50Updated 3 years ago
- A text truncation method, useful for instance in long text classification☆23Updated 2 years ago
- ☆44Updated 2 years ago
- Data and models for the SciFact verification task.☆232Updated last year
- Long Document Summarization Papers☆147Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆179Updated last month
- ☆21Updated 11 months ago
- ☆42Updated 3 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆763Updated 9 months ago
- Text classification with Foundation Language Model LLaMA☆115Updated 2 years ago