mim-solutions / bert_for_longer_textsLinks
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
☆144Updated last year
Alternatives and similar repositories for bert_for_longer_texts
Users that are interested in bert_for_longer_texts are comparing it to the libraries listed below
Sorting:
- ☆367Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆265Updated 10 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆81Updated last year
- Efficient Attention for Long Sequence Processing☆98Updated last year
- Zero and Few shot named entity & relationships recognition☆386Updated this week
- Clustering sentence embeddings to extract message intent☆175Updated 3 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆392Updated 2 years ago
- Guideline following Large Language Model for Information Extraction☆395Updated 10 months ago
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆85Updated last year
- Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama, and Mistral for Disaster Tweets Analysis with Lora☆51Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆155Updated last year
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆784Updated last year
- Multi-label text classification using BERT☆65Updated 4 years ago
- ☆67Updated 4 years ago
- Text classification with Foundation Language Model LLaMA☆114Updated 2 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆93Updated 10 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction☆311Updated 2 years ago
- Creating class-based TF-IDF matrices☆90Updated 2 years ago
- A curated list of resources on document similarity measures (papers, tutorials, code, ...)☆252Updated 3 years ago
- ☆60Updated 4 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated 4 months ago
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)☆345Updated 6 months ago
- SpanMarker for Named Entity Recognition☆451Updated 8 months ago
- REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).☆545Updated last year
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆43Updated 10 months ago
- Building NER and RE components using HuggingFace Transformers☆51Updated 3 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- A Framework for Textual Entailment based Zero Shot text classification☆152Updated last year
- A Python library for calculating a large variety of metrics from text☆347Updated 9 months ago