d-kleine / NER_decoderLinks
Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFace
☆44Updated last month
Alternatives and similar repositories for NER_decoder
Users that are interested in NER_decoder are comparing it to the libraries listed below
Sorting:
- Gzip and nearest neighbors for text classification☆57Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last month
- Generalist and Lightweight Model for Text Classification☆164Updated 4 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆73Updated 3 weeks ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 3 months ago
- ☆87Updated 5 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 2 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 3 months ago
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- ☆26Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆212Updated last month
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆138Updated 9 months ago
- PyTorch implementation for MRL☆19Updated last year
- ☆124Updated last year
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Updated 10 months ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆38Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆53Updated last month
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated last year
- Smart commit messages☆18Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆35Updated 5 months ago
- ☆85Updated 4 months ago
- Pre-train Static Word Embeddings☆89Updated 2 months ago