sanderland / grouphug
Multi-task modelling extensions for huggingface transformers
☆13Updated last year
Alternatives and similar repositories for grouphug:
Users that are interested in grouphug are comparing it to the libraries listed below
- Multi-task modelling extensions for huggingface transformers☆20Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆66Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆56Updated 6 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆328Updated last year
- ☆16Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated)☆24Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆166Updated 2 months ago
- ☆30Updated 3 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆29Updated 3 months ago
- Evaluate language models using multiple choice items☆12Updated 3 weeks ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆124Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆173Updated 3 weeks ago
- A comprehensive benchmark for entity disambiguation☆25Updated last year
- Efficient Attention for Long Sequence Processing☆92Updated last year
- ☆65Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆37Updated 10 months ago
- Few-shot Named Entity Recognition☆122Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 3 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆66Updated 3 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 9 months ago
- Neural information retrieval / Semantic search / Bi-encoders☆169Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Text classification with Foundation Language Model LLaMA☆114Updated last year
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆102Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- ☆40Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆43Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆104Updated 8 months ago