qanastek / DrBERTLinks
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
☆20Updated last year
Alternatives and similar repositories for DrBERT
Users that are interested in DrBERT are comparing it to the libraries listed below
Sorting:
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆68Updated 2 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- Do Multilingual Language Models Think Better in English?☆42Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Efficient few-shot learning with cross-encoders.☆58Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆84Updated last year
- ☆110Updated 9 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆186Updated 2 months ago
- A High-level Library for Named Entity Recognition in Python.☆24Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆186Updated 3 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated last year
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆24Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- Tools for managing datasets for governance and training.☆83Updated last month
- Using short models to classify long texts☆21Updated 2 years ago
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- ☆102Updated last year
- ☆38Updated 3 years ago
- ☆17Updated 2 years ago
- Multidocument Summarization for Literature Review Shared Task 2022☆30Updated 2 years ago
- MAFAND-MT☆57Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆95Updated 2 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆101Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆107Updated last year