patrickvonplaten / notebooks
Some notebooks for NLP
☆187Updated last year
Related projects ⓘ
Alternatives and complementary repositories for notebooks
- A repo to explore different NLP tasks which can be solved using T5☆169Updated 3 years ago
- Efficient Attention for Long Sequence Processing☆87Updated 10 months ago
- Applying BERT to named entity recognition in English and Russian.☆159Updated last year
- Tutorial for first time BERT users,☆102Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆79Updated 2 years ago
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆129Updated 9 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Few-shot Named Entity Recognition☆122Updated 2 years ago
- Comprehensive NLP Evaluation System☆188Updated 3 months ago
- Neural information retrieval / Semantic search / Bi-encoders☆167Updated last year
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆427Updated last year
- xfspell — the Transformer Spell Checker☆187Updated 4 years ago
- This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those coo…☆255Updated 2 years ago
- Generating boolean (yes/no) questions from any content using T5 text-to-text transformer model and BoolQ dataset☆35Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆291Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆200Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 5 months ago
- New dataset☆298Updated 3 years ago
- Language Modeling Example with Transformers and PyTorch Lighting☆65Updated 3 years ago
- Contains notebooks related to various transformers based models for different nlp based tasks☆42Updated last year
- A benchmark for code-switched NLP, ACL 2020☆74Updated 5 months ago
- ☆124Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆156Updated last month
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆197Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.☆177Updated 3 years ago