google-research / language
Shared repository for open-sourced projects from the Google AI Language team.
☆1,640Updated 3 months ago
Alternatives and similar repositories for language:
Users that are interested in language are comparing it to the libraries listed below
- jiant is an nlp toolkit☆1,657Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,091Updated 10 months ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,119Updated 2 years ago
- Code for using and evaluating SpanBERT.☆894Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…☆963Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,898Updated last year
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,413Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,345Updated 10 months ago
- Entity Linker solution☆1,175Updated last year
- Library for Knowledge Intensive Language Tasks☆921Updated 2 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,244Updated 10 months ago
- ACL2020 Tutorial: Open-Domain Question Answering☆834Updated 4 years ago
- Autoregressive Entity Retrieval☆777Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,215Updated 5 months ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,135Updated 11 months ago
- InferSent sentence embeddings☆2,285Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆712Updated last year
- Officially supported AllenNLP models☆534Updated 2 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,193Updated 3 months ago
- A curated list of pretrained sentence and word embedding models☆2,241Updated 3 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,105Updated 5 months ago
- Language-Agnostic SEntence Representations☆3,606Updated 8 months ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,287Updated 6 months ago
- Super easy library for BERT based NLP models☆1,877Updated 5 months ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆637Updated 2 years ago
- BERT-related papers☆2,036Updated last year
- PyTorch deep learning models for document classification☆593Updated last year
- Fast BPE☆659Updated 7 months ago
- Code for paper Fine-tune BERT for Extractive Summarization☆1,473Updated 3 years ago