Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"
☆18Oct 25, 2022Updated 3 years ago
Alternatives and similar repositories for segatron_aaai
Users that are interested in segatron_aaai are comparing it to the libraries listed below.
- ☆47Apr 12, 2019Updated 7 years ago
- Official repository of the R2-D2 pipeline☆21Nov 16, 2021Updated 4 years ago
- ☆17Oct 14, 2022Updated 3 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- 2020 Alibaba Cloud Tianchi Big Data Competition: Traditional Chinese Medicine Named Entity Recognition Challenge☆27Nov 7, 2020Updated 5 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆42May 5, 2021Updated 4 years ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Aug 2, 2021Updated 4 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆10Jun 27, 2022Updated 3 years ago
- Personal Infrastructure for Deep Learning based on Pytorch and Tensorflow☆10Jan 10, 2019Updated 7 years ago
- Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning"☆35Nov 8, 2024Updated last year
- Relation extraction with Efficient-GlobalPointer☆24Jan 27, 2022Updated 4 years ago
- ☆12May 19, 2021Updated 4 years ago
- 2020 Alibaba Cloud Tianchi Big Data Competition: Traditional Chinese Medicine Literature Question Generation Challenge☆30Sep 2, 2021Updated 4 years ago
- Experiments for XLM-V Transformers Integration☆13Feb 8, 2023Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- ☆11Apr 2, 2024Updated 2 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- Online BaseHangul Encoder And Decoder☆12Jan 30, 2023Updated 3 years ago
- ☆11Feb 22, 2018Updated 8 years ago
- This is a re-implementation of our KDD 2020 paper "Grammatically Recognizing Images with Tree Convolution."☆13Dec 9, 2020Updated 5 years ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- Personal information identification standard☆21Jan 24, 2024Updated 2 years ago
- ☆22Feb 2, 2023Updated 3 years ago
- Pytorch implementation of QAnet☆13May 6, 2018Updated 7 years ago
- MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021☆13Jan 16, 2023Updated 3 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)☆19Jul 28, 2021Updated 4 years ago
- An experimental custom seq-2-seq model with both layer-wise (inter-layer), and intra-layer attention (attention to previous hidden states…☆10Nov 30, 2017Updated 8 years ago
- Python script to download conference papers automatically☆16Sep 10, 2024Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆19Mar 26, 2026Updated 3 weeks ago
- Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning☆14Apr 11, 2022Updated 4 years ago
- Pytorch-Lightning Seq2seq model with the use of recurrent neural network☆10Mar 29, 2021Updated 5 years ago
- BERT Baseline for the Natural Questions☆11Jan 24, 2019Updated 7 years ago