rsvp-ai / segatron_aaai
Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"
☆18Oct 25, 2022Updated 3 years ago
Alternatives and similar repositories for segatron_aaai
Users who are interested in segatron_aaai are comparing it to the libraries listed below.
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- FL-Tuning☆12Jul 11, 2022Updated 3 years ago
- ☆17Oct 14, 2022Updated 3 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)☆19Jul 28, 2021Updated 4 years ago
- Official repository of the R2-D2's pipeline☆21Nov 16, 2021Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Aug 2, 2021Updated 4 years ago
- Relation extraction task with Efficient-GlobalPointer☆24Jan 27, 2022Updated 4 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- ☆22Feb 2, 2023Updated 3 years ago
- 2020 Alibaba Cloud Tianchi Big Data Competition: Traditional Chinese Medicine Named Entity Recognition Challenge☆27Nov 7, 2020Updated 5 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆28Oct 17, 2023Updated 2 years ago
- A Dify plugin for word splitting and similar operations☆23Sep 12, 2025Updated 5 months ago
- Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021☆37Dec 21, 2021Updated 4 years ago
- Code base for paper "Zero-Shot Cross-Lingual Transfer with Meta Learning"☆35Nov 8, 2024Updated last year
- hopefully I can continuously develop the project.☆29Dec 16, 2022Updated 3 years ago
- ☆41Jul 24, 2024Updated last year
- This is the pytorch implementation of the long paper on ACL 2020: A Self-Training Method for Machine Reading Comprehension with Soft Evid…☆34Aug 14, 2020Updated 5 years ago
- ☆12Nov 25, 2023Updated 2 years ago
- ☆11Nov 27, 2020Updated 5 years ago
- COMET for African languages☆10Jan 24, 2025Updated last year
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- A simple API that can generate various types of hexagon grids - returns GeoJSON data or load into PostGIS with performant JDBC.☆10Aug 2, 2025Updated 6 months ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- ☆12Dec 14, 2022Updated 3 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.☆12Jun 24, 2024Updated last year
- Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER (ACL 2022)☆44Apr 7, 2022Updated 3 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod…☆11Mar 10, 2021Updated 4 years ago
- 2020 Xiamen International Bank Digital Innovation Finance Cup Modeling Competition: merit award solution☆11Feb 2, 2021Updated 5 years ago
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- An experimental custom seq-2-seq model with both layer-wise (inter-layer), and intra-layer attention (attention to previous hidden states…☆10Nov 30, 2017Updated 8 years ago
- Elevator is an open source, on-disk key-value store. Provides high-performance bulk read-write operations over very large datasets while …☆70May 14, 2014Updated 11 years ago
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 4 years ago
- Hearst Patterns to extract Hypernyms from text☆13Oct 30, 2019Updated 6 years ago
- a within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 2 years ago