中文bigbird预训练模型
☆96Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for chinese-bigbird
Users that are interested in chinese-bigbird are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- chinese version of longformer☆117Nov 6, 2020Updated 5 years ago
- Transformers for Longer Sequences☆633Sep 1, 2022Updated 3 years ago
- ☆35Nov 23, 2022Updated 3 years ago
- ☆14Aug 26, 2024Updated last year
- RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23☆23Aug 12, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SVM Entity Relation classification for ace2005 chinese data☆14Jun 25, 2017Updated 8 years ago
- A Pytorch implementation for "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution“ (AAAI 2020).☆10Dec 10, 2020Updated 5 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Dec 2, 2020Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- a baseline to practice☆45Jul 6, 2021Updated 4 years ago
- DescriptionPairsExtraction, entity and it's description pairs extract program based on Albert and data back-annotation. 基于Albert与结构化数据回标思…☆20Mar 7, 2022Updated 4 years ago
- Machine Reading Comprehension Leadboard Summary☆12Jan 4, 2021Updated 5 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Mar 20, 2023Updated 3 years ago
- ☆420Mar 4, 2024Updated 2 years ago
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- 全球人工智能技术创新大赛-赛道三-冠军方案☆239Jul 12, 2021Updated 4 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- ☆12Jan 8, 2021Updated 5 years ago
- ☆32May 30, 2021Updated 4 years ago
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆52Aug 20, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for ACL22 findings paper: Inverse is Better! Fast and Accurate Prompt for Slot Tagging☆27Jul 13, 2022Updated 3 years ago
- ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion☆37Jul 25, 2024Updated last year
- HMM\CRF\BERT-CRF\BILSTM-CRF\BERTBILSTMCRF\XLNETBILSTMCRF☆33Jul 30, 2022Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,188Feb 8, 2023Updated 3 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆314Jul 30, 2020Updated 5 years ago
- ☆20Oct 27, 2022Updated 3 years ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- 基于依存句法与语义角色标注的三元组抽取☆11Sep 6, 2018Updated 7 years ago
- ☆220Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An original implementation of ACL 2019, "Multi-hop Reading Comprehension through Question Decomposition and Rescoring"☆138Apr 23, 2022Updated 3 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆66Oct 8, 2022Updated 3 years ago
- ☆18May 5, 2021Updated 4 years ago
- ☆11Jul 31, 2018Updated 7 years ago
- ☆21Aug 22, 2020Updated 5 years ago
- High frequency prediction of Chinese stock returns. Orderbook data generation. High frequency factors construction.☆18Mar 10, 2023Updated 3 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆35Aug 19, 2021Updated 4 years ago