FLASHQuad_pytorch
☆68Apr 1, 2022Updated 4 years ago
Alternatives and similar repositories for FLASHQuad_pytorch
Users that are interested in FLASHQuad_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆372Sep 26, 2023Updated 2 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- kaggle情感分析rnn+attention解法☆12Nov 17, 2017Updated 8 years ago
- 更纯粹、更高压缩率的Tokenizer in Rust☆13Dec 21, 2024Updated last year
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GAU-alpha-pytorch☆20May 11, 2022Updated 3 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago
- RoFormer升级版☆155Aug 11, 2022Updated 3 years ago
- pytorch版unilm模型☆26Jun 19, 2021Updated 4 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆89Nov 4, 2021Updated 4 years ago
- ☆11Jul 5, 2020Updated 5 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Feb 24, 2023Updated 3 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆59Apr 20, 2024Updated last year
- tf&torch about nlp☆11Aug 12, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- RoFormer V1 & V2 pytorch☆524May 18, 2022Updated 3 years ago
- ☆21Nov 14, 2022Updated 3 years ago
- 基于“Seq2Seq+前缀树”的知识图谱问答☆70Dec 17, 2021Updated 4 years ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 7 months ago
- 端到端的长本文摘要模型(法研杯2020司法摘要赛道)☆399May 31, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GPLinker_pytorch☆88May 10, 2022Updated 3 years ago
- WoBERT_pytorch☆40Apr 18, 2021Updated 4 years ago
- Code for the paper "Scheduled Sampling for Transformers"☆29Jan 13, 2020Updated 6 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated 2 years ago
- Codebase, data and models for the Re-Thinking the Shuffle Test paper at ACL2021☆10Oct 14, 2022Updated 3 years ago
- ☆18May 28, 2021Updated 4 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- ☆31Jul 2, 2023Updated 2 years ago
- 京东JDATA2019-用户对品类下店铺的购买预测☆17Jun 10, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- 🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.☆17Oct 6, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152May 10, 2023Updated 2 years ago
- This is a community implementation of EEG-ConvTransformer☆17May 19, 2023Updated 2 years ago
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.☆10Feb 14, 2024Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆230Oct 12, 2022Updated 3 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago