FLASHQuad_pytorch
☆68Apr 1, 2022Updated 4 years ago
Alternatives and similar repositories for FLASHQuad_pytorch
Users who are interested in FLASHQuad_pytorch are comparing it to the libraries listed below.
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆372Sep 26, 2023Updated 2 years ago
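- PyTorch loss functions, adapted from the 科学空间 (Scientific Spaces) articles "Mitigating class imbalance via mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"☆12Aug 22, 2021Updated 4 years ago
- RNN + attention solution for Kaggle sentiment analysis☆12Nov 17, 2017Updated 8 years ago
- A purer, higher-compression-ratio tokenizer in Rust☆14Dec 21, 2024Updated last year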
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 4 years ago
- Named entity recognition☆12Dec 21, 2020Updated 5 years ago
- Upgraded version of RoFormer☆154Aug 11, 2022Updated 3 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- PyTorch version of the UniLM model☆25Jun 19, 2021Updated 5 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆90Nov 4, 2021Updated 4 years ago
- ☆11Jul 5, 2020Updated 5 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆97Feb 24, 2023Updated 3 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆59Apr 20, 2024Updated 2 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- RoFormer V1 & V2 pytorch☆526May 18, 2022Updated 4 years ago
- ☆21Nov 14, 2022Updated 3 years ago
- Knowledge graph question answering based on "Seq2Seq + prefix tree"☆70Dec 17, 2021Updated 4 years ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 10 months ago
- End-to-end long-text summarization model (CAIL 2020 judicial summarization track)☆397May 31, 2024Updated 2 years ago
- GPLinker_pytorch☆88May 10, 2022Updated 4 years ago
- WoBERT_pytorch☆40Apr 18, 2021Updated 5 years ago
- Code for the paper "Scheduled Sampling for Transformers"☆29Jan 13, 2020Updated 6 years ago
- Codebase, data and models for the Re-Thinking the Shuffle Test paper at ACL2021☆10Oct 14, 2022Updated 3 years ago
- ☆18May 28, 2021Updated 5 years ago
- JD.com JDATA2019: predicting user purchases from shops within a product category☆17Jun 10, 2019Updated 7 years ago
- ☆31Jul 2, 2023Updated 3 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- 🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.☆17Oct 6, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152May 10, 2023Updated 3 years ago
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.☆10Feb 14, 2024Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆231Oct 12, 2022Updated 3 years ago
- ☆22Oct 22, 2024Updated last year
- PyTorch implementation of the DMN+ model (Chinese dataset)☆19Feb 20, 2019Updated 7 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 5 years ago
- ☆12Apr 29, 2024Updated 2 years ago