FLASHQuad_pytorch
☆68Apr 1, 2022Updated 3 years ago
Alternatives and similar repositories for FLASHQuad_pytorch
Users that are interested in FLASHQuad_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Bytepiece Tokenizer Implemented in Rust.☆14Nov 28, 2023Updated 2 years ago
- kaggle情感分析rnn+attention解法☆12Nov 17, 2017Updated 8 years ago
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 3 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- RoFormer升级版☆155Aug 11, 2022Updated 3 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- pytorch版unilm模型☆27Jun 19, 2021Updated 4 years ago
- ☆11Jul 5, 2020Updated 5 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Feb 24, 2023Updated 3 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆59Apr 20, 2024Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- RoFormer V1 & V2 pytorch☆522May 18, 2022Updated 3 years ago
- ☆21Nov 14, 2022Updated 3 years ago
- 端到端的长本文摘要模型(法研杯2020司法摘要赛道)☆398May 31, 2024Updated last year
- GPLinker_pytorch☆88May 10, 2022Updated 3 years ago
- WoBERT_pytorch☆40Apr 18, 2021Updated 4 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated 2 years ago
- Codebase, data and models for the Re-Thinking the Shuffle Test paper at ACL2021☆10Oct 14, 2022Updated 3 years ago
- ☆18May 28, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- ☆31Jul 2, 2023Updated 2 years ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- 🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.☆17Oct 6, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152May 10, 2023Updated 2 years ago
- This repository contains the corpora and supplementary data, along with instructions for recreating the experiments, for our paper: "End-…☆90Feb 14, 2020Updated 6 years ago
- Repository for SPECTRA: Sparse Structured Text Rationalization, accepted at EMNLP 2021 main conference.☆10Feb 14, 2024Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆230Oct 12, 2022Updated 3 years ago
- ☆22Oct 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- DMN+ 模型的PyTorch 实现(中文数据集)☆19Feb 20, 2019Updated 7 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 4 years ago
- ☆12Apr 29, 2024Updated last year
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- code for Explicit Sparse Transformer☆61Jul 21, 2023Updated 2 years ago
- 中文谐音词/字库(同音词/字)Chinese Homophones☆117Nov 21, 2019Updated 6 years ago