qsyao / cudaBERTView external linksLinks
A Fast Muti-processing BERT-Inference System
☆102Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for cudaBERT
Users that are interested in cudaBERT are comparing it to the libraries listed below
Sorting:
- ☆24May 9, 2025Updated 9 months ago
- Analysis for the traces from byteprofile☆32Nov 21, 2023Updated 2 years ago
- Keras implement of Lazy optimizer☆21Nov 24, 2019Updated 6 years ago
- 夸夸机器人☆20Dec 23, 2021Updated 4 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- 本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。☆11May 30, 2023Updated 2 years ago
- My solutions to the assignments of dlsys course (CSE599G1: Deep Learning System Spring 2017)☆10Jul 1, 2017Updated 8 years ago
- Conversational Word Embedding for Retrieval-based Dialog System (ACL2020)☆30Sep 2, 2020Updated 5 years ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,542Jul 18, 2025Updated 6 months ago
- A synthesis flow for hybrid processing-in-RRAM modes☆12Jul 15, 2021Updated 4 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆44Oct 16, 2025Updated 3 months ago
- A transformer model that should be able to solve a simple NER task☆11Mar 7, 2019Updated 6 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- a beautiful method for cluster or community detection☆52Oct 19, 2019Updated 6 years ago
- Fast CUDA Kernels for ResNet Inference.☆182May 26, 2019Updated 6 years ago
- huggingface ChineseBert Tokenizer☆16Apr 16, 2022Updated 3 years ago
- A tensorflow implementation of self attentive graph embedding (SAGE) in WWW 2019☆15Jun 5, 2019Updated 6 years ago
- AutodiffEngine☆13Apr 1, 2019Updated 6 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Oct 6, 2022Updated 3 years ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆34Jan 20, 2026Updated 3 weeks ago
- python-2.5-annotated☆13Jan 7, 2015Updated 11 years ago
- ☆18Oct 15, 2020Updated 5 years ago
- ☆13Dec 14, 2021Updated 4 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆66Mar 30, 2020Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- ddl-benchmarks: Benchmarks for Distributed Deep Learning☆36May 29, 2020Updated 5 years ago
- A prototype version of our submitted paper: Conversion Prediction Using Multi-task Conditional Attention Networks to Support the Creation…☆21May 16, 2019Updated 6 years ago
- ESIM for Multi-turn Response Selection Task☆584Sep 9, 2019Updated 6 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆95Dec 5, 2019Updated 6 years ago
- RAdam optimizer for keras☆71Oct 14, 2019Updated 6 years ago
- image to column☆30Jul 15, 2014Updated 11 years ago
- 人人都能看懂的轻量级解决方案☆16Jul 10, 2020Updated 5 years ago
- 简单的抓取P大校园卡消费记录的习作☆15Apr 23, 2022Updated 3 years ago
- Entity Linking,识别给定文本中出现的命名实体(Named Entity),并映射到特定的知识库中唯一的实体。包括命名实体识别、消歧等工作。☆72Aug 17, 2019Updated 6 years ago
- 中文空间语义理解评测☆39Aug 10, 2022Updated 3 years ago
- Running BERT without Padding☆480Mar 18, 2022Updated 3 years ago
- Tensorflow implementation of DCN for question answering on the Stanford Question Answering Dataset (SQuAD)☆13Dec 1, 2017Updated 8 years ago
- ☆17May 14, 2020Updated 5 years ago
- Sequence-to-Sequence Generative Model for Sequential Recommender System☆18Mar 25, 2024Updated last year