实现了Transformer中的几种位置编码方案
☆44Oct 6, 2021Updated 4 years ago
Alternatives and similar repositories for tdlm
Users that are interested in tdlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- 北京邮电大学网络工程嵌入式系统实验报告☆12Jan 7, 2021Updated 5 years ago
- 使用Decoder-only的Transformer进行时序预测,包含SwiGLU和RoPE(Rotary Positional Embedding),Time series prediction using Decoder-only Transformer, Includ…☆16Jan 25, 2024Updated 2 years ago
- ☆10May 4, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- BUPT神经网络与深度学习课设☆10Dec 29, 2023Updated 2 years ago
- 使用指令微调对大模型进行微调。☆11Jun 28, 2023Updated 2 years ago
- ☆13Sep 14, 2021Updated 4 years ago
- 一个基于transformers的自定义命名实体识别模型示例☆17Jul 4, 2021Updated 4 years ago
- BUPT Software Engineering Project☆18Aug 20, 2018Updated 7 years ago
- ☆10May 10, 2019Updated 6 years ago
- A Few-Shot Learning based Approach to Multimodal Social Relation Extraction☆14Jan 17, 2023Updated 3 years ago
- springboot vue blog 博客☆13Jan 7, 2024Updated 2 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆19May 19, 2025Updated 10 months ago
- AI Challenger Image Caption Competition☆10Dec 13, 2017Updated 8 years ago
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆16Mar 21, 2022Updated 4 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆17Nov 18, 2025Updated 4 months ago
- Chinese entity relation extract☆14Apr 26, 2024Updated last year
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆11Jan 29, 2019Updated 7 years ago
- Rust 官方周报(简体中文版)☆15Jul 15, 2021Updated 4 years ago
- [ICASSP 2025 Oral] The official implementation of paper "TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfe…☆16Mar 13, 2025Updated last year
- [Unofficial] Predict code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification☆56Sep 5, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- BUPT智能计算系统☆21Jan 2, 2024Updated 2 years ago
- A PyTorch-based toolkit for natural language processing☆160Mar 10, 2023Updated 3 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network☆11Jun 28, 2021Updated 4 years ago
- 基于Pytorch实现的中文文本分类脚手架,以及常用模型对比。☆18Apr 23, 2021Updated 4 years ago
- 具备完整聊天室实现的提交☆17Dec 29, 2022Updated 3 years ago
- this is a high performance cuda porting of cbow model of word2vec☆17Sep 14, 2014Updated 11 years ago
- Notes about courses Machine Learning 2025 Spring by Hung-yi Lee☆26Sep 22, 2025Updated 6 months ago
- Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)☆21Nov 10, 2025Updated 4 months ago
- Pytorch implementation of "Self-boosted Time-series Forecasting with Multi-task and Multi-view Learning" https://arxiv.org/pdf/1909.08181…☆30Oct 30, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- (ICML 2024) PyTorch implementation of "Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes"☆16Oct 15, 2024Updated last year
- ☆20Nov 18, 2024Updated last year
- ☆24Nov 28, 2024Updated last year
- 基于BERT-MRC(阅读理解)的命名实体识别模型☆20Mar 15, 2022Updated 4 years ago
- 文本分类 法研杯 textcnn rcnn capsule attention☆23Feb 1, 2021Updated 5 years ago
- Residual Swin Transformer Channel Attention Network for Image Demosaicing☆20Jul 18, 2022Updated 3 years ago