PyTorch implementation of Lessons on Parameter Sharing across Layers in Transformers
☆26 May 19, 2021 Updated 4 years ago
Alternatives and similar repositories for param-share-transformer
Users that are interested in param-share-transformer are comparing it to the libraries listed below.
- ☆28 Nov 25, 2021 Updated 4 years ago
- ☆21 May 30, 2022 Updated 3 years ago
- ☆13 Jul 26, 2021 Updated 4 years ago
- Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation> ☆41 Nov 10, 2020 Updated 5 years ago
- Boltzmann machine learning for Potts models of biological data ☆13 Jul 24, 2024 Updated last year
- Framework for systematic discovery of novel complexes and differential analysis of cofractionation MS datasets ☆14 Nov 21, 2025 Updated 4 months ago
- ☆15 Sep 13, 2022 Updated 3 years ago
- Official implementation of the paper "SAINT" ☆17 Mar 12, 2022 Updated 4 years ago
- ☆31 Apr 27, 2022 Updated 3 years ago
- Chinese AllenNLP tutorial (continuously updated) ☆14 Jun 3, 2019 Updated 6 years ago
- The standalone version of SPOT-1D-Single available for public use for research purposes. ☆24 May 19, 2024 Updated last year
- ☆12 Sep 16, 2024 Updated last year
- Feature Decay Algorithms ☆11 Mar 5, 2014 Updated 12 years ago
- Chat with your data while uploading a pdf file and using a local LLM. ☆11 Mar 19, 2024 Updated 2 years ago
- Example of bazel python cpp binding ☆10 May 27, 2023 Updated 2 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings. ☆12 Nov 9, 2019 Updated 6 years ago
- ☆10 Jan 11, 2023 Updated 3 years ago
- ☆13 Oct 4, 2022 Updated 3 years ago
- Some common data processing scripts ☆19 Dec 14, 2019 Updated 6 years ago
- ☆33 Sep 19, 2025 Updated 6 months ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel… ☆16 Dec 10, 2022 Updated 3 years ago
- Repository with illustrations for cft-contest-2018 ☆12 Nov 22, 2018 Updated 7 years ago
- ☆10 Mar 29, 2021 Updated 5 years ago
- AMR-parser. Code for EMNLP2019 paper "Core Semantic First: A Top-down Approach for AMR Parsing." ☆11 Feb 23, 2020 Updated 6 years ago
- Unofficial implementation of Adaptive Input in PyTorch ☆12 Feb 22, 2019 Updated 7 years ago
- An Interpretable Self-Attention Network with block-attention and attention-attribution. ☆12 Sep 22, 2023 Updated 2 years ago
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B… ☆11 Jun 8, 2021 Updated 4 years ago
- [NeurIPS2022] Where to Pay Attention in Sparse Training for Feature Selection? ☆12 Feb 10, 2023 Updated 3 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi… ☆14 May 15, 2022 Updated 3 years ago
- A powerful text cleaner for Japanese web texts ☆12 Jan 20, 2024 Updated 2 years ago
- ☆18 Jun 5, 2024 Updated last year
- Classify image and text with ResNet and BERT models using Pytorch ☆13 Jul 7, 2020 Updated 5 years ago
- Speech Recognition Scoring Toolkit ☆13 Sep 30, 2015 Updated 10 years ago
- Datasets for Drug Discovery and Development ☆10 Aug 22, 2020 Updated 5 years ago
- ☆16 Apr 28, 2022 Updated 3 years ago
- Trained 50 epochs ☆12 Aug 22, 2023 Updated 2 years ago
- Chromosome Scale Assembler: A high-throughput chromosome scale genome assembly pipeline for vertebrate genomes ☆10 Oct 16, 2024 Updated last year
- ☆40 Apr 21, 2025 Updated 11 months ago
- Python code for training models in the ACL paper, "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity". ☆53 Dec 20, 2019 Updated 6 years ago