☆28 · Nov 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for share_layer_params
Users that are interested in share_layer_params are comparing it to the libraries listed below.
- Rethinking Perturbations in Encoder-Decoders for Fast Training ☆18 · Nov 25, 2021 · Updated 4 years ago
- ☆31 · Apr 27, 2022 · Updated 4 years ago
- Bias Tests for Voice Technologies (bt4vt) ☆11 · Jun 16, 2024 · Updated last year
- This repository contains my research work on building the state of the art next basket recommendations using techniques such as Autoencod… ☆11 · Mar 10, 2021 · Updated 5 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models ☆12 · Jul 1, 2023 · Updated 2 years ago
- Machine Translation Web Interface for OpenNMT-py ☆26 · Dec 24, 2021 · Updated 4 years ago
- ☆17 · Dec 9, 2022 · Updated 3 years ago
- DefSent: Sentence Embeddings using Definition Sentences ☆23 · Aug 5, 2021 · Updated 4 years ago
- ☆17 · Sep 13, 2022 · Updated 3 years ago
- Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation" ☆12 · May 8, 2023 · Updated 2 years ago
- A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus ☆10 · Jun 26, 2024 · Updated last year
- Poet: Product-oriented Video Captioner for E-commerce ☆12 · Sep 21, 2020 · Updated 5 years ago
- ☆15 · Sep 13, 2022 · Updated 3 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities? ☆12 · Dec 22, 2020 · Updated 5 years ago
- ☆10 · Mar 28, 2022 · Updated 4 years ago
- ☆10 · Jul 15, 2024 · Updated last year
- Topic supervised non-negative matrix factorization with sparse matrices ☆12 · Mar 24, 2020 · Updated 6 years ago
- Post-editing Datasets by Rakuten (PEDRa) ☆14 · Jun 23, 2021 · Updated 4 years ago
- BlackOut and Adaptive Softmax for language models by Chainer ☆11 · Oct 20, 2017 · Updated 8 years ago
- Python package to augment multilingual data ☆15 · Feb 15, 2023 · Updated 3 years ago
- The codebase of the paper "Learning Light-Weight Translation Models from Deep Transformer", accepted at the AAAI 2021 conference. ☆15 · Jan 25, 2021 · Updated 5 years ago
- Multilingual Compositional Wikidata Questions (MCWQ) ☆20 · Jun 12, 2023 · Updated 2 years ago
- Software Agents Pacman project, implementing AI for Pacman in Python and PDDL ☆12 · Oct 12, 2015 · Updated 10 years ago
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners" ☆21 · Jun 29, 2024 · Updated last year
- Script to get ACL Anthology ☆16 · Jan 2, 2025 · Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?" ☆16 · Feb 7, 2025 · Updated last year
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine… ☆12 · Aug 14, 2024 · Updated last year
- Code for PII detection and redaction in code datasets ☆15 · Jan 24, 2023 · Updated 3 years ago
- This repository contains a CBIR system that uses a Swin Transformer to extract image features. ☆17 · Aug 11, 2023 · Updated 2 years ago
- Chat with your data by uploading a PDF file and using a local LLM. ☆11 · Mar 19, 2024 · Updated 2 years ago
- Source code for Delving Deeper into the Decoder for Video Captioning ☆39 · Jun 1, 2021 · Updated 4 years ago
- docker for UTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert ☆14 · Mar 24, 2023 · Updated 3 years ago
- Example of bazel python cpp binding ☆10 · May 27, 2023 · Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 · Feb 17, 2023 · Updated 3 years ago
- Testing DeepSpeed integration in 🤗 Accelerate ☆11 · Jun 28, 2022 · Updated 3 years ago
- A pedagogical, functional-oriented deep learning library built on top of jax. ☆15 · Jul 19, 2021 · Updated 4 years ago
- Unsupervised spoken sentence embeddings ☆14 · Dec 14, 2022 · Updated 3 years ago
- pdf2audiobook ☆22 · Jun 22, 2020 · Updated 5 years ago
- ☆19 · Feb 15, 2023 · Updated 3 years ago