HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
☆106Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for hetseq
Users that are interested in hetseq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- ☆13Jun 3, 2019Updated 6 years ago
- ☆34Jul 13, 2021Updated 4 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Code and data release for the paper "TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations"☆29Jul 31, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Entity linking evaluation and analysis tool☆26Apr 14, 2025Updated last year
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Nov 30, 2021Updated 4 years ago
- sequence tagging for NER for ULMFiT☆20Nov 4, 2020Updated 5 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- Search and download accepted papers from machine learning conferences☆34Apr 10, 2023Updated 3 years ago
- Code for the paper: "TSViz: Demystification of Deep Learning Models for Time-Series Analysis"☆13May 15, 2019Updated 6 years ago
- NeurIPS 2019 Paper Implementation☆12Nov 22, 2022Updated 3 years ago
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).☆12May 14, 2020Updated 5 years ago
- Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)☆32Jun 21, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Jun 23, 2022Updated 3 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆56Jul 21, 2021Updated 4 years ago
- A Chainer implementation of doc2vec☆10Nov 16, 2017Updated 8 years ago
- Script python que intenta separar en silabas palabras en español☆18Jan 8, 2017Updated 9 years ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆44Nov 4, 2022Updated 3 years ago
- ☆14Jul 5, 2021Updated 4 years ago
- Image augmentation library for Jax☆42Apr 9, 2024Updated 2 years ago
- A neurosymbolic T5 agent for playing text games, from the EACL 2023 paper "Behavior Cloned Transformers are Neurosymbolic Reasoners"☆20Feb 25, 2023Updated 3 years ago
- Code for scaling Transformers☆26Dec 2, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆30Feb 11, 2022Updated 4 years ago
- Distributed skorch on Ray Train☆59Sep 21, 2022Updated 3 years ago
- ☆11Dec 2, 2024Updated last year
- [ICIP 2021] PyTorch code for "The Mind's Eye: Visualizing Class-Agnostic Features of CNNs" for generation of kernel features.☆12Sep 12, 2021Updated 4 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Aug 13, 2020Updated 5 years ago
- Deep Learning and Natural Language Processing using PyTorch (O'Reilly AI - NYC, 2019)☆11Apr 16, 2019Updated 7 years ago
- ☆31Jul 14, 2020Updated 5 years ago
- Eliminate global state without the boilerplate!☆13Dec 18, 2018Updated 7 years ago
- Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation☆35Oct 27, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 2 months ago
- ☆11Jan 2, 2022Updated 4 years ago
- Official code of "AAA: Adaptive Aggregation of Arbitrary Online Trackers with Theoretical Performance Guarantee"☆11May 8, 2021Updated 4 years ago
- Structured Gradient Tree Boosting☆25Nov 6, 2018Updated 7 years ago
- This repository keep my research materials about Named Entity Recognition using Transfer Learning☆10Oct 15, 2020Updated 5 years ago
- ☆15Apr 21, 2025Updated 11 months ago
- PyTorch tool for training with bigger batch size on the GPU☆11Feb 26, 2021Updated 5 years ago