A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+
☆37Mar 10, 2021Updated 5 years ago
Alternatives and similar repositories for ttt
Users that are interested in ttt are comparing it to the libraries listed below.
- This repository demonstrates training T5 transformers using TensorFlow 2☆14Oct 1, 2020Updated 5 years ago
- Standalone pre-training recipe with JAX+Flax☆35Apr 3, 2023Updated 3 years ago
- Backtranslations of IMDB movie reviews for Data Augmentation Purposes☆10Apr 1, 2019Updated 7 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 3 years ago
- ☆10Oct 19, 2020Updated 5 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Nov 10, 2020Updated 5 years ago
- Tiny Basic - arduboy☆10Aug 19, 2020Updated 5 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆39Feb 23, 2023Updated 3 years ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆12Jun 19, 2023Updated 2 years ago
- playing with gpt4☆14Mar 17, 2023Updated 3 years ago
- This is the original MATLAB version of MKCFup☆10Jan 23, 2019Updated 7 years ago
- Text summarization with Python and transformers☆13Jun 17, 2023Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)☆331Jan 10, 2024Updated 2 years ago
- Program the Arduboy on the Arduboy☆11Jul 28, 2021Updated 4 years ago
- Disassembly of Dr. Mario NES game.☆11Mar 2, 2025Updated last year
- A software aircraft controller for RaspberryPi☆10Mar 12, 2018Updated 8 years ago
- Check out the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week☆28Jul 18, 2021Updated 4 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2021☆10Jul 11, 2021Updated 4 years ago
- Public helpers for huggingface.co. Now lives in https://github.com/huggingface/huggingface_hub☆13Jul 10, 2022Updated 3 years ago
- Keras implementation of `Decoupled Neural Interfaces using Synthetic Gradients`☆12Oct 19, 2018Updated 7 years ago
- Self-Critical Sequence Training for Image Captioning☆21May 27, 2017Updated 8 years ago
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- Frontend UI and Backend Server for Stable Diffusion models☆32Apr 23, 2026Updated last week
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆13Mar 27, 2020Updated 6 years ago
- LSTM text generation by word. Used to generate multiple sentence suggestions based on the input words or a sentence☆27Nov 10, 2020Updated 5 years ago
- Kaggle Two Sigma 2nd Prize Winning Code https://www.kaggle.com/c/two-sigma-financial-news☆14May 15, 2024Updated last year
- jiant-dev☆28Dec 17, 2020Updated 5 years ago
- A complete (but not yet "polished") translation of Mother 3.☆14Sep 8, 2023Updated 2 years ago
- ICLR 2019 paper: "textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR"☆25Dec 30, 2018Updated 7 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- EMNLP 2021: Detecting Speaker Personas from Conversational Texts☆13Nov 5, 2021Updated 4 years ago
- Easy & Pretrained SOTA Deep Learning for RNA strings☆12Apr 15, 2022Updated 4 years ago
- Annotated corpus and code for "Extracting COVID-19 Events from Twitter".☆44May 19, 2022Updated 3 years ago
- Prose for a painting source code☆12Oct 8, 2019Updated 6 years ago