Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58 · Jul 28, 2022 · Updated 3 years ago
Alternatives and similar repositories for t5-flax-gcp
Users who are interested in t5-flax-gcp are comparing it to the libraries listed below.
- Software for transferring pre-trained English models to foreign languages ☆19 · Mar 20, 2023 · Updated 3 years ago
- ☆11 · Apr 11, 2023 · Updated 3 years ago
- Personal implementation of the Transformer paper. ☆22 · Oct 17, 2023 · Updated 2 years ago
- ☆184 · May 26, 2023 · Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆75 · Apr 10, 2023 · Updated 3 years ago
- MulinforCPI: enhancing precision of compound-protein interaction prediction through novel perspectives on multi-level information integra… ☆10 · Jun 20, 2024 · Updated last year
- baikal.ai's pre-trained BERT models: descriptions and sample codes ☆12 · Jun 24, 2021 · Updated 4 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval ☆30 · Oct 24, 2023 · Updated 2 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper. ☆32 · Sep 26, 2023 · Updated 2 years ago
- PathPiece tokenizer ☆14 · Nov 10, 2024 · Updated last year
- Meets every Thursday at 20:00 ☆16 · Jul 24, 2020 · Updated 5 years ago
- ☆18 · Feb 25, 2025 · Updated last year
- ☆11 · Aug 12, 2020 · Updated 5 years ago
- Choseong (initial-consonant) interpreter based on ko-BART ☆29 · Mar 31, 2021 · Updated 5 years ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- ☆11 · Oct 3, 2021 · Updated 4 years ago
- Korean text sentiment analysis using Naver movie review data ☆12 · Aug 22, 2018 · Updated 7 years ago
- KommonGen, a dataset for commonsense reasoning in Korean generative models ☆17 · Oct 5, 2021 · Updated 4 years ago
- CMU Linguistic Annotation Backend ☆15 · Sep 22, 2025 · Updated 6 months ago
- ☆20 · Sep 28, 2021 · Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 · May 20, 2022 · Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning ☆42 · May 5, 2021 · Updated 4 years ago
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ☆15 · Sep 29, 2021 · Updated 4 years ago
- Utilizing RBERT model structure for KLUE Relation Extraction task ☆15 · Nov 15, 2022 · Updated 3 years ago
- Korean Named Entity Corpus ☆25 · May 12, 2023 · Updated 2 years ago
- ☆14 · Sep 10, 2021 · Updated 4 years ago
- ☆10 · Jun 8, 2024 · Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Apr 30, 2023 · Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆51 · Jan 20, 2024 · Updated 2 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24 ☆14 · Mar 2, 2024 · Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 · Aug 13, 2020 · Updated 5 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022) ☆19 · May 17, 2022 · Updated 3 years ago
- Korean Speech to English Translation Corpus ☆45 · Sep 3, 2021 · Updated 4 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" ☆13 · Dec 14, 2021 · Updated 4 years ago
- ☆13 · Dec 17, 2021 · Updated 4 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 · Jun 12, 2023 · Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆62 · Jan 22, 2022 · Updated 4 years ago