Welcome to my Transformers tutorial series! In this series, I'll be diving into the powerful Transformer architecture and its implementation in TensorFlow and PyTorch. Whether you're an experienced NLP practitioner or just starting out, I hope you'll find the series informative and engaging.
☆10May 3, 2023Updated 2 years ago
Alternatives and similar repositories for Transformers-Tutorial
Users that are interested in Transformers-Tutorial are comparing it to the libraries listed below.
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- Sarcasm is a term that refers to the use of words to mock, irritate, or amuse someone. It is commonly used on social media. The metaphori…☆17May 24, 2023Updated 2 years ago
- This is an implementation of the Gossip Protocol in C++.☆18Aug 10, 2021Updated 4 years ago
- ☆12Jun 7, 2023Updated 2 years ago
- ☆18Sep 26, 2020Updated 5 years ago
- Learn Japanese using music. Frontend written in Nuxt.js and optional backend using Litserve☆20Jun 2, 2025Updated 10 months ago
- python package for calculating famous measures in computational linguistics☆15Nov 5, 2024Updated last year
- ☆19Nov 28, 2022Updated 3 years ago
- ☆11Dec 1, 2020Updated 5 years ago
- Beyond Words: A Multimodal Exploration of Persuasion in Memes☆12Jun 8, 2024Updated last year
- ☆19Feb 20, 2022Updated 4 years ago
- Reproducible analyses for the NicheCompass manuscript☆14Jul 3, 2025Updated 9 months ago
- ☆16May 6, 2025Updated 11 months ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 2 years ago
- The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…☆11Nov 28, 2023Updated 2 years ago
- CAT-Walk is an inductive method that learns hyperedge representations via a novel higher-order random walk, SetWalk.☆20Oct 16, 2023Updated 2 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆11Aug 26, 2024Updated last year
- Have an LLM write your biography, probably incorrectly☆14Dec 26, 2024Updated last year
- Library JPEG Quant Smooth☆12Aug 18, 2023Updated 2 years ago
- Shell Written in C☆11Nov 5, 2024Updated last year
- ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization☆11Oct 3, 2021Updated 4 years ago
- Code for the "Hiding Data In Sound" video☆12Jan 19, 2023Updated 3 years ago
- ☆19Jan 27, 2023Updated 3 years ago
- A Qwen 0.5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Vincent Kit For Arduino☆16Dec 27, 2024Updated last year
- A latent flow-based diffusion model trained on the 2012 ImageNet dataset from scratch.☆25May 21, 2025Updated 11 months ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]☆19Jul 3, 2025Updated 9 months ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21May 16, 2023Updated 2 years ago
- Pattern recognition methods for feature extraction from ECG (open source version)☆26Aug 4, 2025Updated 8 months ago
- ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation☆40Feb 21, 2026Updated 2 months ago
- Contrastive Regularizer☆15Feb 15, 2024Updated 2 years ago
- A Toolkit to Benchmark Methods for Ring-Based Cardiovascular Metrics☆26Sep 23, 2025Updated 7 months ago
- ☆17Feb 9, 2024Updated 2 years ago
- Code of our NeurIPS 2020 publication 'Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence'☆24Oct 22, 2020Updated 5 years ago
- Probing and Generalization of Metaphorical Knowledge in Pre-Trained Language Models [ACL 2022]☆23May 15, 2022Updated 3 years ago
- Tyee: A Unified, Modular, and Fully-Integrated Configurable Toolkit for Intelligent Physiological Health Care☆20Mar 1, 2026Updated last month
- [NeurIPS 2024] The repository for experiment codes for the paper: Scaling Law for Time Series Forecasting.☆20Oct 23, 2024Updated last year
- Code for CVPR 2023 Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization☆13Mar 27, 2023Updated 3 years ago