Explore visualization tools for understanding Transformer-based large language models (LLMs)
☆23Dec 1, 2024Updated last year
Alternatives and similar repositories for Awesome-Transformer-Visualization
Users that are interested in Awesome-Transformer-Visualization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 25, 2023Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- PGRAG☆53Jul 16, 2024Updated last year
- ☆14May 7, 2024Updated 2 years ago
- ☆33Nov 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆404Mar 2, 2025Updated last year
- Programming Languages I Lecture Notes☆12Apr 29, 2026Updated 2 weeks ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆174Dec 7, 2024Updated last year
- This repository provides a 3D implementation of DINOv2 for self-supervised pretraining on volumetric (3D) medical images using Lightly, M…☆53Apr 24, 2026Updated 3 weeks ago
- ☆15Apr 13, 2026Updated last month
- A set of kernel-based (Un)conditional independence tests including SDCIT (Lee and Honavar, UAI 2017)☆16Feb 6, 2020Updated 6 years ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆101Feb 10, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated 2 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- ☆16Apr 30, 2025Updated last year
- ☆17May 31, 2024Updated last year
- ☆18Jul 25, 2025Updated 9 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆39Feb 9, 2026Updated 3 months ago
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆19Jan 2, 2026Updated 4 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆147Nov 13, 2025Updated 6 months ago
- CIKM 2021: Pooling Architecture Search for Graph Classification☆21Jul 19, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Apr 14, 2026Updated last month
- ☆14Jul 24, 2023Updated 2 years ago
- a fast async pool based on channel☆26Apr 22, 2026Updated 3 weeks ago
- Materials for SDM 2023 tutorial: Augmentation Methods for Graph Learning☆21Apr 28, 2023Updated 3 years ago
- ☆96Jun 5, 2024Updated last year
- Code for replicating experiments from the paper, Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes, publi…☆13Jun 22, 2023Updated 2 years ago
- Yet Another Introduction to Quantum Computing☆14Oct 27, 2025Updated 6 months ago
- A 22.9 million carbon atom dataset☆16Mar 7, 2023Updated 3 years ago
- This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…☆17Nov 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch implementation of "Online Hyperparameter Optimization for Class-Incremental Learning" (AAAI 2023 Oral)☆17Jun 30, 2023Updated 2 years ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆57Nov 26, 2024Updated last year
- Topological Data Analysis (TDA) for Natural Language Processing (NLP) Applications☆25Apr 27, 2026Updated 3 weeks ago
- A Unix shell written in Java☆16Aug 30, 2016Updated 9 years ago
- HUSTPass auth library for Node.js. Node.js 华科统一身份认证库。☆24Oct 2, 2023Updated 2 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Python library to compress LitGPT models for resource efficient inference.☆16May 6, 2026Updated last week