victorchen96 / Draw-Paper-Plot-Using-SeabornLinks
some examples for drawing illustration plots for paper using seaborn package
☆15Updated 5 years ago
Alternatives and similar repositories for Draw-Paper-Plot-Using-Seaborn
Users that are interested in Draw-Paper-Plot-Using-Seaborn are comparing it to the libraries listed below
Sorting:
- Mixture of Attention Heads☆48Updated 2 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆86Updated 2 years ago
- code for Explicit Sparse Transformer☆62Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆45Updated 9 months ago
- ☆31Updated last year
- Crawl & visualize ICLR papers and reviews.☆18Updated 2 years ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- ☆12Updated 2 years ago
- Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.☆57Updated last year
- ☆33Updated 4 years ago
- ☆21Updated 2 years ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆38Updated last year
- Code for paper Fully Hyperbolic Neural Networks☆79Updated 2 years ago
- ☆45Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- Code for NeurIPS 2019 paper "Screening Sinkhorn Algorithm for Regularized Optimal Transport"☆10Updated 5 years ago
- Crawl & visualize ICLR papers and reviews☆110Updated 2 years ago
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"☆56Updated last year
- This package implements THOR: Transformer with Stochastic Experts.☆65Updated 3 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆60Updated 5 years ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated last year
- ☆29Updated 3 years ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆27Updated last year
- Source code for "Distilling Knowledge From Graph Convolutional Networks", CVPR'20☆58Updated 2 years ago
- Spectral Graph Attention Network with Fast Eigen-approximation☆12Updated 3 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)☆35Updated last year
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Updated last year