This is the official implementation of our ACL 2025 Main paper "Balancing Diversity and Risk in LLM Sampling".
☆17Oct 16, 2025Updated 5 months ago
Alternatives and similar repositories for Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs
Users that are interested in Benchmarking-and-Guiding-Adaptive-Sampling-Decoding-for-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning""☆22Feb 9, 2026Updated last month
- This is the official repository of our NeurIPS 2025 paper "MaxSup: Overcoming Representation Collapse in Label Smoothing"☆22Nov 6, 2025Updated 4 months ago
- The official repository for CosPGD: a unified white-box adversarial attack for pixel-wise prediction tasks.☆15May 8, 2025Updated 10 months ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆15Apr 8, 2024Updated last year
- Our code for ICLR'24 paper "Energy-based Automated Model Evaluation".☆23Feb 13, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is the official implementation of our BMVC 2022 paper "SP-ViT: Learning 2D Spatial Priors for Vision Transformers"☆13Mar 27, 2023Updated 2 years ago
- This repo contains the data used in "Towards Understanding Climate Change Perceptions: A Social Media Dataset"☆15Aug 30, 2024Updated last year
- GitHub repository of the ICLR 2023 paper "Neural Architecture Design and Robustness: A Dataset"☆16Jan 25, 2023Updated 3 years ago
- Official repository for our paper Robust Models are less Over-Confident☆20Mar 12, 2025Updated last year
- Code for "Learning Where To Look – Generative NAS is Surprisingly Efficient"☆15Aug 1, 2022Updated 3 years ago
- This is the official implementation of our CVPR 2024 paper "BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition"☆131Jul 25, 2024Updated last year
- Code for FrequencyLowCut Pooling (FLC pooling)☆20Apr 22, 2025Updated 11 months ago
- [ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.☆19Jun 2, 2024Updated last year
- Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation".☆27Jan 25, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of "Intra-Source Style Augmentation for Improved Domain Generalization" (WACV 2023 & IJCV)☆36Jan 11, 2024Updated 2 years ago
- Code accompanying the AAAI 2021 paper "Spectral Distribution Aware Image Generation".☆24Jan 1, 2021Updated 5 years ago
- Benchmarking Optimizers for LLM Pretraining☆56Dec 30, 2025Updated 2 months ago
- Implementation of the paper "Understanding anomaly detection with deep invertible networks through hierarchies of distributions and featu…☆42Nov 24, 2020Updated 5 years ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆48Aug 13, 2025Updated 7 months ago
- ☆10May 28, 2023Updated 2 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- ☆17Jul 6, 2023Updated 2 years ago
- Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".☆121Feb 7, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆33Nov 13, 2025Updated 4 months ago
- Unsupervised deep learning framework with online(MLP: prediction-based, 1 D Conv and VAE: reconstruction-based, Wavenet: prediction-based…☆129Dec 2, 2022Updated 3 years ago
- Sets of Image Provenance cases, including node and edge information, generated automatically using Reddit Photoshop Battles☆14Jul 26, 2018Updated 7 years ago
- Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"☆12Mar 12, 2025Updated last year
- Multi-head Recurrent Layer Attention for Vision Network☆22Mar 2, 2023Updated 3 years ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆14Feb 2, 2023Updated 3 years ago
- Code for "Context-Aware Recurrent Encoder for Neural Machine Translation" (TASLP 2017)☆12Oct 29, 2018Updated 7 years ago
- ☆12Apr 16, 2024Updated last year
- Configurable multithreaded 3D cave generation for Minecraft based on simplex noise☆13Jul 22, 2016Updated 9 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Aether is a sleek, minimalist portfolio theme designed for Astro.js, perfect for those who value simplicity and speed. With a focus on cl…☆10Oct 29, 2024Updated last year
- Deep Networks with Recurrent Layer Aggregation☆28Nov 10, 2021Updated 4 years ago
- Stock sentiment analyzer in Python☆12May 2, 2021Updated 4 years ago
- A python implementation of the paper "Scalable Recognition with a Vocabulary Tree, D. Nister, H. Stewenius, 2006"☆17May 4, 2023Updated 2 years ago
- Influence-Based Mini-Batching (IBMB), as proposed in "Influence-Based Mini-Batching for Graph Neural Networks" (LoG 2022)☆19Dec 21, 2022Updated 3 years ago
- This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."☆116Oct 7, 2023Updated 2 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆24Jan 12, 2025Updated last year