The code and data for the GPT-4-based benchmark in the Vicuna blog post
☆43Aug 2, 2023Updated 2 years ago
Alternatives and similar repositories for vicuna-blog-eval
Users that are interested in vicuna-blog-eval are comparing it to the libraries listed below.
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 10 months ago
- Official repository for LongChat and LongEval☆534May 24, 2024Updated last year
- ☆46Mar 24, 2026Updated 3 weeks ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆39Sep 24, 2024Updated last year
- ☆37May 11, 2024Updated last year
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- ☆13Feb 2, 2021Updated 5 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 7 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆82Apr 12, 2024Updated 2 years ago
- Pytorch implementation of Class Balanced Loss based on Effective number of Samples☆12May 18, 2023Updated 2 years ago
- torch_quantizer is an out-of-the-box quantization tool for PyTorch models on the CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆15Nov 14, 2024Updated last year
- (ACL '25 - Oral) FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models☆32Oct 4, 2025Updated 6 months ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- [UAI 2023] Improvable Gap Balancing for Multi-Task Learning☆16May 29, 2023Updated 2 years ago
- ☆235Jun 11, 2024Updated last year
- The official implementation of the EMNLP 2023 paper LLM-FP4☆222Dec 15, 2023Updated 2 years ago
- ☆19Jun 3, 2023Updated 2 years ago
- ☆25Oct 16, 2024Updated last year
- ☆10Dec 18, 2023Updated 2 years ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆42Mar 13, 2023Updated 3 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Chrome Extension. As the name suggests.☆10Jan 30, 2022Updated 4 years ago
- Basic Artificial Intelligence Theory☆10Mar 11, 2025Updated last year
- yet another anki app☆14Sep 9, 2024Updated last year
- 2023-1 Korea University AIKU Deep Learning Vacation Bootcamp: Deep into Deep☆10Jul 10, 2023Updated 2 years ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- UI for ActivityWatch. Include category editor and viewer for multiple categorizations.☆10Jan 31, 2024Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- [NeurIPS 24] Official Implementation (Pytorch) of "Inversion-based Latent Bayesian Optimization"☆10Nov 15, 2024Updated last year
- ☆81Jul 21, 2022Updated 3 years ago
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- ☆13May 21, 2024Updated last year
- Typing + Paste == Typaste.☆14Jan 15, 2026Updated 3 months ago
- Unofficial implementation of YOTO (You Only Train Once) applied to Class balanced loss☆23May 3, 2020Updated 5 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆21Aug 1, 2025Updated 8 months ago