SihyeongPark / Awesome-LLM-Benchmark
Awesome-LLM-Benchmark: List of benchmarks for Large-Language Models
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Awesome-LLM-Benchmark
- A testbed for agents and environments that can automatically improve models through data generation.☆12Updated this week
- Directed masked autoencoders☆14Updated last year
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆12Updated 2 months ago
- ☆13Updated last year
- Describe the format of image/text datasets☆11Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Lottery Ticket Adaptation☆36Updated last month
- ☆9Updated 11 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated last month
- Minimum Description Length probing for neural network representations☆16Updated last week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆36Updated 7 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- The tool to read/get/extract and write/change/modify BIOS/UEFI settings from Linux terminal.☆6Updated last year
- ☆21Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated 8 months ago
- ☆18Updated 8 months ago
- ☆16Updated last week
- SCREWS: A Modular Framework for Reasoning with Revisions☆26Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆14Updated 8 months ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated 10 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆30Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated this week
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆17Updated 8 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated 11 months ago
- Building a predictive model for the popularity of an unreleased hip hop track on Spotify☆11Updated 7 months ago
- ☆12Updated 3 weeks ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated this week