Zefan-Cai / PyramidKV

The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

☆480

Related projects: ⓘ

jordddan / Pruning-LLMs
The framework to prune LLMs to any size and any config.
☆96Updated 6 months ago
gersteinlab / ML-Bench
The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…
☆350Updated last week
longyuewangdcu / GuoFeng-Webnovel
Multilingual Corpus of Web Fiction
☆211Updated 2 months ago
smartyfh / LLM-Uncertainty-Bench
Benchmarking LLMs via Uncertainty Quantification
☆206Updated 7 months ago
Elfsong / Mercury
Code Efficiency Benchmark
☆81Updated last month
NexaAI / Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
☆613Updated this week
YUCHEN005 / GenTranslate
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
☆192Updated last month
guanchuwang / redis-bench
☆366Updated 3 weeks ago
Infini-AI-Lab / TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
☆203Updated 2 weeks ago
gordonhu608 / MQT-LLaVA
Matryoshka Query Transformer for Large Vision-Language Models
☆88Updated 2 months ago
wei-potato / Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
☆145Updated 2 months ago
yileijin / Bootstrap-3D-GS
☆353Updated last month
YangLinyi / GLUE-X
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …
☆115Updated last year
IAAR-Shanghai / UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA
☆176Updated 2 weeks ago
yuanze-lin / REVIVE
[NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
☆132Updated this week
mlpc-ucsd / BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
☆260Updated 5 months ago
dongxuyue / Open-ReplaceAnything
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
☆526Updated 3 months ago
Windsander / ADI-Stable-Diffusion
Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms…
☆610Updated last month
Ledzy / BAdam
☆189Updated 2 months ago
UniModal4Reasoning / DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆92Updated last week
MingXiangL / DEVIL
Evaluating dynamics capability of T2V generation models with DEVIL protocols.
☆321Updated 3 weeks ago
PKU-YuanGroup / Machine-Mindset
An MBTI Exploration of Large Language Models
☆448Updated 7 months ago
ZrrSkywalker / MathVerse
[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
☆133Updated 2 weeks ago
PeiranLi0930 / L-SVD
Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition
☆407Updated last month
uclaml / SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
☆461Updated last month
yewentao256 / TinyNN
动手构建一个完整的神经网络; Hands-on construction of a complete neural network
☆13Updated last year
zou-group / avatar
AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval (https://arxiv.org/abs/2406.11200)
☆140Updated last month
HITsz-TMG / UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
☆754Updated last week
ShareGPT4Omni / ShareGPT4Video
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
☆1,220Updated last month
om-ai-lab / OmDet
Real-time and accurate open-vocabulary end-to-end object detection
☆1,483Updated 2 weeks ago