Repo for EmbedLLM: Learning Compact Representations of Large Language Models
☆29Sep 25, 2025Updated 6 months ago
Alternatives and similar repositories for EmbedLLM
Users that are interested in EmbedLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Jul 24, 2024Updated last year
- Code repo for efficient quantized MoE inference with mixture of low-rank compensators☆36Apr 14, 2025Updated last year
- US Neighborhood data in GeoJSON format from OpenSource Zillow Neighborhood Boundaries Shapefiles☆11Oct 27, 2016Updated 9 years ago
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆41May 1, 2025Updated 11 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆155Jun 13, 2024Updated last year
- Yunhao Cao's Opensourced Study Notes☆14Aug 19, 2024Updated last year
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆52Apr 6, 2026Updated last week
- ☆17Nov 3, 2024Updated last year
- ☆21Oct 2, 2024Updated last year
- ☆14Sep 19, 2022Updated 3 years ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆20Apr 16, 2025Updated last year
- [ACCV 2024] Official code of our paper "Joint Image Super-resolution and Low-light Enhancement in the Dark"☆14Jun 23, 2025Updated 9 months ago
- Sabonis, a Digital Forensics and Incident Response pivoting tool☆19Mar 3, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 10 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 8 months ago
- ☆10Aug 27, 2022Updated 3 years ago
- This approach of Intrusion Detection uses two GPT models, which are trained on normal network traffic, to predict sequences of communicat…☆11Oct 3, 2023Updated 2 years ago
- High accuracy captcha solver for SJTU Jaccount login page using SVM and ResNet.☆13Nov 9, 2022Updated 3 years ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- 这个仓库包含了我在上人工智能课时完成的拼音输入法作业。☆11Feb 16, 2022Updated 4 years ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆33Updated this week
- 清华树洞在被封禁之前的所有数据(All data publicated in THU tree hole during 2020 Spring to 2021 Winter)☆16Jul 30, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- Code for "LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts"☆12Oct 17, 2024Updated last year
- ☆14Feb 26, 2025Updated last year
- The official implementation for Common Sense Enhanced Knowledge-based Recommendation with Large Language Model☆14Apr 21, 2024Updated last year
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆48Mar 29, 2026Updated 3 weeks ago
- ☆12Nov 20, 2023Updated 2 years ago
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆15Mar 21, 2020Updated 6 years ago
- [TVC 2023] Official PyTorch implementation for "Learning adaptive hyper-guidance via proxy-based bilevel optimization for image enhanceme…☆11Oct 8, 2024Updated last year
- Datasets for cybersecurity☆16Aug 12, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [TPAMI 2024] Official PyTorch implementation for "Learning with Constraint Learning: New Perspective, Solution Strategy and Various Appli…☆13Oct 7, 2024Updated last year
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 10 months ago
- Usenix Security'23☆15Feb 14, 2023Updated 3 years ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 10 months ago
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆113Nov 12, 2025Updated 5 months ago
- ☆14Feb 11, 2022Updated 4 years ago
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated 2 weeks ago