Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆60 · Apr 9, 2024 · Updated last year
Alternatives and similar repositories for NeurIPS-llm-efficiency-challenge
Users that are interested in NeurIPS-llm-efficiency-challenge are comparing it to the libraries listed below
- Using multiple LLMs for ensemble forecasting ☆16 · Jan 17, 2024 · Updated 2 years ago
- Evaluating LLMs with fewer examples ☆169 · Apr 12, 2024 · Updated last year
- [ICLR 2025] Official implementation of "Expand and Compress: Exploring Tuning Principles for Continual Spatio-Temporal Graph Forecasting" ☆24 · Oct 30, 2025 · Updated 3 months ago
- ☆32 · Jan 1, 2024 · Updated 2 years ago
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆14 · Mar 20, 2024 · Updated last year
- Supervised instruction finetuning for LLMs with HF Trainer and DeepSpeed ☆36 · Jul 6, 2023 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆16 · Aug 23, 2023 · Updated 2 years ago
- ☆13 · Oct 21, 2021 · Updated 4 years ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆37 · Oct 9, 2025 · Updated 4 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs ☆18 · Aug 17, 2023 · Updated 2 years ago
- A Python-based tool that monitors dark web sources for mentions of specific organizations for Threat Monitoring. ☆23 · Apr 7, 2025 · Updated 10 months ago
- ☆16 · Sep 18, 2023 · Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS) ☆24 · Jul 12, 2025 · Updated 7 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Dec 30, 2023 · Updated 2 years ago
- ☆20 · Jul 12, 2023 · Updated 2 years ago
- PyCon Talks 2022 by Antoine Toubhans ☆23 · Jul 8, 2022 · Updated 3 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks ☆140 · Dec 28, 2024 · Updated last year
- Code for fine-tuning Platypus family LLMs using LoRA ☆629 · Feb 4, 2024 · Updated 2 years ago
- ☆19 · Dec 6, 2023 · Updated 2 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD-only; don't use it for Adam ☆86 · Jul 28, 2024 · Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆261 · Apr 23, 2024 · Updated last year
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… ☆50 · Dec 20, 2023 · Updated 2 years ago
- ☆26 · Nov 25, 2023 · Updated 2 years ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding ☆40 · Mar 18, 2025 · Updated 10 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 · Jan 7, 2024 · Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Apr 4, 2023 · Updated 2 years ago
- ☆150 · Jan 4, 2024 · Updated 2 years ago
- ☆55 · Apr 1, 2024 · Updated last year
- ☆23 · Sep 19, 2024 · Updated last year
- Includes examples on how to evaluate LLMs ☆23 · Nov 4, 2024 · Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆61 · Apr 8, 2024 · Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆55 · Aug 31, 2024 · Updated last year
- ☆12 · Jan 17, 2026 · Updated 3 weeks ago
- Data Structures with Python (AIX20001) lecture materials ☆18 · Jun 14, 2024 · Updated last year
- ☆25 · Nov 12, 2022 · Updated 3 years ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow. ☆27 · Mar 11, 2025 · Updated 11 months ago
- The official repository of the paper "On the Exploitability of Instruction Tuning". ☆70 · Feb 5, 2024 · Updated 2 years ago
- ☆28 · Updated this week
- Let's build better datasets, together! ☆270 · Dec 20, 2024 · Updated last year