NeurIPS 2024 tutorial on LLM Inference
☆49 · Dec 10, 2024 · Updated last year
Alternatives and similar repositories for neurips2024-inference-tutorial-code
Users who are interested in neurips2024-inference-tutorial-code compare it to the repositories listed below.
- See https://github.com/cuda-mode/triton-index/ instead! · ☆11 · May 8, 2024 · Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators" · ☆12 · Mar 25, 2025 · Updated 11 months ago
- Reinforcement Learning via Regressing Relative Rewards · ☆39 · Dec 12, 2024 · Updated last year
- ☆14 · Oct 11, 2023 · Updated 2 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators" · ☆41 · Dec 13, 2024 · Updated last year
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ · ☆71 · Mar 28, 2025 · Updated 11 months ago
- ☆14 · May 21, 2024 · Updated last year
- DPO, but faster 🚀 · ☆48 · Dec 6, 2024 · Updated last year
- ☆16 · Jul 23, 2024 · Updated last year
- Official code for Deep Bayesian Video Frame Interpolation (ECCV2022) · ☆18 · May 29, 2023 · Updated 2 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. · ☆19 · Dec 27, 2024 · Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn… · ☆19 · Dec 27, 2024 · Updated last year
- Reinforcement learning tutorials using the rlberry library. · ☆17 · Jan 9, 2023 · Updated 3 years ago
- ☆21 · Nov 11, 2024 · Updated last year
- Triton Implementation of HyperAttention Algorithm · ☆48 · Dec 11, 2023 · Updated 2 years ago
- ☆17 · Jul 3, 2017 · Updated 8 years ago
- Vocabulary Parallelism · ☆25 · Mar 10, 2025 · Updated 11 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… · ☆28 · Apr 17, 2024 · Updated last year
- ☆46 · Feb 8, 2024 · Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. · ☆187 · Jan 19, 2026 · Updated last month
- Fantastic Data Engineering for Large Language Models · ☆93 · Dec 29, 2024 · Updated last year
- A simple Python tool to measure the performance of ONNX models. · ☆27 · Sep 15, 2024 · Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization · ☆23 · Apr 17, 2024 · Updated last year
- A curated list of the role of small models in the LLM era · ☆111 · Sep 23, 2024 · Updated last year
- Official Repository of Personalized Visual Instruct Tuning · ☆34 · Mar 6, 2025 · Updated 11 months ago
- Distributional Gradient Boosting Machines · ☆28 · Dec 13, 2022 · Updated 3 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment · ☆61 · Aug 30, 2024 · Updated last year
- ☆29 · Oct 3, 2022 · Updated 3 years ago
- [MM 2024 Oral] Refiner for AIGC · ☆29 · Jul 29, 2024 · Updated last year
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… · ☆27 · Oct 20, 2022 · Updated 3 years ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives · ☆70 · Feb 22, 2024 · Updated 2 years ago
- Map-Elites based on Evolution Strategies · ☆33 · Feb 11, 2022 · Updated 4 years ago
- ☆31 · Aug 25, 2022 · Updated 3 years ago
- ☆123 · Feb 21, 2025 · Updated last year
- ☆34 · Jan 7, 2026 · Updated last month
- ☆29 · Dec 28, 2025 · Updated 2 months ago
- ☆35 · Jan 21, 2025 · Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo… · ☆29 · Sep 25, 2024 · Updated last year
- ☆71 · Oct 29, 2021 · Updated 4 years ago