usail-hkust/benchmark_inference_time_computation_LLM

[NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning

☆16

Alternatives and similar repositories for benchmark_inference_time_computation_LLM

Users that are interested in benchmark_inference_time_computation_LLM are comparing it to the libraries listed below.

baopj / E3M
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11 · Jul 16, 2024 · Updated 2 years ago
ljang0 / videowebarena
☆14 · Dec 25, 2024 · Updated last year
ShoumikSaha / agent-skill-security
☆15 · May 13, 2026 · Updated 2 months ago
AI45Lab / DEAN
☆11 · Oct 25, 2024 · Updated last year
luckyfan-cs / Template-of-HKUST-GZ-Thesis
☆59 · Jul 1, 2024 · Updated 2 years ago
THUDM / APAR
APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
☆14 · Jul 22, 2024 · Updated 2 years ago
Peiyance / REVOLVE
Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
☆22 · Dec 13, 2024 · Updated last year
AstorYH / PASB
An end-to-end security evaluation framework tailored for real-world personalized agents.
☆15 · Feb 28, 2026 · Updated 4 months ago
LaVi-Lab / FTTT
[ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time".
☆13 · May 16, 2025 · Updated last year
tanganke / subspace_fusion
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14 · Mar 28, 2024 · Updated 2 years ago
TrustAIRLab / HarmfulSkillBench
The Official Repository for Paper "HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?"
☆15 · May 2, 2026 · Updated 2 months ago
tanganke / pareto_set_learning
Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"
☆11 · Sep 13, 2024 · Updated last year
Vito-Swift / sigcomm23-BeamSense
Simulation, multi-path estimation, and CBR parsing code of SIGCOMM2023 BeamSense CBR-Sensing
☆10 · Jan 14, 2024 · Updated 2 years ago
ZrW00 / MuScleLoRA
The code implementation of MuScleLoRA (accepted at ACL 2024)
☆10 · Dec 1, 2024 · Updated last year
thunlp / NOSA
The official implementation of NOSA
☆19 · Jun 11, 2026 · Updated last month
gridaco / context
📖 UI/UX context detection engine
☆12 · Jan 3, 2021 · Updated 5 years ago
hkust-nlp / GUIMid
☆22 · May 3, 2025 · Updated last year
colour-science / smits1999
An RGB to Spectrum Conversion for Reflectances - Smits (1999)
☆14 · Jan 26, 2020 · Updated 6 years ago
sinwang20 / D2PO
[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…
☆18 · Jul 22, 2025 · Updated last year
psunlpgroup / FoVer
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆19 · Apr 7, 2026 · Updated 3 months ago
EIT-NLP / Speak-While-Watching
☆17 · Mar 1, 2026 · Updated 4 months ago
XuankunRong / SafeGRPO
[CVPR'26] SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
☆21 · Feb 19, 2026 · Updated 5 months ago
TiankaiHang / CCA
☆22 · Jan 26, 2024 · Updated 2 years ago
MantisAI / prompt_engineering
Code that accompanies the PyData New York (2022) talk "Addressing the sensitivity of large language models"
☆13 · Nov 7, 2022 · Updated 3 years ago
ethz-spylab / misleading-privacy-evals
Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)
☆13 · Apr 29, 2024 · Updated 2 years ago
open-compass / CriticEval
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs
☆49 · Nov 29, 2024 · Updated last year
latent-variable / r1_reasoning_effort
Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.
☆19 · Feb 12, 2025 · Updated last year
yuanmu97 / PacketGame
[SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
☆15 · Jul 1, 2023 · Updated 3 years ago
LeiWang1999 / TVM.CMakeExtend
Tutorials on extending and importing TVM with CMake include dependencies.
☆16 · Oct 11, 2024 · Updated last year
QianNing0 / MoG-DUN
PyTorch code for JSTSP2021 paper "Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network"
☆12 · Nov 21, 2020 · Updated 5 years ago
IBM / ColPret
Efficient scaling laws and collaborative pretraining.
☆23 · Updated this week
dapowan / Penetrative-AI
☆17 · Oct 19, 2023 · Updated 2 years ago
DD-DuDa / TensorRT-in-Action
TensorRT-in-Action is a GitHub repository that provides code examples for using TensorRT, with accompanying Jupyter Notebooks.
☆15 · Jun 1, 2023 · Updated 3 years ago
AmazingSealock / HKUSTGZ_Indoor_Robot
☆15 · Jun 19, 2025 · Updated last year
LeiWang1999 / Stream-k.tvm
☆20 · Sep 28, 2024 · Updated last year
66RING / CritiPrefill
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17 · Sep 15, 2024 · Updated last year
tmlr-group / AlphaDiana
A System for Evaluating Reasoning Agents such as OpenClaw
☆21 · Apr 3, 2026 · Updated 3 months ago
bhatiaabhinav / RL-v2
Version 3.0.0 PyTorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)
☆12 · Feb 16, 2023 · Updated 3 years ago
brave-experiments / MELT-public
Codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
☆21 · Jul 19, 2024 · Updated 2 years ago