liuyifan22 / Qwen2.5-VL-BatchedLinks
A batched implementation for efficient Qwen2.5-VL inference.
☆20Updated 5 months ago
Alternatives and similar repositories for Qwen2.5-VL-Batched
Users that are interested in Qwen2.5-VL-Batched are comparing it to the libraries listed below
Sorting:
- 💻 As a Frontend Development Intern at Shen AI (Aug – Oct 2024), I built the company website using React.js and worked with the design te…☆13Updated 7 months ago
- Restaurant Management System is a web application built with Next.js and TypeScript. It offers features for restaurant management, includ…☆14Updated 8 months ago
- Incremental optimizations to the N-Body problem in order to evaluate and compare the performance of Python translators in the HPC environ…☆13Updated 2 years ago
- The proposal of this work involves a simulation of an ant colony swarm that was applied to a problem of search and rescue of objects of i…☆12Updated 2 years ago
- This project is a real-time Wav2Lip implementation that I am actively optimizing to enhance the precision and performance of audio-to-lip…☆11Updated 2 years ago
- Membrane-based dehumidification is currently being considered as a promising solution for the building application due to its low cost an…☆10Updated 5 years ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆21Updated 7 months ago
- A clean, modular implementation of the Proximal Policy Optimization (PPO) algorithm in PyTorch, written with a strong focus on readabilit…☆19Updated last year
- High-performance CUDA kernels for real-time financial low latency inference, optimized for both consumer and datacenter GPUs.☆19Updated 5 months ago
- aliasgharheidaricom / RUN-Beyond-the-Metaphor-An-Efficient-Optimization-Algorithm-Based-on-Runge-Kutta-MethodThe optimization field suffers from the metaphor-based “pseudo-novel” or “fancy” optimizers. Most of these cliché methods mimic animals' …