vLLM fork for Tesla V100 (SM70) with AWQ 4-bit support, CUDA 12.8 build flow, and validated Qwen3.5 27B/35B deployment on multi-GPU V100.
☆433Jun 20, 2026Updated last week
Alternatives and similar repositories for 1Cat-vLLM
Users that are interested in 1Cat-vLLM are comparing it to the libraries listed below.
- RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a multimodal model for obtaining images and text similarities and…☆15Jan 25, 2022Updated 4 years ago
- ECAI 2025☆20May 4, 2026Updated last month
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- This Elgg plugin lets users preview MS Office files (doc, docx, xls, xlsx, ppt, pptx), Apple iWork pages, Adobe eps, and zip files using …☆12Aug 28, 2015Updated 10 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- Details about the wide minima density hypothesis and code to compute width of a minima☆10Nov 30, 2024Updated last year
- Building a quick conversation-based search demo with langchain.☆10Apr 2, 2024Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 3 years ago
- Machine translation data processing tools☆10Apr 29, 2024Updated 2 years ago
- ☆11Nov 12, 2018Updated 7 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Web crawler scripts I have written☆12May 11, 2019Updated 7 years ago
- ☆12Apr 29, 2021Updated 5 years ago
- Simple Interprocess Plugins for Go☆16Mar 24, 2020Updated 6 years ago
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- [TNNLS 2026] SvANet: Exploiting Scale-Variant Attention for Segmenting Small Medical Objects☆72Jan 29, 2026Updated 5 months ago
- IEEE OUI database as JSON☆18Updated this week
- A Github App to chat with Your GitHub Repo's Issues Using ChatGPT☆16Mar 8, 2023Updated 3 years ago
- PaddleSeq☆10Mar 28, 2023Updated 3 years ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- A tiny and efficient non-blocking or asynchronous network library☆13May 4, 2019Updated 7 years ago
- ☆12May 20, 2022Updated 4 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆65Sep 25, 2024Updated last year
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆19Nov 28, 2023Updated 2 years ago
- ☆12Aug 3, 2024Updated last year
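- An optimized version of Zhihu's image picker framework: adds an option to select the original image, can display the original image size, and adapts the status bar color; fixes a bug where some large images could not be displayed.☆11Sep 11, 2019Updated 6 years ago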
- ☆78Feb 19, 2024Updated 2 years ago
- Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"☆12May 8, 2023Updated 3 years ago
- ☆19Jul 20, 2015Updated 10 years ago
- Extracts the text from DWG and DXF files.☆15Mar 31, 2016Updated 10 years ago
- This gem wraps command line tools to extract plain text from typical files, such as PDF and common office formats.☆14Nov 25, 2025Updated 7 months ago
- Large language model toolkit☆27Aug 1, 2025Updated 10 months ago
- Fixed memory overflow issue in ProcessHider.☆16May 27, 2018Updated 8 years ago
- Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR data into a SQL Database for analysis☆25Jun 16, 2026Updated last week
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 4 years ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆21Updated this week
- ☆11Feb 26, 2024Updated 2 years ago
- ☆17Aug 25, 2022Updated 3 years ago