sasha0552 / pascal-pkgs-ci
The main repository for building Pascal-compatible versions of ML applications and libraries.
☆63Updated 2 weeks ago
Alternatives and similar repositories for pascal-pkgs-ci:
Users that are interested in pascal-pkgs-ci are comparing it to the libraries listed below
- A fork of vLLM enabling Pascal architecture GPUs☆25Updated last month
- CI scripts designed to build a Pascal-compatible version of vLLM.☆12Updated 7 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆231Updated this week
- Lightweight Inference server for OpenVINO☆143Updated this week
- AI management tool☆113Updated 4 months ago
- LLM inference in C/C++☆67Updated last week
- Open Source Text Embedding Models with OpenAI Compatible API☆150Updated 8 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆190Updated 2 months ago
- GPU Power and Performance Manager☆57Updated 5 months ago
- Download models from the Ollama library, without Ollama☆68Updated 4 months ago
- ☆83Updated 3 months ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆56Updated last year
- Enhancing Translation with RAG-Powered Large Language Models☆77Updated last week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆55Updated last month
- ☆197Updated 2 weeks ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆59Updated 8 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆243Updated 3 weeks ago
- Sentence Transformers API: An OpenAI compatible embedding API server☆49Updated 6 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Updated 5 months ago
- An innovative library for efficient LLM inference via low-bit quantization☆351Updated 7 months ago
- ☆81Updated 3 weeks ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆88Updated 2 months ago
- Easily view and modify JSON datasets for large language models☆71Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆148Updated 10 months ago
- automatically quant GGUF models☆164Updated last week
- A multimodal, function calling powered LLM webui.☆214Updated 6 months ago
- Advanced Quantization Algorithm for LLMs/VLMs.☆413Updated this week
- A fast batching API to serve LLM models☆183Updated 11 months ago
- LM inference server implementation based on *.cpp.☆154Updated this week
- A pipeline parallel training script for LLMs.☆136Updated last week