Blazing-fast LLM inference in pure Rust. No PyTorch and Python runtime.
☆220May 25, 2026Updated this week
Alternatives and similar repositories for xinfer
Users that are interested in xinfer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jan 4, 2024Updated 2 years ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆663May 19, 2026Updated last week
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated last year
- This Repository allows to super fast download historical ohlcv data from binance.☆12Nov 27, 2020Updated 5 years ago
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆17Aug 1, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- "BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks"☆13May 10, 2024Updated 2 years ago
- This project aims to provide a quick and efficient way to capture any thought to your AnyType second brain. It leverages the protobuf GRP…☆16Aug 26, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm☆16Apr 19, 2023Updated 3 years ago
- A constraint programming solver.☆10Jan 10, 2024Updated 2 years ago
- ☆13Mar 27, 2020Updated 6 years ago
- ☆12Apr 26, 2024Updated 2 years ago
- A complete CUDA tutorial ranging from first GPU programs to advanced asynchronous methods☆30Jan 22, 2026Updated 4 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆98May 18, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TextPy: Collaborative Agent Workflow through Programming and Prompting☆27May 9, 2025Updated last year
- Run GEPA on your favorite non-python libraries.☆35Jan 22, 2026Updated 4 months ago
- Community client for handling Weaviate vector database transactions written in Rust, for Rust.☆10Jun 2, 2024Updated last year
- Deformable Convolution Networks v4☆15Mar 25, 2024Updated 2 years ago
- Two-Path-Transformer-Based Generative Adversarial Network Using Joint Magnitude Masking And Complex Spectral Mapping For Speech Enhanceme…☆16May 29, 2024Updated last year
- caro: fast Rust CLI that turns natural‑language tasks into a safe POSIX command. Built for macOS (MLX/Metal) with a built‑in model; suppo…☆33May 18, 2026Updated last week
- Pytorch implementation of an energy transformer - an energy-based reccurrent variant of the transformer.☆14Jul 11, 2023Updated 2 years ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- I-JEPA finetuning recipe☆13Jul 11, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.☆21Nov 26, 2025Updated 6 months ago
- (ACM MM 2022 Workshop APCCPA) IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression.☆16Mar 15, 2024Updated 2 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- Avoid merge conflicts across git worktrees for parallel AI coding agents☆59Feb 24, 2026Updated 3 months ago
- ☆14Dec 21, 2024Updated last year
- Secure Command-Line Password Manager☆25Dec 4, 2025Updated 5 months ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 9 months ago
- A collection of writings from historical Christianity, browse at https://historicalchristian.faith/by_father.php☆17Updated this week
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62May 6, 2023Updated 3 years ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆12Aug 16, 2023Updated 2 years ago
- Black for Python docstrings and reStructuredText (rst).☆18Apr 7, 2023Updated 3 years ago
- ☆16Feb 6, 2024Updated 2 years ago
- Samples to show you how to create and deploy apps with Defang.☆11May 14, 2026Updated last week
- Rust API client for https://open-meteo.com/☆36Feb 21, 2026Updated 3 months ago
- Docker image for Cloudflare workerd☆15Feb 11, 2023Updated 3 years ago