Efficient LLM inference on Slurm clusters.
☆97Apr 27, 2026Updated this week
Alternatives and similar repositories for vector-inference
Users that are interested in vector-inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM finetuning in resource-constrained environments.☆55Jun 24, 2024Updated last year
- A toolkit for research on multimodal representation learning☆19Apr 20, 2026Updated last week
- Toy autograd engine in OCaml with Apple Accelerate backend☆30Jul 31, 2024Updated last year
- You should use PySR to find scaling laws. Here's an example.☆33Sep 30, 2023Updated 2 years ago
- A streamlined reference manual for AI practitioners, students, and developers to quickly look up core concepts and implementations.☆26Jun 23, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- ☆22Apr 17, 2025Updated last year
- ☆14Feb 4, 2026Updated 2 months ago
- A toolkit providing easy and unified access to building control environments for reinforcement learning (RL) [No longer actively maintain…☆42Nov 26, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆10Apr 27, 2021Updated 5 years ago
- Completing the Puzzle of All-in-One Event Understanding Benchmark with Event Arguments☆14Mar 12, 2024Updated 2 years ago
- Deprecated in favor of MultivariateStats.jl☆27Sep 13, 2014Updated 11 years ago
- A Reinforcement Learning solution for HVAC control to optimize energy consumption. Developed by Vector Institute and TELUS.☆79Apr 27, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Jun 17, 2022Updated 3 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆12Jun 29, 2022Updated 3 years ago
- SegMate: A Segmentation Toolkit☆22Feb 18, 2024Updated 2 years ago
- Official Implementation of "Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning" at EMNLP 2024 Main Conf…☆45Jul 31, 2025Updated 8 months ago
- Code accompanying the paper "Understanding Bias in Word Embeddings"☆22Dec 8, 2022Updated 3 years ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- This repository provides the dataset used in "Schema-Guided Natural Language Generation" by Yuheng Du, Shereen Oraby, Vittorio Perera, Mi…☆13Dec 8, 2020Updated 5 years ago
- Collection of evals for Inspect AI☆466Updated this week
- The source code of ExFunTube☆10Aug 8, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated last month
- YesBut - Multimodal Satire Comprehension Dataset☆19Oct 23, 2024Updated last year
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆15Sep 12, 2018Updated 7 years ago
- Supercharging Imbalanced Data Learning WithCausal Representation Transfer☆12Nov 29, 2021Updated 4 years ago
- ☆25Aug 29, 2025Updated 8 months ago
- ☆13Apr 23, 2025Updated last year
- Annotated sequence data☆11Feb 2, 2025Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated last year
- CLAIR: A (surprisingly) simple semantic text metric with large language models.☆22Jan 28, 2024Updated 2 years ago
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- CaPC is a method that enables collaborating parties to improve their own local heterogeneous machine learning models in a setting where b…☆24Mar 16, 2022Updated 4 years ago
- ☆34Apr 14, 2025Updated last year
- Library to extract embeddings for DNA sequences using BioFM genomics foundation model☆19Aug 13, 2025Updated 8 months ago
- ☆26Feb 23, 2026Updated 2 months ago