Efficient LLM inference on Slurm clusters.
☆97May 12, 2026Updated last week
Alternatives and similar repositories for vector-inference
Users that are interested in vector-inference are comparing it to the libraries listed below.
- LLM finetuning in resource-constrained environments.☆56Jun 24, 2024Updated last year
- ☆18Updated this week
- Toy autograd engine in OCaml with Apple Accelerate backend☆30Jul 31, 2024Updated last year
- A toolkit for evaluating and monitoring AI models in clinical settings☆91May 11, 2026Updated last week
- ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval☆17Apr 25, 2021Updated 5 years ago
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- Simple (and cheap!) neural network uncertainty estimation☆82Oct 7, 2025Updated 7 months ago
- A collection of computer vision image and video use case implementations and datasets used for the Vector Institute Computer Vision Knowl…☆11Jan 14, 2025Updated last year
- Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"☆26Mar 24, 2026Updated last month
- ☆12Feb 11, 2026Updated 3 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- ☆14Oct 7, 2024Updated last year
- A Reinforcement Learning solution for HVAC control to optimize energy consumption. Developed by Vector Institute and TELUS.☆79Apr 27, 2023Updated 3 years ago
- ☆22Feb 25, 2019Updated 7 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 11 months ago
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- A Large-Scale Dataset and Framework for Genomic Foundation Model Benchmarking☆32May 12, 2026Updated last week
- SegMate: A Segmentation Toolkit☆22Feb 18, 2024Updated 2 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆12Jun 29, 2022Updated 3 years ago
- Code accompanying the paper "Understanding Bias in Word Embeddings"☆22Dec 8, 2022Updated 3 years ago
- Jupyter Notebooks from book UNDERSTANDING DEEP LEARNING (Prof Simon Prince) that I could solve.☆15Mar 20, 2024Updated 2 years ago
- The source code of ExFunTube☆10Aug 8, 2025Updated 9 months ago
- Collection of evals for Inspect AI☆498Updated this week
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated 2 months ago
- ☆11Oct 18, 2022Updated 3 years ago
- [ICCV 2025] LightSwitch: Multi-view Relighting with Material-guided Diffusion☆68Aug 13, 2025Updated 9 months ago
- YesBut - Multimodal Satire Comprehension Dataset☆19Oct 23, 2024Updated last year
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆15Sep 12, 2018Updated 7 years ago
- Supercharging Imbalanced Data Learning With Causal Representation Transfer☆12Nov 29, 2021Updated 4 years ago
- SOTA work about out-of-distribution detection☆14Mar 5, 2021Updated 5 years ago
- ☆25Aug 29, 2025Updated 8 months ago
- BookWorm: A Dataset for Character Description and Analysis [EMNLP Findings 2024]☆14Feb 28, 2025Updated last year
- ☆13Apr 23, 2025Updated last year
- ☆11Aug 31, 2024Updated last year
- Code for Learning idiolectal style variation in online register☆10May 18, 2023Updated 3 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The program ranked first in the Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 2 months ago