☆52Feb 17, 2025Updated last year
Alternatives and similar repositories for InfiniRetri
Users that are interested in InfiniRetri are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AI Pull-Request Reviewer Companion (in the command line)☆13Apr 11, 2024Updated 2 years ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆57Mar 31, 2026Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jul 13, 2024Updated last year
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated last year
- A collection of statistics algorithms from Mersenne twister generator to MCMC sampling.☆19Aug 2, 2022Updated 3 years ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Notebooks for Alea GPU☆12Feb 16, 2017Updated 9 years ago
- This repository benchmarks multiple vector databases for music semantic search, using a shared dataset and query set. It provides both a …☆35Aug 31, 2025Updated 8 months ago
- ☆92Aug 18, 2024Updated last year
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆89Mar 25, 2025Updated last year
- In this paper, the image is spliced based on super pixel.☆12Mar 15, 2019Updated 7 years ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 6 months ago
- ☆53Jul 18, 2024Updated last year
- Official Implementation for NorMuon paper☆70Apr 30, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 11 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 9 months ago
- ☆19Oct 2, 2023Updated 2 years ago
- ☆20Mar 18, 2026Updated 2 months ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆19Mar 4, 2023Updated 3 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- transplant several overlays to s9_pynq board☆17Oct 31, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆83Mar 26, 2026Updated last month
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- ☆41Apr 30, 2025Updated last year
- ☆33May 26, 2024Updated last year
- Python powered music controlling webpage with websockets and bottle py (works with spotify, vlc, audacious, and others)☆11Jun 9, 2017Updated 8 years ago
- Combining SOAP and MUON☆21Feb 11, 2025Updated last year
- ☆13Apr 17, 2024Updated 2 years ago