☆52Feb 17, 2025Updated last year
Alternatives and similar repositories for InfiniRetri
Users that are interested in InfiniRetri are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.☆122Mar 27, 2025Updated last year
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 9 years ago
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆29Jun 22, 2026Updated last week
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆58Mar 31, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Nexusflow function call, tool use, and agent benchmarks.☆29Dec 13, 2024Updated last year
- This project is established for real-time training of the RWKV model.☆49May 17, 2024Updated 2 years ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated last year
- The source code for running LLMs on the AAAR-1.0 benchmark.☆20Apr 5, 2025Updated last year
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 5 years ago
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- This repository benchmarks multiple vector databases for music semantic search, using a shared dataset and query set. It provides both a …☆36Aug 31, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆93Aug 18, 2024Updated last year
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆90Mar 25, 2025Updated last year
- HTTP proxy for authenticating users via OAuth2☆10Sep 6, 2019Updated 6 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆16Apr 30, 2025Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 8 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Aug 24, 2025Updated 10 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 6 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jun 2, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- A Modular System for Flexible, High-Performance Traffic http://www.ict-mplane.eu/☆24Oct 4, 2018Updated 7 years ago
- ☆52Mar 18, 2025Updated last year
- Official Implementation for NorMuon paper☆81Apr 30, 2026Updated 2 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆29Jul 22, 2025Updated 11 months ago
- HaSTL: A fast GPU implementation of STL decomposition with missing values and support for both CUDA and OpenCL☆13Sep 11, 2023Updated 2 years ago
- ☆19Oct 2, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels …☆30Dec 26, 2025Updated 6 months ago
- Alice is a Browser Extension to supercharge your literature review by providing instant context, summaries, bibtex, and code implementati…☆24Apr 30, 2026Updated 2 months ago
- Non-blocking concurrent hashmap for Haskell☆18Sep 29, 2017Updated 8 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 4 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 3 years ago
- Simple trend predictions for bitcoin price data and cryptocurrency market capitalization.☆12Jan 20, 2018Updated 8 years ago
- ☆17Jun 12, 2026Updated 3 weeks ago