Lifecycle-Aware Memory for long-horizon LLM agents — 66.05% on PaperBench, 94.66% on SurveyBench, 10 peer-reviewed acceptances at FSE/ICML/TOSEM/AEI/ICoGB
☆117May 8, 2026Updated last week
Alternatives and similar repositories for PaperGuru-Benchmark
Users that are interested in PaperGuru-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A KMP (Kotlin Multiplatform) logging library with Android-style API. Write once, log everywhere — composable, lazy, and zero-boilerplate.☆27Jan 5, 2026Updated 4 months ago
- A local-first, file-first research knowledge compiler for building reviewable Markdown wikis from raw materials.☆80Apr 19, 2026Updated last month
- A Codex/OpenSkills workflow for turning product ideas, prototypes, and screenshots into structured PRDs.☆53Apr 26, 2026Updated 3 weeks ago
- ☆24Feb 6, 2026Updated 3 months ago
- ☆79Feb 4, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 2.4Ghz&5Ghz双频WiFi deauth 安全测试模块,并对现有的固件做了小改动和升级处理【2.4Ghz & 5Ghz dual-band WiFi deauth security test module, and made minor modifications …☆58Jan 18, 2025Updated last year
- LeakyCLIP is a CLIP inversion and privacy auditing framework for extracting training data signals from CLIP embeddings and analyzing memb…☆24Feb 27, 2026Updated 2 months ago
- A Quantum Computing Library in Rust which help deploy your emulation☆189Jan 18, 2026Updated 4 months ago
- 让AI完全接管你的博客☆31Nov 2, 2025Updated 6 months ago
- A golang goroutine pool with high-performance and elegance☆132May 10, 2026Updated last week
- This project is a research project based on Deep Reinforcement Learning (DRL), aiming to solve the resource coordination problem in UAV-A…☆102Oct 13, 2025Updated 7 months ago
- A large-scale open 3D dataset designed for autonomous driving, robotics, and 4D perception tasks☆230Mar 31, 2026Updated last month
- TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems☆218Apr 17, 2026Updated last month
- 从0训练类 o1 大语言模型。☆133Jan 8, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆170Mar 12, 2026Updated 2 months ago
- A visual animation engine for demonstrating relativity physics☆125Nov 26, 2025Updated 5 months ago
- 一个在 JetBrains 上的插件:Tree Description 。可以为项目模块增加自定义备注,颜色分类、标注用途,还可以共享开源映射关系。☆212Jan 26, 2026Updated 3 months ago
- LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505☆3,385Mar 4, 2026Updated 2 months ago
- Go implementation of OpenAI's Gym.☆115May 3, 2026Updated 2 weeks ago
- A diffusion-based style transfer system that injects multi-token CLIP style embeddings into UNet attention layers for controllable artist…☆81Dec 1, 2025Updated 5 months ago
- A Python package for street view image perception analysis, providing tools for feature extraction and comfort prediction.☆82Apr 29, 2026Updated 2 weeks ago
- An AI-native multi-model database unifying SQL, vector, full-text, graph, and sandboxed Python — for transactional, analytical, and agent…☆102May 2, 2026Updated 2 weeks ago
- Multimodal Document Intelligence Platform☆41Apr 10, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆600Jan 15, 2025Updated last year
- A navigation algorithm based on CMU team's open-source local planner☆120Oct 9, 2025Updated 7 months ago
- DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG retrieval.☆348May 6, 2026Updated last week
- [ICRA 2025] Official Implementation of "Robust Robot Walker: Learning Agile Locomotion over Tiny Traps"☆89Apr 28, 2025Updated last year
- Official Repo for "EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies"☆94Mar 18, 2026Updated 2 months ago
- The study leverages street view images to understand urban-scale streetscape thermal comfort.☆87May 13, 2025Updated last year
- ☆22Nov 16, 2025Updated 6 months ago
- Concise Evaluation Benchmark for Large Language Models☆25Jul 27, 2025Updated 9 months ago
- 之前在做视频理解相关的工作,用qt写了一个视频动作标注工具, 简单易用。☆21Sep 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆1,547Sep 18, 2025Updated 8 months ago
- ☆87May 9, 2026Updated last week
- 🔥 A continuously updated collection of papers, datasets, and benchmarks on post-training and alignment for video generation.☆140Apr 13, 2026Updated last month
- 汇集前沿、有趣、实用的AI应用项目,并提供一键部署体验。☆52Nov 17, 2025Updated 6 months ago
- SAG - SQL驱动的RAG引擎 · 查询时自动构建知识图谱 | SQL-Driven RAG Engine · Automatically Build Knowledge Graph During Querying☆1,129Dec 8, 2025Updated 5 months ago
- A minimal and lightweight video streaming management platform 一个极简轻量的视频流媒体管理平台☆458Apr 17, 2026Updated last month
- ☆104May 11, 2026Updated last week