[EACL'26] DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router
☆104Jan 4, 2026Updated 2 months ago
Alternatives and similar repositories for DeepSieve
Users that are interested in DeepSieve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆86Jun 20, 2025Updated 9 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆56Feb 24, 2026Updated last month
- ☆17Feb 16, 2025Updated last year
- ☆10Mar 31, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines☆23Aug 14, 2024Updated last year
- VLM benchmarks for robot manipulation tasks☆22Apr 30, 2025Updated 10 months ago
- VAEGAN, I Love u☆16Aug 15, 2023Updated 2 years ago
- ☆101Jan 23, 2026Updated 2 months ago
- [ICONIP'24]Mingyu.Jin's final year project☆30Aug 23, 2024Updated last year
- This is the official repo for [CVPR 2025] paper, Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipul…☆29Mar 31, 2025Updated 11 months ago
- 电子病历结构化解析☆13May 11, 2022Updated 3 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository to demonstrate an application built with Java 21 + SrpingBoot 3 + MyBatis including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- ☆12Mar 18, 2021Updated 5 years ago
- 这个是由清华大学基础模型研究中心主办的《2024金融行业·大模型挑战赛》复赛参赛方案☆58Apr 27, 2025Updated 11 months ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- ☆22Apr 23, 2024Updated last year
- ☆18Dec 20, 2023Updated 2 years ago
- ☆10May 4, 2018Updated 7 years ago
- ☆22Sep 16, 2022Updated 3 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 金融多模态研究报告生成☆123Jul 2, 2025Updated 8 months ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- Implementation of "Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines"☆31Feb 24, 2025Updated last year
- PhD thesis template with title page according to the University of Amsterdam.☆15Sep 12, 2021Updated 4 years ago
- Code to obtain the Gigaword dataset (non-anonymized) for summarization☆13Aug 27, 2022Updated 3 years ago
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47May 11, 2025Updated 10 months ago
- ☆20Aug 30, 2024Updated last year
- 针对保险话术培训场景设计的陪练机器人/培训机器人的demo☆19Dec 16, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Sep 6, 2021Updated 4 years ago
- A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers"☆19May 31, 2025Updated 9 months ago
- ☆81Oct 1, 2025Updated 5 months ago
- ☆196Jun 2, 2025Updated 9 months ago
- ☆67Sep 18, 2024Updated last year
- EfficientDet_anchor_free☆11Feb 19, 2020Updated 6 years ago
- ☆10Apr 3, 2023Updated 2 years ago