☆112Jun 15, 2025Updated 8 months ago
Alternatives and similar repositories for SLOT
Users that are interested in SLOT are comparing it to the libraries listed below
Sorting:
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆87Dec 29, 2023Updated 2 years ago
- One-shot Entropy Minimization☆188Jun 13, 2025Updated 8 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 8 months ago
- ☆59Jul 21, 2025Updated 7 months ago
- ☆62Oct 29, 2024Updated last year
- ☆19Aug 4, 2025Updated 6 months ago
- ☆21Jul 3, 2025Updated 8 months ago
- ☆14Apr 20, 2025Updated 10 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- ☆13Jun 26, 2024Updated last year
- Pruning the VLLMs☆106Dec 9, 2024Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- Re-implementation of the work Livebot☆16Jun 21, 2020Updated 5 years ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆35Jan 20, 2026Updated last month
- ☆64Apr 9, 2024Updated last year
- ☆22Oct 22, 2024Updated last year
- 大模型API企业网关,公司内部API管理,分发聚和系统,支持将多种大模型转换成统一的OpenAI兼容接口,尤其对国内开源模型deepseek,qwen,kimi,glm提供特别支持 可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发),长期更新,支…☆36Sep 12, 2025Updated 5 months ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆71Aug 8, 2025Updated 6 months ago
- ☆20Nov 3, 2024Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context☆46Dec 18, 2025Updated 2 months ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".☆54Feb 21, 2026Updated last week
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆22Oct 10, 2024Updated last year
- [ICML 2023] "NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations" by Yonggan …☆18Mar 10, 2024Updated last year
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆411Nov 21, 2025Updated 3 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆57Oct 10, 2025Updated 4 months ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 8 months ago
- ☆26Apr 14, 2025Updated 10 months ago
- ☆21Jan 17, 2025Updated last year
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆53Feb 10, 2026Updated 3 weeks ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆23May 27, 2025Updated 9 months ago
- BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral)☆19Dec 6, 2022Updated 3 years ago
- ☆96Nov 6, 2024Updated last year
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆98Nov 24, 2025Updated 3 months ago
- ☆19Jun 4, 2021Updated 4 years ago
- Low-Rank Llama Custom Training☆23Mar 27, 2024Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 10 months ago