rohinmanvi / Capability-Aware-and-Mid-Generation-Self-EvaluationsView external linksLinks
☆21Jul 25, 2025Updated 6 months ago
Alternatives and similar repositories for Capability-Aware-and-Mid-Generation-Self-Evaluations
Users that are interested in Capability-Aware-and-Mid-Generation-Self-Evaluations are comparing it to the libraries listed below
Sorting:
- ☆33Jul 9, 2025Updated 7 months ago
- Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.☆26Apr 6, 2025Updated 10 months ago
- ☆41Jun 19, 2024Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- ☆29Nov 9, 2025Updated 3 months ago
- ☆46Jun 24, 2025Updated 7 months ago
- ☆23Jul 5, 2024Updated last year
- ☆25Apr 10, 2025Updated 10 months ago
- SFT+RL boosts multimodal reasoning☆46Jun 27, 2025Updated 7 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆30Oct 20, 2025Updated 3 months ago
- 👷♂️High performance agent framework that can do everything. Minion is designed to execute any type of queries, offering a variety of fe…☆118Updated this week
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Apr 26, 2025Updated 9 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- PROSE Public Benchmark Suite☆31Sep 15, 2025Updated 5 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Dec 10, 2024Updated last year
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆85Updated this week
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- Dynamic Shell Command MCP Server☆41Feb 27, 2025Updated 11 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆37Nov 10, 2024Updated last year
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Dec 7, 2025Updated 2 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Nov 26, 2025Updated 2 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆33May 1, 2025Updated 9 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- The High Performance LLM Native Mock Server☆17Jan 8, 2026Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Jan 19, 2025Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 6 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs☆38Jan 26, 2026Updated 3 weeks ago
- HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model☆41Feb 16, 2025Updated last year
- 💧 Inland water systems are essential to our environment because they are vital ecosystems that are bio-diverse. Thus, finding innovative…☆11Apr 26, 2020Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆11Jun 22, 2025Updated 7 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 3 months ago
- ☆53Aug 24, 2025Updated 5 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆28Oct 23, 2025Updated 3 months ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆80Dec 5, 2025Updated 2 months ago
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year