google-deepmind / latent-multi-hop-reasoningView external linksLinks
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
β90Mar 18, 2025Updated 10 months ago
Alternatives and similar repositories for latent-multi-hop-reasoning
Users that are interested in latent-multi-hop-reasoning are comparing it to the libraries listed below
Sorting:
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiersβ27Mar 1, 2025Updated 11 months ago
- π gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents.β15Oct 23, 2025Updated 3 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformersβ75Jun 23, 2025Updated 7 months ago
- Lego for GRPOβ30May 27, 2025Updated 8 months ago
- The AI Adoption and Management Framework (AI-AMF) is a structured methodology designed to help organizations successfully integrate artifβ¦β14Feb 18, 2025Updated 11 months ago
- Repository for "Training Language Models To Explain Their Own Computations"β20Dec 22, 2025Updated last month
- CUDA implementation of Wavelet KAN.β16Jun 8, 2024Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ββ34Apr 20, 2025Updated 9 months ago
- β11Sep 7, 2024Updated last year
- Official repo for BWLer: Barycentric Weight Layerβ29Sep 26, 2025Updated 4 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Modelsβ39Jun 14, 2025Updated 8 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?β46Dec 25, 2025Updated last month
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcutsβ17Mar 11, 2025Updated 11 months ago
- β17Jun 11, 2025Updated 8 months ago
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models"β21Feb 23, 2025Updated 11 months ago
- Entropy Based Sampling and Parallel CoT Decodingβ17Oct 9, 2024Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq APIβ¦β22Oct 31, 2024Updated last year
- Code for "On Measuring Faithfulness of Natural Language Explanations"β21Jul 23, 2024Updated last year
- β16Jul 23, 2024Updated last year
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advanβ¦β30Apr 1, 2025Updated 10 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."β18Dec 13, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β26Feb 25, 2025Updated 11 months ago
- Nexusflow function call, tool use, and agent benchmarks.β30Dec 13, 2024Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projectionsβ21Oct 15, 2024Updated last year
- β40Jun 11, 2025Updated 8 months ago
- Memory optimized Mixture of Expertsβ73Jul 25, 2025Updated 6 months ago
- Example code and guides for building with Scrapybaraβ139Mar 20, 2025Updated 10 months ago
- β45Jul 21, 2025Updated 6 months ago
- β52Feb 12, 2025Updated last year
- Code to enable layer-level steering in LLMs using sparse auto encodersβ29Sep 18, 2025Updated 4 months ago
- new optimizerβ20Aug 4, 2024Updated last year
- Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"β25Dec 16, 2025Updated 2 months ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.β60Updated this week
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,185Jan 30, 2025Updated last year
- Personal project, Generative AI, Streamlit, Pythonβ54Apr 30, 2025Updated 9 months ago
- MCP (Model Context Protocol) server for Weaviateβ160May 22, 2025Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)β109Mar 7, 2025Updated 11 months ago
- Radio Javan downloader and info scraper for Node.jsβ21Aug 20, 2023Updated 2 years ago
- β41Apr 9, 2025Updated 10 months ago