Hypernetworks that update LLMs to remember factual information
☆584Mar 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for doc-to-lora
Users that are interested in doc-to-lora are comparing it to the libraries listed below
Sorting:
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆1,216Jun 8, 2025Updated 9 months ago
- Model Merging with Functional Dual Anchors☆47Nov 23, 2025Updated 3 months ago
- A skill to bring back Golden Gate claude☆29Oct 18, 2025Updated 5 months ago
- Training Transformers with knowledge localization (SGTM)☆50Jan 11, 2026Updated 2 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆40Feb 7, 2026Updated last month
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated 2 weeks ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆15Mar 1, 2026Updated 2 weeks ago
- Shaping capabilities with token-level pretraining data filtering☆85Jan 28, 2026Updated last month
- LLM that can be trained on 1 or more GPUs for research.☆32Updated this week
- Elixir port of the Ember framework for building LLM applications☆12Nov 4, 2025Updated 4 months ago
- The core repository for Katanemo's advanced function calling models with top-tier performance. Features three collections: Arch-Function …☆20Jun 23, 2025Updated 8 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆51Jan 30, 2026Updated last month
- ☆20Updated this week
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆41Oct 9, 2025Updated 5 months ago
- Portable Memory Harness for Agents. Grounding the Autonomous Era☆113Mar 13, 2026Updated last week
- ☆53Aug 12, 2025Updated 7 months ago
- ☆41Jul 6, 2025Updated 8 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 6 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated last week
- Computer-Use Agents as Judges for Generative UI☆44Nov 27, 2025Updated 3 months ago
- [CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models☆104Feb 26, 2026Updated 3 weeks ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆32Feb 26, 2025Updated last year
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆39Nov 26, 2025Updated 3 months ago
- ☆13Mar 14, 2026Updated last week
- Scaling Zero-Shot Reference-to-Video Generation☆64Dec 11, 2025Updated 3 months ago
- ☆13Aug 12, 2022Updated 3 years ago
- Kuzushiji Recognition Kaggle 2019. Build a DL model to transcribe ancient Kuzushiji into contemporary Japanese characters. Opening the do…☆25Oct 26, 2019Updated 6 years ago
- QEMU support for a custom board based on a Microchip ATSAMD21G18A microcontroller (MCU)☆14Jun 10, 2024Updated last year
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,192Jan 30, 2025Updated last year
- A PyTorch native library for training speculative decoding models☆43Updated this week
- Reproducible, flexible LLM evaluations☆350Mar 2, 2026Updated 2 weeks ago
- ☆21Jul 3, 2025Updated 8 months ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆143Mar 13, 2026Updated last week
- ☆39Feb 11, 2025Updated last year
- Use VBB interactively, using a map.☆10Jan 11, 2022Updated 4 years ago
- ☆13Sep 26, 2024Updated last year
- ☆14Jun 24, 2024Updated last year
- TiC: Exploring Vision Transformer in Convolution☆11Oct 24, 2023Updated 2 years ago
- Unsupervised Domain Adaptation on Graphs☆15Apr 6, 2022Updated 3 years ago