Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
☆19Jun 11, 2025Updated last year
Alternatives and similar repositories for Remarkable-Robustness-of-LLMs
Users that are interested in Remarkable-Robustness-of-LLMs are comparing it to the libraries listed below.
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Muon fsdp 2☆62Aug 8, 2025Updated 10 months ago
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆26May 21, 2026Updated 3 weeks ago
- Create string diagrams with LaTeX!☆14Jan 3, 2025Updated last year
- ☆23Sep 19, 2024Updated last year
- ☆18Mar 25, 2021Updated 5 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆18Jun 19, 2023Updated 3 years ago
- Code to study the generalisability of benchmark models on non-stationary EHRs.☆15Aug 7, 2019Updated 6 years ago
- [AAAI 2023 Oral] Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training☆14Apr 19, 2023Updated 3 years ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆14Dec 13, 2024Updated last year
- DISSECT: Disentangled Simultaneous Explanations via Concept Traversals☆12Feb 5, 2024Updated 2 years ago
- Test equality between a black-box LLM API and a reference distribution☆18Oct 29, 2024Updated last year
- The Compositionality article class.☆14Mar 16, 2026Updated 3 months ago
- Network analysis of Friends scripts☆14Jun 19, 2020Updated 5 years ago
- Benchmark API for Multidomain Language Modeling☆25Aug 26, 2022Updated 3 years ago
- Repository for paper Decrypting Cryptic Crosswords☆11Jan 15, 2022Updated 4 years ago
- ☆67May 18, 2023Updated 3 years ago
- ☆14Feb 24, 2025Updated last year
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)☆26Apr 10, 2026Updated 2 months ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- ☆13Jun 13, 2022Updated 4 years ago
- ☆13Apr 17, 2025Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated last year
- Experiments learning the even-parity dataset with MPS (tensor trains)☆24Nov 1, 2023Updated 2 years ago
- ☆18Nov 13, 2024Updated last year
- Pytorch-based tools for constructing a vocabulary of visual concepts in a GAN.☆17Feb 25, 2022Updated 4 years ago
- An implementation of LazyLLM token pruning for LLaMa 2 model family.☆13Jan 6, 2025Updated last year
- This Network-graph based literature review tool uses the open-source version of Neo4j with Jupyter Notebooks written in Python to import …☆14Oct 30, 2023Updated 2 years ago
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- ☆12Mar 31, 2021Updated 5 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models