The attention map viewer for LLaMA models.
☆36Dec 16, 2023Updated 2 years ago
Alternatives and similar repositories for llama-viz
Users that are interested in llama-viz are comparing it to the libraries listed below.
- ☆20Dec 14, 2024Updated last year
- The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"☆14Jun 21, 2024Updated last year
- ☆12Mar 7, 2024Updated 2 years ago
- ☆18Oct 6, 2022Updated 3 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- ☆18Jun 13, 2023Updated 2 years ago
- ☆13Aug 26, 2024Updated last year
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/☆78Aug 29, 2022Updated 3 years ago
- ☆126Updated this week
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆11Aug 8, 2018Updated 7 years ago
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated last week
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- A full-fledged mistral+wandb☆13Aug 16, 2024Updated last year
- The code for the paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆16Sep 10, 2024Updated last year
- opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, and mutan.☆11Mar 27, 2021Updated 5 years ago
- This repository includes the implementation and results of the paper "ChatGPT is fun, but it is not funny! Humor is still challenging Lar…☆13Jul 13, 2023Updated 2 years ago
- ☆22Jan 5, 2024Updated 2 years ago
- ComfyUI Workflows☆10Sep 27, 2025Updated 7 months ago
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- ☆18Feb 24, 2024Updated 2 years ago
- RevLLM -- Reverse Engineering Tools for Large Language Models☆20Feb 29, 2024Updated 2 years ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 5 months ago
- ☆12Feb 17, 2025Updated last year
- ☆20May 27, 2025Updated 11 months ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 4 years ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- A virtual try-on method based on stable-diffusion☆11Apr 27, 2024Updated 2 years ago
- A set of custom nodes that I've either written myself or adapted from other authors for my own convenience.☆11Sep 18, 2024Updated last year