LoserCheems / WonderfulMatricesLinks
Wonderful Matrices to Build Small Language Models
☆44Updated 11 months ago
Alternatives and similar repositories for WonderfulMatrices
Users that are interested in WonderfulMatrices are comparing it to the libraries listed below
Sorting:
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆115Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Updated last month
- Generate Python Package with Simple Prompts☆75Updated last year
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆65Updated last month
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Updated last year
- XmodelLM☆38Updated last year
- Synthetic Data Engine 💎☆72Updated 2 weeks ago
- Pivotal Token Search☆142Updated last month
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated last year
- The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"☆29Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Updated 3 months ago
- ☆19Updated 8 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Updated last year
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆326Updated 3 months ago
- ☆41Updated 2 years ago
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆183Updated last week
- The official implementation of Preference Data Reward-Augmentation.☆18Updated 8 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated 11 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Updated 5 months ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Updated 10 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated 2 years ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆246Updated 2 weeks ago
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Updated 6 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆29Updated last year
- Interactive Variational Autoencoder (VAE)☆65Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 10 months ago