LoserCheems / WonderfulMatrices
Wonderful Matrices to Build Small Language Models
☆44Updated 3 months ago
Alternatives and similar repositories for WonderfulMatrices
Users that are interested in WonderfulMatrices are comparing it to the libraries listed below
Sorting:
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated 10 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 7 months ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆52Updated this week
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆107Updated 8 months ago
- XmodelLM☆39Updated 5 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆32Updated 6 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆39Updated 3 weeks ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆89Updated 9 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆41Updated 3 months ago
- ☆31Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆31Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning☆50Updated last month
- ☆27Updated 10 months ago
- Simple repository for training small reasoning models☆27Updated 3 months ago
- ☆68Updated 10 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 11 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated 3 weeks ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆22Updated 2 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 6 months ago
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆24Updated last year
- A benchmark for testing memorization abilities of LMs☆20Updated 7 months ago
- ☆11Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 weeks ago
- ☆27Updated last month
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆22Updated 5 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆21Updated 5 months ago
- Implementation☆24Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆67Updated 5 months ago