LoserCheems / WonderfulMatrices

Wonderful Matrices to Build Small Language Models

☆44

Alternatives and similar repositories for WonderfulMatrices

Users who are interested in WonderfulMatrices compare it to the libraries listed below.

EvanZhuang / MetaTree
Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers
☆112 Updated 10 months ago
annahedstroem / sanity-checks-revisited
[NeurIPS XAIA & Springer] Code and notebooks for the paper "A Fresh Look at Sanity Checks for Saliency Maps"
☆25 Updated last year
XiaoduoAILab / XmodelLM
XmodelLM
☆39 Updated 8 months ago
ayrna / dlordinal
Open-source Python toolkit focused on deep learning with ordinal methodologies
☆56 Updated 2 weeks ago
codelion / pts
Pivotal Token Search
☆119 Updated 3 weeks ago
didiglobal-fintech-credit-risk / FinLangNet
☆2 Updated 2 months ago
giangdip2410 / HyperRouter
Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33 Updated last year
UKPLab / 5pils
Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…
☆42 Updated 3 months ago
mostly-ai / mostlyai-engine
Synthetic Data Engine 💎
☆64 Updated this week
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89 Updated last year
facebookresearch / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆34 Updated 3 months ago
GitsSaikat / PyGen
Generate Python Package with Simple Prompts
☆71 Updated 8 months ago
ml-jku / hopfield-boosting
☆31 Updated last year
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117 Updated this week
mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…
☆19 Updated 10 months ago
rosewang2008 / posr
Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings
☆33 Updated 8 months ago
AI-Hypercomputer / RecML
☆186 Updated this week
xnought / vae-explainer
Interactive Variational Autoencoder (VAE)
☆57 Updated 9 months ago
pixas / MedSSS
Towards Medical Small Language Models with Self-Evolved Slow Thinking
☆79 Updated 2 months ago
git-disl / Virus
This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
☆50 Updated 6 months ago
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
☆19 Updated 2 months ago
robertvacareanu / llm4regression
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…
☆153 Updated 11 months ago
AgnostiqHQ / multi-agent-llm
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
☆118 Updated 6 months ago
visresearch / LLaVA-STF
The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"
☆29 Updated 2 months ago
thu-spmi / PPT2DST
☆11 Updated last year
moucheng2017 / SOP-LVM-ICL-Ensemble
[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…
☆23 Updated 4 months ago
ThrunGroup / maptree
☆39 Updated last year
ysh-1998 / CoWPiRec
The official implementation of Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.
☆24 Updated last year
Babelscape / LLM-Oasis
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆23 Updated 8 months ago
METR / eval-analysis-public
Public repository containing METR's DVC pipeline for eval data analysis
☆91 Updated 4 months ago