EvanZhuang / MetaTree
Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers
☆104Updated 5 months ago
Alternatives and similar repositories for MetaTree:
Users that are interested in MetaTree are comparing it to the libraries listed below
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM☆64Updated 5 months ago
- PyTorch implementation of models from the Zamba2 series.☆176Updated 3 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated 2 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 11 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Supercharge huggingface transformers with model parallelism.☆76Updated 4 months ago
- ☆30Updated 9 months ago
- ☆67Updated 6 months ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆44Updated last week
- Implementation of the Llama architecture with RLHF + Q-learning☆162Updated 2 weeks ago
- NanoGPT (124M) quality in 2.67B tokens☆27Updated this week
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆30Updated 3 months ago
- ☆40Updated 9 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆36Updated last week
- ☆78Updated 10 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆215Updated 10 months ago
- ☆41Updated 3 weeks ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆118Updated 6 months ago
- A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).☆139Updated last month
- SaLSa Optimizer implementation (No learning rates needed)☆28Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆95Updated last month
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆21Updated 2 months ago
- Wonderful Matrices to Build Small Language Models☆44Updated this week
- ☆46Updated last year
- ☆47Updated 5 months ago
- This is the official repository for Inheritune.☆109Updated last week
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆215Updated 3 weeks ago