apple / ml-divide-or-conquerLinks

☆13

Alternatives and similar repositories for ml-divide-or-conquer

Users that are interested in ml-divide-or-conquer are comparing it to the libraries listed below

Sorting:

apple / ml-toad
☆14Updated 10 months ago
apple / ml-tree-dst
☆33Updated 3 years ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 2 weeks ago
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year
facebookresearch / coocmap
code for paper "Accessing higher dimensions for unsupervised word translation"
☆21Updated 2 years ago
apple / ml-aura
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆21Updated last year
apple / ml-selfcond
Self-Conditioning Pre-Trained Language Models, ICML 2022
☆31Updated 3 years ago
renll / SeqBoat
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆37Updated last year
princeton-nlp / ShortcutGrammar
EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
☆58Updated 4 months ago
srush / LLM-Talk
☆51Updated last year
apple / ml-planner
☆53Updated last year
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 2 weeks ago
apple / ml-cread
☆28Updated 3 years ago
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Updated 2 years ago
sunyt32 / torchscale
Transformers at any scale
☆41Updated last year
srush / drop7
☆18Updated last year
ngoyal2707 / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18Updated 2 years ago
dojoteef / storium-gpt2
Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…
☆39Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆25Updated 6 months ago
apple / ml-entropy-reconstruction
☆29Updated 2 years ago
marvl-challenge / marvl-code
[EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"
☆30Updated 3 years ago
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆41Updated 6 months ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated 9 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
shreyansh26 / Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
☆44Updated 9 months ago
yikangshen / megablocks
☆20Updated last year
stas00 / porting
Helper scripts and notes that were used while porting various nlp models
☆46Updated 3 years ago
EleutherAI / best-download
URL downloader supporting checkpointing and continuous checksumming.
☆19Updated last year
aws-neuron / aws-neuron-reference-for-megatron-lm
☆14Updated last year
SYSTRAN / fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
☆50Updated 2 months ago