Aleph-Alpha-Research / trigramsLinks

☆56

Alternatives and similar repositories for trigrams

Users that are interested in trigrams are comparing it to the libraries listed below

Sorting:

epfml / DenseFormer
☆81Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 11 months ago
arcee-ai / DAM
☆53Updated 8 months ago
Zyphra / Zyda_processing
☆37Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
RobertCsordas / moeut
☆83Updated 11 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75Updated 11 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆184Updated 6 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56Updated last week
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 3 months ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Updated last year
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year
tyler-romero / microR1
Simple repository for training small reasoning models
☆32Updated 5 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68Updated 3 months ago
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆61Updated 9 months ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆62Updated 2 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆46Updated last year
sher222 / LeReT
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆49Updated 9 months ago
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆25Updated 2 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆49Updated 3 months ago
apple / ml-hypercloning
☆48Updated 9 months ago
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated last week
allenai / infinigram-api
☆70Updated 2 weeks ago
LAGoM-NLP / transtokenizer
☆50Updated 6 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆41Updated 6 months ago