edupoux / MVA_2022_SL

☆7

Alternatives and similar repositories for MVA_2022_SL

Users who are interested in MVA_2022_SL are comparing it to the repositories listed below.

pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆517 Updated last year
pbelcak / fastfeedforward
A repository for log-time feedforward networks
☆223 Updated last year
huggingface / speechbox
☆359 Updated last year
huggingface / community-events
Place where folks can contribute to 🤗 community events
☆423 Updated last year
apple / ml-sigma-reparam
☆307 Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268 Updated last year
cisnlp / GlotLID
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
☆147 Updated 2 months ago
lucasmllr / xsbert
explainable Siamese sentence transformers
☆12 Updated last year
facebookresearch / stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…
☆282 Updated 6 months ago
HomebrewML / HeavyBall
Efficient optimizers
☆253 Updated last week
google-research / metricx
☆104 Updated 7 months ago
HeegyuKim / torch-xla-SPMD
PyTorch/XLA SPMD test code on Google TPU
☆23 Updated last year
warner-benjamin / optimi
Fast, Modern, and Low Precision PyTorch Optimizers
☆103 Updated 2 weeks ago
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆519 Updated last year
huggingface / kernels
Load compute kernels from the Hub
☆233 Updated this week
naver / nllb-pruning
Library for pruning experts per language pair in NLLB-200
☆33 Updated 2 years ago
huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆177 Updated 2 years ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142 Updated 3 weeks ago
alvenirai / punctfix
☆22 Updated last year
facebookresearch / belebele
Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.
☆335 Updated 7 months ago
inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆432 Updated 3 months ago
rom1504 / cc2dataset
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
☆317 Updated last year
bjoernpl / lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
☆13 Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256 Updated last year
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆32 Updated 2 months ago
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆135 Updated 6 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆164 Updated last month
huggingface / open_asr_leaderboard
☆116 Updated 2 weeks ago
bjoernpl / GermanBenchmark
A repository containing the code for translating popular LLM benchmarks to German.
☆27 Updated last year
KennethEnevoldsen / scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
☆40 Updated 2 months ago