pytorch-tpu/fairseq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pytorch-tpu/fairseq)

pytorch-tpu / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

☆22

Alternatives and similar repositories for fairseq

Users that are interested in fairseq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lemmonation / jm-nat
View on GitHub
Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"
☆39Jun 24, 2020Updated 6 years ago
ymcui / Cross-Lingual-MRC
View on GitHub
Cross-Lingual Machine Reading Comprehension (EMNLP 2019)
☆67Nov 6, 2019Updated 6 years ago
SaeedNajafi / pytorch-ocd
View on GitHub
Implementation of the Optimal Completion Distillation for Sequence Labeling
☆17Jul 25, 2024Updated last year
AlexNaitsat / ABCD_Algorithm
View on GitHub
☆12Jun 15, 2021Updated 5 years ago
pytorch-tpu / examples
View on GitHub
This repository contains example code to build models on TPUs
☆30Feb 17, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
allenai / missing-fact
View on GitHub
Repository for the What's Missing EMNLP'19 paper
☆17Mar 12, 2021Updated 5 years ago
Noahs-ARK / MAE
View on GitHub
☆21May 5, 2020Updated 6 years ago
facebookresearch / DisCo
View on GitHub
DisCo Transformer for Non-autoregressive MT
☆77Jul 28, 2022Updated 3 years ago
xwhan / ProQA
View on GitHub
Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
☆43Jun 12, 2023Updated 3 years ago
kefirski / amt
View on GitHub
Adversarial Machine Translation with pytorch
☆23Jan 14, 2018Updated 8 years ago
amake / moses-smt
View on GitHub
Dock You a Moses: Moses Statistical MT in a container
☆14Feb 18, 2020Updated 6 years ago
bozheng-hit / VoCapXLM
View on GitHub
Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"
☆20Nov 12, 2021Updated 4 years ago
google / airdialogue_model
View on GitHub
☆17Jul 16, 2020Updated 6 years ago
microsoft / SparseMixer
View on GitHub
Sparse Backpropagation for Mixture-of-Expert Training
☆30Jul 2, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
gonglinyuan / metro_t0
View on GitHub
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
☆22Nov 1, 2023Updated 2 years ago
sofiaherrero / lime-ner
View on GitHub
lime-ner: extending LIME for Named Entity Recognition
☆10Aug 15, 2018Updated 7 years ago
zhuohan123 / macaron-net
View on GitHub
Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"
☆147Jun 10, 2019Updated 7 years ago
lemmonation / fcl-nat
View on GitHub
Code for "Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation"
☆13Jul 10, 2020Updated 6 years ago
HMJiangGatech / huge
View on GitHub
High-Dimensional Undirected Graph Estimation
☆13Jan 10, 2024Updated 2 years ago
microsoft / deepnmt
View on GitHub
☆31Jun 28, 2022Updated 4 years ago
allenai / tpu_pretrain
View on GitHub
LM Pretraining with PyTorch/TPU
☆137Oct 24, 2019Updated 6 years ago
HarshTrivedi / phd-advice
View on GitHub
A list of advisory blogs and resources that I have found useful so far.
☆22Nov 25, 2020Updated 5 years ago
shangjingbo1226 / ContrastSubgraphMining
View on GitHub
Contrast Subgraph Mining from Coherent Cores
☆13Feb 20, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
thespectrewithin / joint_align
View on GitHub
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework
☆52Feb 1, 2020Updated 6 years ago
yuchenlin / awesome-commonsense
View on GitHub
[Work in progress] A reading list for machine commonsense reasoning
☆34Apr 14, 2020Updated 6 years ago
ayanc / rpgan
View on GitHub
RP-GAN: Stable GAN Training with Random Projections
☆22Jun 27, 2018Updated 8 years ago
stefan-it / italian-bertelectra
View on GitHub
🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)
☆18Oct 20, 2022Updated 3 years ago
lemmonation / spine
View on GitHub
Code of the paper "SPINE: Structural Identity Preserved Inductive Network Embedding"
☆12Jul 29, 2019Updated 6 years ago
LiyuanLucasLiu / Torch-Scope
View on GitHub
A Toolkit for Training, Tracking, Saving Models and Syncing Results
☆62Mar 12, 2020Updated 6 years ago
shwinshaker / LipGrow
View on GitHub
An adaptive training algorithm for residual network
☆17Aug 22, 2020Updated 5 years ago
se4u / mvlsa
View on GitHub
Multiview LSA
☆11Jun 22, 2015Updated 11 years ago
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sgraaf / Replicate-Toronto-BookCorpus
View on GitHub
This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset
☆49Apr 6, 2022Updated 4 years ago
william-silversmith / countless
View on GitHub
Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…
☆10Oct 23, 2019Updated 6 years ago
masakhane-io / masakhane-news
View on GitHub
MasakhaNEWS: News Topic Classification for African Languages
☆26May 12, 2024Updated 2 years ago
dragen1860 / Graph-Neural-Network-Papers
View on GitHub
Curated Lists for graph neural network, graph convolutional network, graph attention network, etc.
☆27Apr 22, 2019Updated 7 years ago
nyu-dl / dl4dial-bayesian-calibration
View on GitHub
☆14Oct 25, 2019Updated 6 years ago
fallcat / stupidNMT
View on GitHub
Hard-Coded Gaussian Attention for Neural Machine Translation
☆36May 22, 2023Updated 3 years ago
uwnlp / hitl_parsing
View on GitHub
☆12Nov 15, 2016Updated 9 years ago