alexa/ramen

Software for transferring pre-trained English models to foreign languages

☆20

Alternatives and similar repositories for ramen

Users who are interested in ramen are comparing it to the libraries listed below.

GeorgeVern / smala
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
☆13 · Sep 17, 2021 · Updated 4 years ago
malteos / clp-transfer
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30 · Jan 25, 2023 · Updated 3 years ago
aiintelligentsystems / next-level-bert
☆16 · Jun 14, 2024 · Updated 2 years ago
ketranm / fan_vs_rnn
The Importance of Being Recurrent for Modeling Hierarchical Structure
☆25 · Jun 27, 2018 · Updated 8 years ago
ketranm / sa-nmt
structured attention encoder
☆13 · Jun 6, 2018 · Updated 8 years ago
philschmid / multilingual-serverless-qa-aws-lambda
☆10 · Dec 17, 2020 · Updated 5 years ago
ckkissane / sae-transfer
Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
☆13 · Jul 18, 2024 · Updated 2 years ago
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28 · Oct 3, 2021 · Updated 4 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58 · Jul 28, 2022 · Updated 3 years ago
stefan-it / gc4lm
GC4LM: A Colossal (Biased) language model for German
☆13 · May 2, 2021 · Updated 5 years ago
alexandra-chron / lexical_xlm_relm
PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…
☆18 · Oct 18, 2022 · Updated 3 years ago
nayeon7lee / factuality_enhanced_lm_hf
☆13 · Nov 11, 2022 · Updated 3 years ago
jqueguiner / wav2vec2-sprint
docker for HF wav2vec2-sprint
☆13 · Mar 26, 2021 · Updated 5 years ago
ltgoslo / simple_elmo_training
Minimal code to train ELMo models in recent versions of TensorFlow
☆14 · Jun 16, 2026 · Updated last month
tylerachang / goldfish
Goldfish: Monolingual language models for 350 languages.
☆27 · Mar 4, 2026 · Updated 4 months ago
jouniluoma / bert-ner-cmv
☆13 · Dec 17, 2021 · Updated 4 years ago
pdufter / minimult
Analyzing mBERT's multilinguality in a small laboratory setting
☆13 · Jun 12, 2023 · Updated 3 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29 · Apr 17, 2024 · Updated 2 years ago
MilaNLProc / bertlang
A web interface to understand language-specific BERT-models
☆18 · Apr 16, 2024 · Updated 2 years ago
ielab / CharacterBERT-DR
The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…
☆16 · May 4, 2022 · Updated 4 years ago
smallbenchnlp / ELECTRA-DeBERTa
☆16 · Dec 14, 2022 · Updated 3 years ago
ClimSocAna / tecb-de
German Text Embedding Clustering Benchmark
☆19 · Mar 15, 2024 · Updated 2 years ago
suzgunmirac / crowd-sampling
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
☆20 · Nov 16, 2022 · Updated 3 years ago
malteos / awesome-prompt-optimization
A curated collection of resources for prompt engineering, optimization, and automatic prompt generation across text, image, video, and mu…
☆18 · Sep 24, 2025 · Updated 9 months ago
DanielJDufour / hatebase
Python Version of Andrew Welter's Hatebase Wrapper
☆10 · Feb 20, 2022 · Updated 4 years ago
anebz / eu-sim
Exploring semantic similarities between contextualized embeddings
☆14 · May 18, 2021 · Updated 5 years ago
Maxscha / commitbench
☆12 · Mar 15, 2024 · Updated 2 years ago
dhfbk / KIND
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆13 · Jun 28, 2023 · Updated 3 years ago
naverlabseurope / ALPS2024-MT-LAB
CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022
☆13 · Apr 1, 2024 · Updated 2 years ago
wietsedv / xpos
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
☆19 · May 17, 2022 · Updated 4 years ago
malteos / awesome-anonymization-for-llms
A collection of resources for PII detection, anonymization, privacy-preserving techniques, and GDPR compliance in Large Language Model (L…
☆19 · Sep 24, 2025 · Updated 9 months ago
primitivefinance / Topological-Data-Analysis
This repository is for topologic and geometric data analysis.
☆13 · Jul 2, 2022 · Updated 4 years ago
konstantinjdobler / nlp-research-template
An opinionated NLP research template
☆10 · Aug 29, 2024 · Updated last year
alexandra-chron / relm_unmt
Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".
☆35 · Mar 16, 2022 · Updated 4 years ago
thakur-nandan / income
INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.
☆24 · Sep 24, 2023 · Updated 2 years ago
huggingface / datasets-tagging
A Streamlit app to add structured tags to a dataset card
☆23 · Jun 30, 2022 · Updated 4 years ago
plutonium-239 / memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation
☆12 · Aug 29, 2024 · Updated last year
LEYADEV / Vocabulary-Transfer
Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf
☆20 · Dec 28, 2021 · Updated 4 years ago
AlexWan0 / infini-gram
An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33 · Jun 19, 2024 · Updated 2 years ago