google-deepmind/randomized_positional_encodings

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/randomized_positional_encodings)

google-deepmind / randomized_positional_encodings

Randomized Positional Encodings Boost Length Generalization of Transformers

☆83

Alternatives and similar repositories for randomized_positional_encodings

Users that are interested in randomized_positional_encodings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fyvo / WMT-Biomed-Test
View on GitHub
☆13Aug 23, 2024Updated last year
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
Yuanhy1997 / HyPe
View on GitHub
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Jul 11, 2023Updated 3 years ago
X-LANCE / public_talks
View on GitHub
Materials of public talks given By SJTU X-LANCE members
☆14Dec 3, 2022Updated 3 years ago
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Curt-Park / serving-codegen-gptj-triton
View on GitHub
Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes
☆20May 30, 2023Updated 3 years ago
Sanster / padding_free_llm_train
View on GitHub
☆16Feb 6, 2024Updated 2 years ago
apple / ml-np-rasp
View on GitHub
☆22Jan 19, 2024Updated 2 years ago
DataStates / datastates-llm
View on GitHub
LLM checkpointing for DeepSpeed/Megatron
☆26Nov 30, 2025Updated 7 months ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
facebookresearch / ModelRatatouille
View on GitHub
Recycling diverse models
☆47Jan 18, 2023Updated 3 years ago
cisnlp / mPLM-Sim
View on GitHub
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11Jan 19, 2024Updated 2 years ago
lovit / synthetic_dataset
View on GitHub
Synthetic data generator for machine learning
☆16Oct 18, 2023Updated 2 years ago
google-research / interpretability-theory
View on GitHub
☆26Apr 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
guanchuwang / Taylor-Unswift
View on GitHub
☆22Oct 3, 2024Updated last year
OpenNLPLab / lightning-attention
View on GitHub
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
☆344Feb 23, 2025Updated last year
MikeWangWZHL / Zemi
View on GitHub
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15May 3, 2023Updated 3 years ago
RUCBM / ICLEval
View on GitHub
☆14Jun 24, 2024Updated 2 years ago
JeanKaddour / NoTrainNoGain
View on GitHub
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆81Aug 30, 2023Updated 2 years ago
hplt-project / OpusTrainer
View on GitHub
Curriculum training
☆22Jun 25, 2025Updated last year
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
DAMO-NLP-SG / CLEX
View on GitHub
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Mar 12, 2024Updated 2 years ago
f-dangel / sirfshampoo
View on GitHub
[ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)
☆15Nov 4, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alexandra-chron / lexical_xlm_relm
View on GitHub
PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…
☆18Oct 18, 2022Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
zjlww / papers
View on GitHub
Connected Papers knockoff, managing academic papers and citations with graph database.
☆12Dec 26, 2023Updated 2 years ago
ZhongYang2026 / Sandglasset-A-Light-Multi-Granularity-Self-Attentive-Network-For-Time-Domain-Speech-Separation
View on GitHub
Speech Separation
☆21Mar 7, 2024Updated 2 years ago
HeegyuKim / ko-rm-judge
View on GitHub
Reward Model을 이용하여 언어모델의 답변을 평가하기
☆30Feb 23, 2024Updated 2 years ago
superlinear-ai / wtpsplit-lite
View on GitHub
✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models
☆39May 2, 2026Updated 2 months ago
tau-nlp / zero_scrolls
View on GitHub
Running inference on the ZeroSCROLLS benchmark
☆22Apr 18, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / mega
View on GitHub
Sequence modeling with Mega.
☆303Jan 28, 2023Updated 3 years ago
openjdk / jdk15u
View on GitHub
https://wiki.openjdk.org/display/JDKUpdates/JDK+15u last released 2023-01-18
☆11Jan 18, 2023Updated 3 years ago
rushil2501 / Active-Noise-Cancellation
View on GitHub
A research project and comparative study on various Active Noise Cancellation Algorithms like FxLMS, EMFN, Chebyshev filter and Hammerste…
☆10Jul 3, 2022Updated 4 years ago
rishikksh20 / iSTFT-Avocodo-pytorch
View on GitHub
Ultrafast GAN based Vocoder for Text to Speech
☆50Jul 16, 2022Updated 4 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
simonjisu / annotated-transformer-kr
View on GitHub
annotated-transformer-kr
☆15May 16, 2019Updated 7 years ago