iliaschalkidis / flash-robertaView external linksLinks
Hugging Face RoBERTa with Flash Attention 2
☆24Sep 14, 2025Updated 5 months ago
Alternatives and similar repositories for flash-roberta
Users that are interested in flash-roberta are comparing it to the libraries listed below
Sorting:
- ☆24Jan 30, 2025Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- ☆13Nov 19, 2022Updated 3 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- Finetune mistral-7b-instruct for sentence embeddings☆88May 2, 2024Updated last year
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- ☆42Apr 22, 2025Updated 9 months ago
- [SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆18Feb 29, 2024Updated last year
- doc-cov is a tool for measuring docstring coverage of Python project.☆12Mar 8, 2019Updated 6 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Dec 12, 2024Updated last year
- ☆21Apr 16, 2024Updated last year
- presentation slides☆20Jan 23, 2026Updated 3 weeks ago
- Implementation of the report: on the domain robustness of prefix and prompt tuning☆20Mar 10, 2022Updated 3 years ago
- ☆57Jan 26, 2025Updated last year
- ☆47Feb 7, 2024Updated 2 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Checkout the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Jan 4, 2023Updated 3 years ago
- Crispy reranking models by Mixedbread☆45Sep 17, 2025Updated 4 months ago
- Deep Learning Utilities for PyTorch users (old name: Zero)☆38Apr 21, 2025Updated 9 months ago
- ☆36Oct 4, 2023Updated 2 years ago
- fine-tuning tutorial☆17Dec 13, 2025Updated 2 months ago
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- 書籍「Python自然言語処理入門」用サポートサイト☆13Mar 25, 2020Updated 5 years ago
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems☆10Apr 11, 2025Updated 10 months ago
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer☆36Oct 2, 2022Updated 3 years ago
- DPO, but faster 🚀☆47Dec 6, 2024Updated last year
- ☆10Jan 9, 2024Updated 2 years ago
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification☆21Jan 29, 2026Updated 2 weeks ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Nov 15, 2025Updated 2 months ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- An Abstractive Summarization(for Datasets in English format) Implementation with Transformer and Pointer-generator☆12Dec 31, 2020Updated 5 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 6 months ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- ☆10May 1, 2025Updated 9 months ago
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 2 months ago