Finetune Malaysian LLM for Malaysian context embedding task.
☆23Apr 27, 2024Updated last year
Alternatives and similar repositories for llm-embedding
Users that are interested in llm-embedding are comparing it to the libraries listed below
Sorting:
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆21Jun 2, 2025Updated 9 months ago
- GPU-accelerated algorithm for subsampling datasets while preserving diversity☆27Jan 12, 2024Updated 2 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- ☆32Jul 29, 2024Updated last year
- Collections of IR Research☆37May 18, 2025Updated 9 months ago
- ☆35May 18, 2023Updated 2 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- "Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"☆38Nov 13, 2024Updated last year
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- ☆11Jul 17, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- ☆10May 1, 2025Updated 10 months ago
- GBM implementation on Legate☆14Jan 28, 2026Updated last month
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- Code for EMNLP 2021 paper: "Is Everything in Order? A Simple Way to Order Sentences"☆43Sep 9, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 2 years ago
- This project is based on Opencv, and achieves the part of the generation of segmentation (using depth map) and image denoising using Mark…☆11Oct 29, 2018Updated 7 years ago
- ☆15May 26, 2025Updated 9 months ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- ☆15Jan 12, 2026Updated last month
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- High-Frequency Trading (HFT) order matching engine optimized for low latency. Features NUMA-aware memory allocation, thread pinning, RDTS…☆21Nov 10, 2025Updated 3 months ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- ☆12May 23, 2024Updated last year
- Long Context Research☆29Jan 26, 2026Updated last month
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- This repository contains the implementation code for paper: Mixup Your Own Pairs☆12Oct 1, 2023Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year