Finetune Malaysian LLM for Malaysian context embedding task.
☆23Apr 27, 2024Updated 2 years ago
Alternatives and similar repositories for llm-embedding
Users that are interested in llm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated 2 months ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- ☆32Jul 29, 2024Updated last year
- ☆28Aug 9, 2025Updated 10 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆24Jun 2, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 3 years ago
- GPU-accelerated algorithm for subsampling datasets while preserving diversity☆27Jan 12, 2024Updated 2 years ago
- Tracking part of siamese-fc.☆10Feb 25, 2017Updated 9 years ago
- Sync your vaults automatically & securely with most of clouds 🌥 by taking advantage of 'RCLONE' & 'syncrclone'☆18May 24, 2022Updated 4 years ago
- Python and R scripts for visualising and analysing baby sleep patterns.☆12May 17, 2017Updated 9 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Codebase for VideoConviction, accepted at KDD 2025 (D&B Track)☆18Jan 22, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 敏感信息,垃圾信息,黄赌毒信息判断☆11Jul 17, 2017Updated 8 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- Simulation of a stop cascade occurring on an exchange☆13Nov 2, 2021Updated 4 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆14Feb 24, 2024Updated 2 years ago
- Systematic trading in Python☆13Jun 3, 2026Updated 2 weeks ago
- Resources for the Semeval 2016 Task 3 Community Question Answering. Contains word embeddings and system description results☆10Jan 13, 2017Updated 9 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- This project involved enhancing a GPT-4 based application that simulates sales conversations. The application uses a SalesConversationCha…☆11Mar 30, 2024Updated 2 years ago
- explores Chinese language models with sub-character level visual information☆16Oct 5, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated 2 months ago
- Routines for implementing various statistical and machine learning techniques.☆19Nov 28, 2022Updated 3 years ago
- Baseline Models for Argumentative Text Understanding for AI Debater (NLPCC2021)☆12May 21, 2021Updated 5 years ago
- ☆16Sep 11, 2019Updated 6 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆49Dec 28, 2022Updated 3 years ago
- Neural networks in Theano (ABANDONED/DISCONTINUED) - see dagbldr for a continuation of this code with some new tricks☆18Feb 26, 2015Updated 11 years ago
- ☆35May 18, 2023Updated 3 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 4 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Collections of IR Research☆37May 18, 2025Updated last year
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago