Finetune Malaysian LLM for Malaysian context embedding task.
☆23Apr 27, 2024Updated last year
Alternatives and similar repositories for llm-embedding
Users that are interested in llm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated 2 weeks ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- ☆32Jul 29, 2024Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 10 months ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- This is a WIP. Here be dragons.☆12Jun 28, 2019Updated 6 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 3 years ago
- GPU-accelerated algorithm for subsampling datasets while preserving diversity☆27Jan 12, 2024Updated 2 years ago
- Tracking part of siamese-fc.☆10Feb 25, 2017Updated 9 years ago
- Electronic funhouse mirror for Halloween that puts animals and monsters on people's faces☆11Oct 31, 2019Updated 6 years ago
- Python and R scripts for visualising and analysing baby sleep patterns.☆12May 17, 2017Updated 8 years ago
- ☆12May 21, 2019Updated 6 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codebase for VideoConviction, accepted at KDD 2025 (D&B Track)☆18Jan 22, 2026Updated 2 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 2 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Simulation of a stop cascade occurring on an exchange☆13Nov 2, 2021Updated 4 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆14Feb 24, 2024Updated 2 years ago
- Metadata and per-statute PDFs for the U.S. Statutes at Large through volume 64 (1789-1951).☆16Apr 24, 2020Updated 5 years ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Feb 20, 2024Updated 2 years ago
- Systematic trading in Python☆12Apr 2, 2026Updated 2 weeks ago
- Resources for the Semeval 2016 Task 3 Community Question Answering. Contains word embeddings and system description results☆10Jan 13, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- This project involved enhancing a GPT-4 based application that simulates sales conversations. The application uses a SalesConversationCha…☆11Mar 30, 2024Updated 2 years ago
- Dataset and codes for identifying sentence-level discourse elements in Chinese argumentative student essays.☆14Nov 16, 2022Updated 3 years ago
- explores Chinese language models with sub-character level visual information☆16Oct 5, 2018Updated 7 years ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated last week
- SiamFC tracking in MXNet.☆17Jun 5, 2019Updated 6 years ago
- Routines for implementing various statistical and machine learning techniques.☆19Nov 28, 2022Updated 3 years ago
- Baseline Models for Argumentative Text Understanding for AI Debater (NLPCC2021)☆12May 21, 2021Updated 4 years ago
- Nextein starter☆17Jan 18, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆49Dec 28, 2022Updated 3 years ago
- Neural networks in Theano (ABANDONED/DISCONTINUED) - see dagbldr for a continuation of this code with some new tricks☆18Feb 26, 2015Updated 11 years ago
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- ☆35May 18, 2023Updated 2 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year