Finetune Malaysian LLM for Malaysian context embedding task.
☆23Apr 27, 2024Updated 2 years ago
Alternatives and similar repositories for llm-embedding
Users that are interested in llm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated last month
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- ☆32Jul 29, 2024Updated last year
- ☆26Aug 9, 2025Updated 9 months ago
- ☆13Mar 27, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 11 months ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- This is a WIP. Here be dragons.☆12Jun 28, 2019Updated 6 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 3 years ago
- Tracking part of siamese-fc.☆10Feb 25, 2017Updated 9 years ago
- Sync your vaults automatically & securely with most of clouds 🌥 by taking advantage of 'RCLONE' & 'syncrclone'☆18May 24, 2022Updated 4 years ago
- This project is based on Opencv, and achieves the part of the generation of segmentation (using depth map) and image denoising using Mark…☆11Oct 29, 2018Updated 7 years ago
- Python and R scripts for visualising and analysing baby sleep patterns.☆12May 17, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- ☆12May 21, 2019Updated 7 years ago
- Codebase for VideoConviction, accepted at KDD 2025 (D&B Track)☆18Jan 22, 2026Updated 4 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Simulation of a stop cascade occurring on an exchange☆13Nov 2, 2021Updated 4 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆14Feb 24, 2024Updated 2 years ago
- Metadata and per-statute PDFs for the U.S. Statutes at Large through volume 64 (1789-1951).☆17Apr 24, 2020Updated 6 years ago
- Systematic trading in Python☆13May 20, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- This project involved enhancing a GPT-4 based application that simulates sales conversations. The application uses a SalesConversationCha…☆11Mar 30, 2024Updated 2 years ago
- Dataset and codes for identifying sentence-level discourse elements in Chinese argumentative student essays.☆14Nov 16, 2022Updated 3 years ago
- explores Chinese language models with sub-character level visual information☆16Oct 5, 2018Updated 7 years ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated last month
- SiamFC tracking in MXNet.☆17Jun 5, 2019Updated 6 years ago
- Routines for implementing various statistical and machine learning techniques.☆19Nov 28, 2022Updated 3 years ago
- Nextein starter☆17Jan 18, 2023Updated 3 years ago
- ☆16Sep 11, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆49Dec 28, 2022Updated 3 years ago
- Neural networks in Theano (ABANDONED/DISCONTINUED) - see dagbldr for a continuation of this code with some new tricks☆18Feb 26, 2015Updated 11 years ago
- ☆35May 18, 2023Updated 3 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Adaptation of Monte Carlo and SARSA algorithms (Reinforcement Learning) for learning the policy of sellers/ buyers in stock market☆12Jul 23, 2018Updated 7 years ago