mesolitica / llm-embedding
Finetune Malaysian LLM for Malaysian context embedding task.
☆21Updated last year
Alternatives and similar repositories for llm-embedding
Users who are interested in llm-embedding are comparing it to the libraries listed below.
- ☆14Updated 8 months ago
- ☆20Updated 2 months ago
- Bi-Directional Attention Flow for Machine Comprehension☆9Updated 7 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆19Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- Unifew: Unified Fewshot Learning Model☆18Updated 3 years ago
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆28Updated 3 years ago
- ☆16Updated last year
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆27Updated last year
- ☆25Updated 2 years ago
- ☆10Updated last week
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- ☆62Updated 11 months ago
- ☆18Updated 6 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 2 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Updated 2 years ago
- ☆12Updated last year
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events"☆11Updated 4 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆13Updated last year
- ☆15Updated 2 months ago
- Apply Iprompt on GLM with innovative new methods. Currently supports Chinese QA, English QA and Chinese poem generation.☆20Updated 3 years ago