mesolitica / llm-embedding
Finetune Malaysian LLM for Malaysian context embedding task.
☆ 20 · Updated 11 months ago
Alternatives and similar repositories for llm-embedding:
Users interested in llm-embedding are comparing it to the repositories listed below.
- ☆ 19 · Updated 4 months ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs ☆ 16 · Updated 3 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆ 30 · Updated 2 weeks ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆ 14 · Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs ☆ 34 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆ 47 · Updated last year
- Efficient Memory-Augmented Transformers ☆ 34 · Updated 2 years ago
- ☆ 13 · Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP 2021 ☆ 29 · Updated 2 years ago
- ☆ 16 · Updated last year
- The codebase for our ACL 2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni… ☆ 29 · Updated last year
- EMNLP 2021: Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections ☆ 50 · Updated 3 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction ☆ 24 · Updated 2 years ago
- ☆ 15 · Updated last year
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti… ☆ 21 · Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆ 87 · Updated last year
- ☆ 12 · Updated last year
- Official repo for the NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions" ☆ 63 · Updated last year
- ☆ 25 · Updated 2 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval ☆ 29 · Updated 2 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆ 46 · Updated 4 months ago
- Code for the arXiv paper "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆ 58 · Updated 2 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆ 22 · Updated 2 years ago
- ☆ 97 · Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆ 19 · Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆ 31 · Updated last year
- Transformers at any scale ☆ 41 · Updated last year
- Generate BERT vocabularies and pretraining examples from Wikipedias ☆ 18 · Updated 4 years ago
- ☆ 14 · Updated 5 months ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022 Findings; stay tuned, more will be updated) ☆ 22 · Updated 2 years ago