mehdiir / Roberta-Llama-Mistral
Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama, and Mistral for Disaster Tweets Analysis with Lora
★50 Updated last year
Alternatives and similar repositories for Roberta-Llama-Mistral:
Users that are interested in Roberta-Llama-Mistral are comparing it to the libraries listed below
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ★67 Updated 4 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ★146 Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ★78 Updated 9 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ★57 Updated 2 months ago
- Unofficial implementation of AlpaGasus ★90 Updated last year
- Text classification with Foundation Language Model LLaMA ★114 Updated last year
- A Multilingual Replicable Instruction-Following Model ★94 Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ★166 Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder … ★150 Updated 4 months ago
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation. ★48 Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ★54 Updated 10 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ★142 Updated 5 months ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning ★151 Updated 11 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ★146 Updated 2 months ago
- Data Toolkit for Sailor Language Models ★85 Updated 2 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ★206 Updated 3 months ago
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks ★86 Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ★89 Updated last year
- Efficient Attention for Long Sequence Processing ★92 Updated last year
- Comprehensive benchmark for RAG ★116 Updated 3 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation. ★86 Updated last year
- Zero-shot Document Ranking with Large Language Models. ★109 Updated 7 months ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase… ★51 Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ★125 Updated 11 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ★125 Updated 9 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ★135 Updated 3 months ago
- Codebase for RetroMAE and beyond. ★249 Updated 8 months ago
- A framework for few-shot evaluation of autoregressive language models. ★102 Updated last year
- Code for the paper `Text Classification via Large Language Models`. ★78 Updated last year
- A Survey of Attributions for Large Language Models ★191 Updated 5 months ago