Comparing the Performance of LLMs: A Deep Dive into RoBERTa, Llama, and Mistral for Disaster Tweets Analysis with LoRA
☆51Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for Roberta-Llama-Mistral
Users who are interested in Roberta-Llama-Mistral are comparing it to the libraries listed below.
- ☆11May 29, 2024Updated last year
- Chinese translation project for the Transformers guide☆13Dec 2, 2020Updated 5 years ago
- ☆11Mar 7, 2023Updated 3 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- Some tools for GoLand☆11Nov 20, 2025Updated 4 months ago
- An automated testing framework for API services☆11Jan 16, 2026Updated 2 months ago
- A PyTorch implementation of LRU from the paper "Resurrecting Recurrent Neural Networks for Long Sequences" (https://a…☆11May 22, 2023Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- ☆19Apr 28, 2021Updated 4 years ago
- The official implementation of the paper "DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents"☆29Oct 23, 2025Updated 5 months ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 3 months ago
- Source code for chat.vearne.cc☆13Nov 20, 2025Updated 4 months ago
- ☆22Nov 8, 2024Updated last year
- Resources for: Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup (ACL SRW 2020)☆11Sep 9, 2021Updated 4 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- ☆11Sep 27, 2018Updated 7 years ago
- CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model☆25Apr 27, 2024Updated last year
- Demo of gracefully terminating Docker containers☆15Oct 10, 2017Updated 8 years ago
- Code for Pretraining Language Models with Text-Attributed Heterogeneous Graphs☆16Oct 13, 2023Updated 2 years ago
- A simple desktop app development framework combining Python, Vue.js, Element Plus and Electron.☆11Feb 9, 2023Updated 3 years ago
- ☆12Jul 25, 2023Updated 2 years ago
- Timer based on a delay queue☆17Aug 26, 2023Updated 2 years ago
- Ordinal classification with Deep Learning using the CLM☆13Apr 20, 2023Updated 2 years ago
- [NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases☆22Nov 30, 2024Updated last year
- A platform that provides users with easy access to AI services developed by Montimage and usage of explainable AI techniques (e.g., LIME,…☆10Feb 17, 2026Updated last month
- Add funny emoji to all commits☆10Jun 16, 2022Updated 3 years ago
- ☆10Feb 3, 2021Updated 5 years ago
- Store personal passwords, like 1Password☆21Feb 19, 2024Updated 2 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- TweetFinSent: A Dataset of Stock Sentiments on Twitter☆13Jul 7, 2022Updated 3 years ago
- ☆17Dec 16, 2024Updated last year
- SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations☆11Sep 26, 2023Updated 2 years ago
- Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)☆11Apr 18, 2021Updated 4 years ago
- WorldQuant University's Capstone Project☆14Sep 5, 2023Updated 2 years ago
- ☆15Jan 24, 2023Updated 3 years ago
- Effective Functions for mixed model☆13Jul 10, 2024Updated last year
- Interpreting BERT with LIME and SHAP☆11Jun 12, 2023Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago