jcolano / llama3_single_gpu
☆13Updated 9 months ago
Alternatives and similar repositories for llama3_single_gpu:
Users that are interested in llama3_single_gpu are comparing it to the libraries listed below
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- ☆62Updated 3 weeks ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 2 months ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 10 months ago
- ☆53Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆29Updated 2 months ago
- ☆59Updated 9 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- ☆24Updated 7 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆24Updated this week
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆53Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆50Updated 5 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆73Updated last month
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 11 months ago
- ☆45Updated 7 months ago
- ☆28Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated 3 weeks ago
- ☆90Updated last month
- Composition of Multimodal Language Models From Scratch☆14Updated 8 months ago
- ☆14Updated last year
- ☆41Updated 2 months ago
- ☆19Updated 2 weeks ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 3 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆65Updated 7 months ago
- Running load tests on a FastAPI application using Locust☆15Updated last month
- ☆48Updated 5 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆13Updated 3 weeks ago