jcolano / llama3_single_gpuLinks
☆13Updated 11 months ago
Alternatives and similar repositories for llama3_single_gpu
Users that are interested in llama3_single_gpu are comparing it to the libraries listed below
Sorting:
- ☆24Updated 9 months ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- ☆46Updated 9 months ago
- ☆62Updated 11 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆20Updated 2 months ago
- ☆13Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 10 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 4 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆73Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- ☆20Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 8 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆34Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆61Updated 2 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 8 months ago
- Control LLM☆16Updated 2 months ago
- ☆56Updated 7 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 10 months ago
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆32Updated 3 weeks ago
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆60Updated 8 months ago
- ☆47Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 4 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆31Updated 7 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆41Updated 7 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆77Updated 3 months ago
- ☆35Updated last month
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year