Collection of papers using LLaMA as backbone model
☆49Apr 6, 2025Updated last year
Alternatives and similar repositories for LLaMA-Paper-List
Users that are interested in LLaMA-Paper-List are comparing it to the libraries listed below.
- Chang Gung University Computer Science / Artificial Intelligence learning material☆27Sep 5, 2024Updated last year
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆12Aug 29, 2023Updated 2 years ago
- Code for SRMRL☆19Sep 5, 2021Updated 4 years ago
- ☆19Jan 21, 2022Updated 4 years ago
- ☆24Oct 19, 2021Updated 4 years ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆21Jul 16, 2023Updated 2 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- ☆28Nov 29, 2024Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆25Jun 8, 2025Updated last year
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆17Dec 14, 2023Updated 2 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆36May 23, 2023Updated 3 years ago
- [CVPR 2025 Highlight] v-CLR: View-Consistent Learning for Open-World Instance Segmentation☆21May 31, 2026Updated last month
- ☆67May 7, 2026Updated last month
- COBS: COmprehensive Building Simulator☆16Jun 23, 2022Updated 4 years ago
- ☆11Sep 27, 2022Updated 3 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Dec 8, 2023Updated 2 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated 2 years ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆52Jul 21, 2025Updated 11 months ago
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models☆30Dec 15, 2025Updated 6 months ago
- [ICLR 2025] SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training☆41Apr 4, 2025Updated last year
- ☆10Jun 21, 2021Updated 5 years ago
- ☆63Apr 2, 2021Updated 5 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆13Feb 22, 2025Updated last year
- pytorch☆10Apr 13, 2022Updated 4 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆19Feb 20, 2025Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 9 months ago
- Website for CSE 234, Winter 2025☆16Mar 24, 2025Updated last year
- Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.☆15Feb 17, 2017Updated 9 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]☆29Sep 15, 2022Updated 3 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆14May 12, 2023Updated 3 years ago
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated last year
- Smoothing video traffic to make it a friendlier internet neighbor☆14Apr 23, 2024Updated 2 years ago
- ☆12Feb 2, 2026Updated 5 months ago
- Repository for the DPP'23 course☆11May 2, 2024Updated 2 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated 2 years ago
- EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework☆43Jan 22, 2026Updated 5 months ago