Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.
☆53 Jun 7, 2024 Updated last year
Alternatives and similar repositories for llama-squad
Users that are interested in llama-squad are comparing it to the libraries listed below.
- In-context learning, Fine-Tuning, RLHF on Flan-T5 ☆13 Aug 30, 2023 Updated 2 years ago
- Python scripts for setting up private LLMs locally and in the cloud with LangChain, GPT4All and Cerebrium ☆11 May 29, 2023 Updated 2 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference" ☆10 Sep 3, 2024 Updated last year
- GitHub repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL" ☆35 Nov 1, 2025 Updated 6 months ago
- ☆16 Mar 3, 2024 Updated 2 years ago
- ☆13 Sep 20, 2023 Updated 2 years ago
- Launch machine learning models into production using Flask ☆13 Aug 11, 2022 Updated 3 years ago
- This repo contains the code for Late Prompt Tuning. ☆12 Dec 22, 2025 Updated 4 months ago
- Patient Letter Generation ☆12 Aug 22, 2024 Updated last year
- ☆11 May 25, 2023 Updated 2 years ago
- DifferentialEquations.jl with PyTorch ☆11 Oct 12, 2022 Updated 3 years ago
- ☆11 Jun 27, 2019 Updated 6 years ago
- ACL24 ☆11 Jun 7, 2024 Updated last year
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse' ☆13 Aug 2, 2024 Updated last year
- ☆19 Apr 5, 2022 Updated 4 years ago
- Explains Canadian Bills ☆17 May 13, 2023 Updated 2 years ago
- https://aiisc.ai/defactify2/factify.html ☆15 Nov 27, 2023 Updated 2 years ago
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds ☆11 Oct 28, 2019 Updated 6 years ago
- Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization ☆11 Nov 29, 2022 Updated 3 years ago
- Mutual information estimators and benchmarks ☆14 Apr 9, 2026 Updated 3 weeks ago
- A tutorial on learned non-adversarial invariance in neural networks ☆14 Dec 8, 2019 Updated 6 years ago
- ☆13 Jul 22, 2023 Updated 2 years ago
- ☆16 Nov 14, 2022 Updated 3 years ago
- Official codes for Scalable Infomin Learning, NeurIPS 2022 ☆14 Feb 28, 2023 Updated 3 years ago
- This repository contains software to replicate the iterative realignment for continuous sign language recognition as described in the pap… ☆17 Dec 24, 2019 Updated 6 years ago
- Codes for "EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases" ☆13 Nov 12, 2021 Updated 4 years ago
- ☆17 Jan 21, 2025 Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024) ☆22 May 18, 2024 Updated last year
- The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi… ☆18 Aug 7, 2024 Updated last year
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact ☆12 Nov 30, 2020 Updated 5 years ago
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22) ☆21 Nov 14, 2022 Updated 3 years ago
- Reverse engineered ChatGPT API ☆10 Feb 14, 2023 Updated 3 years ago
- This is the repository for TimelineQA, a benchmark for querying lifelogs. ☆26 Jul 5, 2023 Updated 2 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 Aug 18, 2024 Updated last year
- FastAPI with machine learning ☆10 Apr 23, 2023 Updated 3 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆60 Jul 23, 2024 Updated last year
- CHAE: Fine-Grained Controllable Story Generation with Characters, Actions and Emotions ☆10 Jan 31, 2023 Updated 3 years ago
- ☆19 Sep 24, 2022 Updated 3 years ago
- GitHub resource repository for the book <혼자 만들면서 공부하는 파이썬> ☆17 Mar 24, 2026 Updated last month