Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.
☆53Jun 7, 2024Updated last year
Alternatives and similar repositories for llama-squad
Users that are interested in llama-squad are comparing it to the libraries listed below.
- In-context learning, Fine-Tuning, RLHF on Flan-T5☆13Aug 30, 2023Updated 2 years ago
- Text perturbation methods to evaluate the robustness of NLP models☆20Oct 6, 2021Updated 4 years ago
- Python scripts for setting up private LLMs locally and in the cloud with LangChain, GPT4All and Cerebrium☆11May 29, 2023Updated 2 years ago
- CS172 Final project: Text Image Super-Resolution Reconstruction☆14Jun 15, 2020Updated 5 years ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆34Nov 1, 2025Updated 4 months ago
- ☆16Mar 3, 2024Updated 2 years ago
- Launch machine learning models into production using flask☆13Aug 11, 2022Updated 3 years ago
- ☆11Feb 3, 2025Updated last year
- Patient Letter Generation☆12Aug 22, 2024Updated last year
- turn books, articles, plaintext or webpages into interactive read eval print loops, manage bookmarks in your own local database☆16Mar 22, 2026Updated last week
- ☆22Aug 8, 2025Updated 7 months ago
- ☆11Jun 27, 2019Updated 6 years ago
- ACL24☆11Jun 7, 2024Updated last year
- ☆19Apr 5, 2022Updated 3 years ago
- Public repository for the natural language processing based [Korean descriptive math word problem dataset].☆14Jun 12, 2023Updated 2 years ago
- Implementation of Wasserstein Generative Adversarial Networks using Tensorflow☆12Jul 25, 2018Updated 7 years ago
- ☆11Feb 22, 2019Updated 7 years ago
- Explains Canadian Bills☆17May 13, 2023Updated 2 years ago
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds☆11Oct 28, 2019Updated 6 years ago
- Support for training SSD on TF2☆12Mar 29, 2023Updated 3 years ago
- Mutual information estimators and benchmarks☆14Mar 2, 2026Updated 3 weeks ago
- A tutorial on learned non-adversarial invariance in neural networks☆14Dec 8, 2019Updated 6 years ago
- 🎓Automatically Update CV Papers Daily using Github Actions (Updates Every 12 Hours)☆12Updated this week
- ☆13Jul 22, 2023Updated 2 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- The official codebase for running the experiments described in the AVDC paper.☆20Oct 2, 2024Updated last year
- This repository contains software to replicate the iterative realignment for continuous sign language recognition as described in the pap…☆17Dec 24, 2019Updated 6 years ago
- ☆17Jan 21, 2025Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi…☆18Aug 7, 2024Updated last year
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Code and datasets for the ACL 2020 paper "Detecting Perceived Emotions in Hurricane Disasters"☆12Oct 4, 2022Updated 3 years ago
- Reverse engineered ChatGPT API☆10Feb 14, 2023Updated 3 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated last year
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22)☆21Nov 14, 2022Updated 3 years ago
- Official repository for "Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems"☆75Jan 26, 2022Updated 4 years ago
- CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models☆15Oct 14, 2024Updated last year
- Neural Image Caption (NIC) on chainer, its pretrained models on English and Japanese image caption datasets.☆17Dec 14, 2018Updated 7 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Aug 18, 2024Updated last year