togethercomputer/Llama-2-7B-32K-Instruct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/togethercomputer/Llama-2-7B-32K-Instruct)

togethercomputer / Llama-2-7B-32K-Instruct

☆84

Alternatives and similar repositories for Llama-2-7B-32K-Instruct

Users that are interested in Llama-2-7B-32K-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iantbutler01 / ditty
View on GitHub
A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.
☆16Jun 10, 2026Updated last month
LeeSureman / MoT
View on GitHub
code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts
☆24Nov 29, 2023Updated 2 years ago
open-nlplab / fastIE
View on GitHub
Information Extraction related tools and models
☆10Mar 16, 2023Updated 3 years ago
abacusai / Long-Context
View on GitHub
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆603Nov 17, 2023Updated 2 years ago
SnoopX-AI / Awesome-Weak-to-Strong-Generalization
View on GitHub
☆11Aug 10, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
OpenLMLab / scaling-rope
View on GitHub
code for Scaling Laws of RoPE-based Extrapolation
☆73Oct 16, 2023Updated 2 years ago
dair-iitd / FloNet
View on GitHub
Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"
☆14Oct 10, 2022Updated 3 years ago
yuhuixu1993 / qa-lora
View on GitHub
Official PyTorch implementation of QA-LoRA
☆147Mar 13, 2024Updated 2 years ago
OpenLemur / Lemur
View on GitHub
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆557Oct 28, 2023Updated 2 years ago
algopapi / RetroformAgent
View on GitHub
Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)
☆16Sep 5, 2023Updated 2 years ago
awslabs / extending-the-context-length-of-open-source-llms
View on GitHub
☆56Jun 26, 2025Updated last year
modestyachts / cifar-10.2
View on GitHub
Host CIFAR-10.2 Data Set
☆13Sep 22, 2021Updated 4 years ago
IST-DASLab / qmoe
View on GitHub
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆278Nov 3, 2023Updated 2 years ago
zer0int / CLIP-SAE-finetune
View on GitHub
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
☆18Dec 19, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ars22 / e3
View on GitHub
☆20Sep 16, 2025Updated 10 months ago
neulab / cmulab
View on GitHub
CMU Linguistic Annotation Backend
☆15Sep 22, 2025Updated 10 months ago
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
OpenNLPLab / TransnormerLLM
View on GitHub
Official implementation of TransNormerLLM: A Faster and Better LLM
☆255Jan 23, 2024Updated 2 years ago
Coding-Crashkurse / Applied-Advanced-RAG
View on GitHub
☆24Jan 28, 2024Updated 2 years ago
Nanami18 / Snowballed_Hallucination
View on GitHub
☆43Sep 3, 2024Updated last year
tom-pollak / claudette-pydantic
View on GitHub
☆10Oct 22, 2024Updated last year
RobertCsordas / moe_layer
View on GitHub
sigma-MoE layer
☆21Jan 5, 2024Updated 2 years ago
stanfordnlp / multi-distribution-retrieval
View on GitHub
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆17Jan 16, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
RUCBM / ICLEval
View on GitHub
☆14Jun 24, 2024Updated 2 years ago
bothe / dialogue-act-recognition
View on GitHub
Context-based Dialogue Act Recognition using Recurrent Neural Networks
☆13Nov 13, 2021Updated 4 years ago
amazon-science / synthesizrr
View on GitHub
Synthesizing realistic and diverse text-datasets from augmented LLMs
☆19Apr 4, 2026Updated 3 months ago
softmax1 / Flash-Attention-Softmax-N
View on GitHub
CUDA and Triton implementations of Flash Attention with SoftmaxN.
☆75May 26, 2024Updated 2 years ago
majumderb / pabst
View on GitHub
Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021
☆10Jul 8, 2021Updated 5 years ago
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
dashends / CodeSyntax
View on GitHub
Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"
☆16Oct 24, 2022Updated 3 years ago
newcompute-ai / everart-node-sdk
View on GitHub
☆12Nov 21, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
YuxiXie / SelfEval-Guided-Decoding
View on GitHub
☆103Dec 7, 2023Updated 2 years ago
AI-ANK / Airbnb-Listing-Explorer
View on GitHub
☆29Apr 29, 2024Updated 2 years ago
wyu-du / Controlled-Dialogue-Generation
View on GitHub
This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…
☆12Dec 1, 2021Updated 4 years ago
IBM / ModuleFormer
View on GitHub
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆225Sep 18, 2025Updated 10 months ago
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,740Apr 17, 2024Updated 2 years ago
LINs-lab / ELICIT
View on GitHub
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆14Mar 11, 2025Updated last year
huggingface / gaia
View on GitHub
Hugging Face and Pyserini interoperability
☆20May 18, 2023Updated 3 years ago