SwordElucidator/nanoBackpackLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SwordElucidator/nanoBackpackLM)

SwordElucidator / nanoBackpackLM

The simplest repository for training medium-sized BackpackLM for cs224n

☆25

Alternatives and similar repositories for nanoBackpackLM

Users that are interested in nanoBackpackLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

john-hewitt / backpacks-flash-attn
View on GitHub
The original Backpack Language Model implementation, a fork of FlashAttention
☆71May 29, 2023Updated 3 years ago
john-hewitt / truncation-sampling
View on GitHub
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆13Dec 6, 2022Updated 3 years ago
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Jul 18, 2026Updated last week
Tiiiger / templm
View on GitHub
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Jul 21, 2022Updated 4 years ago
atticusg / MultiplyQuantifiedData
View on GitHub
☆10Nov 1, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
feyzaakyurek / bbnli
View on GitHub
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Apr 28, 2022Updated 4 years ago
allenai / dream
View on GitHub
☆23Sep 2, 2024Updated last year
csinva / mdl-complexity
View on GitHub
MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".
☆18Jun 12, 2023Updated 3 years ago
allenai / modularqa
View on GitHub
Code for ModularQA
☆27Jun 8, 2021Updated 5 years ago
azpoliak / hypothesis-only-NLI
View on GitHub
Code and data corresponding to "Hypothesis Only Baselines in Natural Language Inference" (StarSem 2018)
☆25Dec 8, 2022Updated 3 years ago
limanling / pathlm_schema
View on GitHub
Code for EMNLP 2020 paper `Connecting the Dots: Event Graph Schema Induction with Path Language Modeling`
☆23Nov 16, 2020Updated 5 years ago
forkonlp / newspaper
View on GitHub
대부분의 신문사 뉴스를 수집하는 것을 목적으로 하는 크롤러 제작 프로젝트
☆11Jul 29, 2019Updated 6 years ago
AIRI-Institute / Probing_framework
View on GitHub
Framework for probing tasks
☆31Mar 24, 2024Updated 2 years ago
ameyagodbole / Prob-CBR
View on GitHub
☆26Oct 18, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
limanling / temporal-graph-schema
View on GitHub
Data and code for the paper "Future is not One-dimensional: Complex Event Schema Induction via Graph Modeling".
☆30Apr 24, 2021Updated 5 years ago
nika2312 / qa_explaination
View on GitHub
☆13Jul 8, 2020Updated 6 years ago
StonyBrookNLP / tellmewhy
View on GitHub
Website for release of TellMeWhy dataset for why question answering
☆14Nov 11, 2022Updated 3 years ago
kracr / sparql-cypher-transpiler
View on GitHub
A SPARQL to Cypher query transpiler that converts a given SPARQL query to an equivalent Cypher query.
☆14Aug 25, 2021Updated 4 years ago
ufal / korektor
View on GitHub
Statistical spell- and (occasional) grammar-checker.
☆19Updated this week
FarimaFatahi / CompactIE
View on GitHub
☆11Apr 24, 2023Updated 3 years ago
iwiwi / epochraft-hf-fsdp
View on GitHub
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11Jan 29, 2024Updated 2 years ago
wjko2 / Linguistically-Informed-Specificity-and-Semantic-Plausibility-for-Dialogue-Generation
View on GitHub
☆10Jun 11, 2019Updated 7 years ago
declare-lab / WikiDes
View on GitHub
A Wikipedia-based summarization dataset
☆14Mar 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SmartDataAnalytics / Wikipedia_TF_IDF_Dataset
View on GitHub
Pre-computed IDF stats over all EN Wiki articles
☆13Jan 30, 2020Updated 6 years ago
zzzace2000 / robust_cls_model
View on GitHub
The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"
☆16Jul 29, 2021Updated 4 years ago
lateral / poincare-embeddings
View on GitHub
A multi-threaded C++ implementation of Nickel & Kiela's "Poincare Embeddings" paper from NIPS 2017, following the implementation of the a…
☆18Jun 6, 2018Updated 8 years ago
callummcdougall / sae-exercises-mats
View on GitHub
☆26Dec 20, 2023Updated 2 years ago
VictorProkhorov / Text2Path
View on GitHub
[NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models
☆11Apr 27, 2022Updated 4 years ago
kliegr / word_similarity_relatedness_datasets
View on GitHub
☆12Jul 19, 2018Updated 8 years ago
ritikamangla / QSalience
View on GitHub
https://arxiv.org/abs/2404.10917
☆14Mar 18, 2025Updated last year
kyutai-labs / moshi-webrtc
View on GitHub
Proof of concept for running moshi/hibiki using webrtc
☆21Feb 28, 2025Updated last year
ari-holtzman / newformer
View on GitHub
☆16Jul 20, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yakazimir / semantic_fragments
View on GitHub
Code and data for experiments on semantic fragments
☆11Jun 23, 2022Updated 4 years ago
CogComp / Event_Process_Typing
View on GitHub
This is the repository for the resources in CoNLL 2020 Paper "What Are You Trying Todo? Semantic Typing of Event Processes"
☆10Jan 5, 2021Updated 5 years ago
avijit-thawani / SWOW-eval
View on GitHub
Intrinsic Evaluation of pre-trained word embeddings, using large Word Association Dataset: SWOW (Small World of Words)
☆11Feb 28, 2024Updated 2 years ago
sheepla / websh-prompt
View on GitHub
💻 A command line websh client with bash-like interactive UI
☆25Jul 14, 2024Updated 2 years ago
ZihanWangKi / GoalEx
View on GitHub
Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"
☆41May 24, 2023Updated 3 years ago
wangzq870305 / event_chain
View on GitHub
☆28Nov 18, 2017Updated 8 years ago
allenai / comet-public
View on GitHub
A Public repository for the COMeT model
☆13Jul 25, 2024Updated 2 years ago