jina-ai / textbook

Distill ChatGPT's coding ability into a small model (1B)

☆30

Alternatives and similar repositories for textbook

Users who are interested in textbook are comparing it to the libraries listed below.

xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆64 Updated 2 years ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71 Updated last year
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆122 Updated last year
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40 Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40 Updated 8 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86 Updated last year
nyu-mll / ILF-for-code-generation
☆78 Updated 4 months ago
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆81 Updated 11 months ago
Zyphra / Zyda_processing
☆37 Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆57 Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55 Updated last year
austrian-code-wizard / c3po
☆29 Updated last week
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48 Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29 Updated 6 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78 Updated last year
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32 Updated 3 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105 Updated 7 months ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆105 Updated 10 months ago
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆97 Updated last year
gangiswag / cornstack
☆35 Updated last month
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91 Updated last year
Edward-Sun / RECITE
Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI
☆94 Updated 2 years ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78 Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43 Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107 Updated last year
jakespringer / echo-embeddings
☆152 Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90 Updated last year
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆36 Updated last year
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆223 Updated last year
alonj / Same-Task-More-Tokens
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
☆54 Updated last year