princeton-nlp / ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆19Updated 3 weeks ago
Alternatives and similar repositories for ELIZA-Transformer:
Users that are interested in ELIZA-Transformer are comparing it to the libraries listed below
- ☆15Updated 7 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆10Updated 4 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Updated 4 months ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆21Updated 11 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆36Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 9 months ago
- ☆20Updated 4 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆24Updated this week
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆15Updated 4 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated 3 weeks ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 5 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆30Updated 8 months ago
- A repository for research on medium sized language models.☆76Updated 9 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆23Updated 5 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 3 months ago
- ☆60Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 2 months ago
- ☆14Updated last year
- The code and data for the paper JiuZhang3.0☆40Updated 9 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆22Updated 2 weeks ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆26Updated 2 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 5 months ago