ari-holtzman / newformerView external linksLinks
☆15Jul 20, 2023Updated 2 years ago
Alternatives and similar repositories for newformer
Users that are interested in newformer are comparing it to the libraries listed below
Sorting:
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 2 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Nov 15, 2023Updated 2 years ago
- ☆19Mar 16, 2025Updated 10 months ago
- Code for ModularQA☆28Jun 8, 2021Updated 4 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- ☆35May 18, 2023Updated 2 years ago
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)☆36Jul 22, 2021Updated 4 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Base Docker image for deploying Kinesis Client Applications in Python☆10Nov 10, 2015Updated 10 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- Artificial stock market (ASM) with Julia language.☆10Aug 10, 2021Updated 4 years ago
- ☆12Jul 8, 2024Updated last year
- ☆11Jan 27, 2026Updated 2 weeks ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- ☆180Feb 23, 2023Updated 2 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Jan 11, 2024Updated 2 years ago
- VeighNa框架的LevelDB数据库接口☆13Apr 23, 2023Updated 2 years ago
- RPi SD Card Image for IoT LoRa Range☆11Mar 5, 2020Updated 5 years ago
- 个人学习中总结的 Rust 思维导图☆10Feb 2, 2024Updated 2 years ago
- Code for MERMAID : Metaphor Generation with Symbolism and Discriminative Decoding☆11May 2, 2022Updated 3 years ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.☆10Jan 23, 2023Updated 3 years ago
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 4 years ago
- AI agents playing Clash Royale autonomously. Claude Code + multi-agent architecture reached 1000+ trophies live on Twitch.☆16Jan 25, 2026Updated 3 weeks ago
- ☆13May 25, 2023Updated 2 years ago
- playing with gpt4☆14Mar 17, 2023Updated 2 years ago
- Digital texts in Prakrit☆10Sep 14, 2025Updated 5 months ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…☆16Apr 13, 2025Updated 10 months ago
- The official Languini Kitchen repository☆14May 6, 2024Updated last year
- This is the repository for the resources in CoNLL 2020 Paper "What Are You Trying Todo? Semantic Typing of Event Processes"☆11Jan 5, 2021Updated 5 years ago
- Minimal MBrace setup to test out the waters, with as few dependencies as possible☆11Mar 18, 2018Updated 7 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- This Project focuses on processing legal court decisions and is part of on-going research at New York University. This code is in develop…☆10Jul 17, 2019Updated 6 years ago