☆95Dec 19, 2024Updated last year
Alternatives and similar repositories for curai-research
Users that are interested in curai-research are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Using self-play to augment multi-turn text-to-SQL datasets☆11Oct 20, 2022Updated 3 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Jun 29, 2023Updated 2 years ago
- ☆19Oct 13, 2022Updated 3 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆175Feb 4, 2023Updated 3 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Dec 7, 2022Updated 3 years ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆72Mar 26, 2023Updated 3 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆135Oct 5, 2024Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,548Aug 11, 2025Updated 10 months ago
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated 2 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆25Nov 14, 2023Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆317Feb 14, 2025Updated last year
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da…☆497Mar 26, 2024Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models☆157Nov 20, 2023Updated 2 years ago
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago
- A hands-on tutorial on how to use Active Learning with Transformer models.☆15Oct 3, 2021Updated 4 years ago
- ☆81Mar 24, 2025Updated last year
- ☆13Jan 27, 2019Updated 7 years ago
- ☆12Mar 21, 2024Updated 2 years ago
- ☆157Mar 18, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [IJCAI2022] Type-aware Embeddings for Multi-Hop Reasoning over Knowledge Graphs☆29Aug 9, 2022Updated 3 years ago
- Differentiable FFT Conv Layer with Dense Color Channels☆11Apr 8, 2022Updated 4 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆139Mar 14, 2024Updated 2 years ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- This repository is for a research project at Cairo University, computer engineering department.☆14Jan 14, 2022Updated 4 years ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆123Aug 16, 2023Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- ☆44Jun 2, 2026Updated 2 weeks ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆457Sep 6, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆81Mar 17, 2022Updated 4 years ago
- ☆69Jul 19, 2022Updated 3 years ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"☆326Dec 28, 2023Updated 2 years ago
- Causal machine learning pipeline using tlverse/sl3☆13Apr 17, 2026Updated 2 months ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Jan 22, 2022Updated 4 years ago
- [ICML 2026, Spotlight] SleepLM: Natural-Language Intelligence for Human Sleep☆38Mar 10, 2026Updated 3 months ago
- The code of paper Affective Decoding for Empathetic Response Generation☆11Oct 12, 2021Updated 4 years ago