☆52 · Jun 21, 2025 · Updated 8 months ago
Alternatives and similar repositories for cornstack
Users that are interested in cornstack are comparing it to the libraries listed below
- CodeSage: Code Representation Learning At Scale (ICLR 2024) ☆117 · Oct 27, 2024 · Updated last year
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024] ☆24 · Nov 3, 2024 · Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆16 · Jan 16, 2024 · Updated 2 years ago
- music visualization via umap of stable audio latents ☆53 · Nov 29, 2025 · Updated 3 months ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f… ☆10 · May 30, 2023 · Updated 2 years ago
- Implementation for Decision-focused Summarization (EMNLP2021) ☆12 · Mar 14, 2022 · Updated 4 years ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs" ☆20 · Mar 31, 2025 · Updated 11 months ago
- Training and Benchmarking LLMs for Code Preference. ☆38 · Nov 15, 2024 · Updated last year
- Code release for "TempLM: Distilling Language Models into Template-Based Generators" ☆14 · Jul 21, 2022 · Updated 3 years ago
- Data and code for the paper "Future is not One-dimensional: Complex Event Schema Induction via Graph Modeling". ☆29 · Apr 24, 2021 · Updated 4 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Jul 25, 2023 · Updated 2 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆52 · Jan 6, 2026 · Updated 2 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan. ☆11 · Mar 27, 2021 · Updated 4 years ago
- ☆13 · Jun 6, 2022 · Updated 3 years ago
- ☆21 · Sep 6, 2021 · Updated 4 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆53 · Jul 3, 2024 · Updated last year
- ☆14 · May 28, 2019 · Updated 6 years ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?" ☆18 · May 15, 2025 · Updated 10 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment" ☆10 · May 5, 2024 · Updated last year
- DImensionality REduction in JAX ☆26 · Nov 21, 2025 · Updated 4 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆259 · Apr 1, 2025 · Updated 11 months ago
- Code for the paper "A Structural Model for Contextual Code Changes" ☆32 · Oct 25, 2023 · Updated 2 years ago
- Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022) ☆17 · May 16, 2022 · Updated 3 years ago
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking ☆41 · Dec 12, 2025 · Updated 3 months ago
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter" ☆14 · Jul 19, 2024 · Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Oct 19, 2024 · Updated last year
- Multi-language code navigation API in a container ☆103 · Aug 8, 2025 · Updated 7 months ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P… ☆51 · Apr 12, 2025 · Updated 11 months ago
- Replication package for EMNLP2022 paper- RACE: Retrieval-Augmented Commit Message Generation ☆20 · Oct 21, 2022 · Updated 3 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation" ☆10 · Mar 8, 2024 · Updated 2 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati… ☆16 · Jun 3, 2024 · Updated last year
- NNGen, a simple baseline for commit message generation from diffs. ☆15 · Nov 25, 2022 · Updated 3 years ago
- ☆14 · Mar 5, 2024 · Updated 2 years ago
- ☆24 · Jan 30, 2025 · Updated last year
- density-based clustering for exploratory data analysis based on multi-parameter persistence ☆42 · Jul 20, 2025 · Updated 8 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params) ☆63 · Aug 6, 2025 · Updated 7 months ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data" ☆12 · May 23, 2021 · Updated 4 years ago
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow. ☆15 · Mar 1, 2022 · Updated 4 years ago