CarperAI / CodeReviewSE
Stuff related to scraping the Code Review StackExchange
☆11 · Updated 2 years ago
Alternatives and similar repositories for CodeReviewSE:
Users who are interested in CodeReviewSE compare it to the libraries listed below.
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆32 · Updated 10 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2… ☆13 · Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆46 · Updated last year
- One stop shop for all things carp ☆59 · Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Updated last year
- Ludwig benchmark ☆20 · Updated 3 years ago
- Embedding Recycling for Language models ☆38 · Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 · Updated last year
- Automatically check mismatch between code and comments using AI and ML ☆53 · Updated 3 years ago
- Ranking of fine-tuned HF models as base models. ☆35 · Updated last year
- A sample pattern for running CI tests on Modal ☆16 · Updated 6 months ago
- NLG Best Practices for Data-Efficient Modeling: How to Train Production-Ready Models with Little Data ☆10 · Updated 3 years ago
- DEPRECATED--all functionality moved to nbdev ☆15 · Updated 2 years ago
- A Streamlit app to add structured tags to a dataset card ☆22 · Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 · Updated 3 years ago
- EMNLP Findings 2020: Reevaluating Adversarial Examples in Natural Language ☆7 · Updated 4 years ago
- ☆22 · Updated last year
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Updated 3 years ago
- ☆28 · Updated last year
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. ☆11 · Updated 4 years ago
- Hugging Face and Pyserini interoperability ☆20 · Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆28 · Updated 2 years ago
- PyTorch implementation for MRL ☆18 · Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 · Updated 2 years ago
- ChatBot App built using LangChain and Lightning AI ☆18 · Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 · Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- ☆24 · Updated last year
- ☆44 · Updated 4 months ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation. ☆42 · Updated 2 years ago