Code for "Small Models are Valuable Plug-ins for Large Language Models"
☆131 · May 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for SuperICL
Users that are interested in SuperICL are comparing it to the libraries listed below.
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks. ☆50 · Mar 28, 2023 · Updated 3 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ☆15 · Mar 8, 2023 · Updated 3 years ago
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- Self-Alignment with Principle-Following Reward Models ☆170 · Sep 18, 2025 · Updated 6 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration ☆19 · Jan 27, 2023 · Updated 3 years ago
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval. ☆14 · May 17, 2025 · Updated 11 months ago
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆208 · May 24, 2023 · Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- Papers and related work to help anyone interested learn ICL conveniently. ☆14 · Feb 28, 2024 · Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs" ☆10 · Dec 13, 2024 · Updated last year
- Implementation of AAAI23 paper: HG-SL: Jointly Learning of Global and Local User Spreading Behavior for Fake News Early Detection ☆18 · Sep 11, 2023 · Updated 2 years ago
- ☆284 · Jan 6, 2025 · Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated 2 years ago
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23) ☆15 · Jul 9, 2023 · Updated 2 years ago
- ☆18 · Mar 10, 2023 · Updated 3 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆40 · Dec 14, 2025 · Updated 4 months ago
- ☆13 · Mar 5, 2024 · Updated 2 years ago
- Baseline code for NAACL 2021 paper "Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge" ☆21 · Jul 6, 2021 · Updated 4 years ago
- Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆41 · Sep 29, 2024 · Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- The code of CIKM 2023 (Oral Presentation): A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE… ☆14 · Jul 19, 2024 · Updated last year
- ☆74 · Apr 2, 2024 · Updated 2 years ago
- Reading comprehension based question-answering model for news articles. ☆11 · Jun 22, 2022 · Updated 3 years ago
- ☆102 · Dec 22, 2023 · Updated 2 years ago
- self-adaptive in-context learning ☆45 · May 5, 2023 · Updated 2 years ago
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts ☆16 · Feb 26, 2024 · Updated 2 years ago
- ACL 2022 (Findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings ☆18 · Mar 23, 2022 · Updated 4 years ago
- ☆30 · Apr 6, 2026 · Updated last week
- ☆21 · May 22, 2023 · Updated 2 years ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆67 · Dec 10, 2024 · Updated last year
- AbstainQA, ACL 2024 ☆29 · Feb 4, 2026 · Updated 2 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆54 · Sep 3, 2024 · Updated last year
- https://arxiv.org/abs/2209.15162 ☆53 · Jan 24, 2023 · Updated 3 years ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models ☆46 · Mar 13, 2022 · Updated 4 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆106 · Jul 26, 2024 · Updated last year
- Crawl & visualize ICLR papers and reviews. ☆18 · Nov 5, 2022 · Updated 3 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆88 · Mar 7, 2025 · Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 · Feb 29, 2024 · Updated 2 years ago
- ☆13 · Feb 7, 2023 · Updated 3 years ago