[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models
☆11Sep 19, 2025Updated 6 months ago
Alternatives and similar repositories for STASC
Users that are interested in STASC are comparing it to the libraries listed below.
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆17May 17, 2025Updated 10 months ago
- RL Environment and Benchmark pipeline. Code accompanying paper "Neurophysiologically Realistic Environment for Comparing Adaptive Deep Br…☆19Feb 6, 2026Updated last month
- [ACL 2024] TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Semantic Tasks☆19May 16, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated 11 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆27Mar 9, 2026Updated 2 weeks ago
- Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling☆32Jan 24, 2026Updated 2 months ago
- Automated versioning and package publishing tool. Supports semver and calver. Extendible with plugins.☆14Aug 19, 2024Updated last year
- Example repo demonstrating composition patterns for React and Next.js Apps☆11Oct 5, 2022Updated 3 years ago
- Nanjing University big data comprehensive experiment processing☆10May 4, 2018Updated 7 years ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- A repository containing the code accompanying the research paper "Neuronal travelling waves explain rotational dynamics in experimental d…☆26Feb 24, 2024Updated 2 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- Tobi is a free, open source, multimedia book production authoring tool for creating human narrated talking books synchronized with text a…☆10May 29, 2023Updated 2 years ago
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning☆18Feb 21, 2025Updated last year
- Prerequisites for the kubebuilder-workshop☆11Oct 22, 2018Updated 7 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- Quran Tajweed Highlighter☆11May 17, 2020Updated 5 years ago
- ☆14Nov 2, 2022Updated 3 years ago
- Compare how fine-tuned AI video models interpret the same prompts☆14Jan 29, 2025Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Mar 16, 2026Updated last week
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 6 months ago
- Open source release from our ICLR 2020 paper, CLN2INV: Learning Loop Invariants with Continuous Logic Networks.☆21Jun 8, 2020Updated 5 years ago
- ☆83Aug 14, 2014Updated 11 years ago
- Quran: plain text and linguistic annotations☆10Oct 10, 2025Updated 5 months ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆28Apr 25, 2023Updated 2 years ago
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆23Jul 29, 2025Updated 7 months ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.☆13Jul 27, 2025Updated 7 months ago
- This is an interface that converts any PDF document you give it, offline, into an interview between two people discussing it.☆16Dec 8, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- Personal Infrastructure for Deep Learning based on Pytorch and Tensorflow☆10Jan 10, 2019Updated 7 years ago
- https://github.com/bernakabadayi/ganavatar☆12Oct 8, 2024Updated last year
- ☆11Apr 23, 2023Updated 2 years ago
- Tweets Quran verses and translations☆14Mar 2, 2020Updated 6 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 9 months ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year