SumilerGAO/SunGen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SumilerGAO/SunGen)

SumilerGAO / SunGen

☆28

Alternatives and similar repositories for SunGen

Users that are interested in SunGen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HKUNLP / ProGen
View on GitHub
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
☆27Feb 4, 2023Updated 3 years ago
HKUNLP / subgoal-theorem-prover
View on GitHub
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆20May 25, 2023Updated 3 years ago
jiacheng-ye / ZeroGen
View on GitHub
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆47Feb 18, 2022Updated 4 years ago
HKUNLP / multilingual-transfer
View on GitHub
Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“
☆15Jun 13, 2023Updated 3 years ago
pipilurj / ROBOT
View on GitHub
☆27Apr 11, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
xyq7 / Human-Contribution-Measurement
View on GitHub
☆13Jun 4, 2025Updated last year
zhxieml / remiss-jailbreak
View on GitHub
☆33Jun 24, 2024Updated 2 years ago
pipilurj / G-LLaVA
View on GitHub
Official github repo of G-LLaVA
☆154Feb 20, 2025Updated last year
yumeng5 / FewGen
View on GitHub
[ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
☆44May 10, 2023Updated 3 years ago
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
fywalter / label-bias
View on GitHub
A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning
☆10Aug 4, 2023Updated 2 years ago
HKUNLP / hkunlp.github.io
View on GitHub
Website for HKU NLP group (under construction)
☆14Jul 6, 2026Updated 2 weeks ago
LZhengisme / self-infilling
View on GitHub
[ICML 2024] Self-Infilling Code Generation
☆18May 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yjw1029 / Self-Reminder
View on GitHub
Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.
☆57Nov 13, 2023Updated 2 years ago
pipilurj / MLLM-protector
View on GitHub
The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"
☆46Apr 21, 2024Updated 2 years ago
Timothyxxx / KVCachePapers
View on GitHub
☆20May 24, 2024Updated 2 years ago
oriyor / turning_tables
View on GitHub
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22Nov 2, 2021Updated 4 years ago
xlang-ai / AgentTrek
View on GitHub
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
☆60Feb 21, 2025Updated last year
RyanWangZf / PromptEHR
View on GitHub
EMNLP'22 | PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning
☆31Jun 8, 2023Updated 3 years ago
DreamLM / Dream-VLX
View on GitHub
Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.
☆114Jan 14, 2026Updated 6 months ago
yumeng5 / SuperGen
View on GitHub
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
☆70Sep 18, 2022Updated 3 years ago
HKUNLP / DiffuSearch
View on GitHub
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
☆39Mar 3, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
Timothyxxx / EnvInteractiveLMPapers
View on GitHub
Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…
☆128Jul 26, 2023Updated 2 years ago
SEU-WDS / MachineLearningCourses
View on GitHub
暑期机器学习讨论班是由张祥老师组织发起，全体学生参与的讨论交流活动。目的是让学生巩固机器学习基本算法，掌握基本原理和使用。组织形式为学生选题并制作PPT，采用演讲的形式授课给全体参与学生和导师。
☆10Sep 19, 2018Updated 7 years ago
coastalcph / zeroshot_lexglue
View on GitHub
Zero-shot evaluation on LEXGLUE tasks with GTP3.5
☆29Mar 11, 2023Updated 3 years ago
HKUNLP / diffusion-of-thoughts
View on GitHub
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆213Mar 4, 2025Updated last year
alexjfoote / Neuron2Graph
View on GitHub
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
☆10Jun 6, 2023Updated 3 years ago
taogoddd / GPT-4V-API
View on GitHub
Self-hosted GPT-4V api
☆27Nov 6, 2023Updated 2 years ago
StefanHeng / ProgGen
View on GitHub
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17Mar 29, 2024Updated 2 years ago
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CogComp / TAWT
View on GitHub
Weighted Training for Cross-Task Learning
☆15Feb 12, 2023Updated 3 years ago
RickySkywalker / TheoremLlama
View on GitHub
This is the official repository for all the code of TheoremLlama
☆48Aug 4, 2025Updated 11 months ago
Stanford-ILIAD / DPP-Batch-Active-Learning
View on GitHub
Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr…
☆14Jul 25, 2024Updated last year
nlpaueb / multi-eurlex
View on GitHub
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer
☆40Jun 7, 2022Updated 4 years ago
pipilurj / perceptionGPT
View on GitHub
☆18Aug 7, 2024Updated last year
marvl-challenge / marvl-code
View on GitHub
[EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"
☆30Dec 30, 2021Updated 4 years ago
xiangyue9607 / Sentence-LDP
View on GitHub
Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"
☆12Feb 20, 2023Updated 3 years ago