SUS-Chat: Instruction tuning done right
☆49Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for SUS-Chat
Users who are interested in SUS-Chat are comparing it to the libraries listed below.
- ☆11May 27, 2023Updated 2 years ago
- Winning solution for the CHIP2021 task on classifying clinical findings as positive or negative in medical dialogues☆17Mar 11, 2022Updated 4 years ago
- ☆32Jul 29, 2024Updated last year
- OrionStar-Yi-34B-Chat is an open-source Chinese-English chat model, fine-tuned by OrionStar from the open-source Yi-34B model on 150K+ high-quality corpus samples.☆265Apr 9, 2024Updated 2 years ago
- ☆23Sep 19, 2023Updated 2 years ago
- LingoWhale-8B: Open Bilingual LLMs | Open-source bilingual pretrained large language model☆147Nov 3, 2023Updated 2 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- 2022 Southern University of Science and Technology (SUSTech) CS315 Computer Security course reports, full-score solutions☆16Dec 27, 2022Updated 3 years ago
- Code and Data for the paper: Multi-level Protein Structure Pre-training with Prompt Learning [ICLR 2023]☆33Aug 5, 2023Updated 2 years ago
- The official repository for TensorFlow 2.0 implementation of MetaTTE.☆10Mar 9, 2022Updated 4 years ago
- ☆22Dec 18, 2024Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆11Dec 3, 2024Updated last year
- Open SUSTech dormitory doors without a campus card (hack sustech dormitory door)☆26Sep 14, 2023Updated 2 years ago
- ☆10Aug 3, 2021Updated 4 years ago
- [Ebook] From Zero to a Million Stores: A Hands-On System Design Journey by an Ordinary Person Without a Computer Science Degree☆27Nov 11, 2025Updated 6 months ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [CVPR2026] BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers☆33Mar 17, 2026Updated 2 months ago
- ☆13Jul 22, 2024Updated last year
- ☆10Mar 24, 2023Updated 3 years ago
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- A repo for updating and debugging Mixtral-8x7B, MoE, ChatGLM3, LLaMA2, Baichuan, Qwen, and other LLMs, including new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 7 months ago
- LLM for Astronomy[星语4.0]☆317Apr 6, 2026Updated last month
- ☆18Nov 15, 2024Updated last year
- A model enhanced for date understanding and reasoning☆52Sep 3, 2025Updated 8 months ago
- Opinion mining for e-commerce reviews☆45Jan 29, 2021Updated 5 years ago
- ☆16Jul 6, 2023Updated 2 years ago
- The tool is used for building and driving workflows specifically tailored for AI initiatives. It can be used to construct AI agents.☆162Jul 3, 2024Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 9 months ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆12Aug 1, 2023Updated 2 years ago
- Creating High-Fidelity Synthetic GPS Trajectory Dataset for Urban Mobility Analysis☆22Mar 12, 2026Updated 2 months ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆643Apr 9, 2024Updated 2 years ago
- ☆17Jan 21, 2024Updated 2 years ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Jul 16, 2024Updated last year
- Code for trajectory mining, including three parts: 1) trajectory preprocessing, 2) OD points clustering for route patterns discovery, and…☆11Mar 13, 2019Updated 7 years ago
- A modern implementation of Flower Password (花密) built with Electron.☆10Oct 12, 2021Updated 4 years ago
- The official implementation of paper "Can Textual Gradient Work in Federated Learning?" accepted at ICLR 2025☆16Mar 10, 2025Updated last year
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆16Apr 24, 2024Updated 2 years ago