Pre-training code for CrystalCoder 7B LLM
☆57May 10, 2024Updated last year
Alternatives and similar repositories for crystalcoder-train
Users that are interested in crystalcoder-train are comparing it to the libraries listed below
Sorting:
- Data preparation code for CrystalCoder 7B LLM☆43May 10, 2024Updated last year
- Pre-training code for Amber 7B LLM☆172May 10, 2024Updated last year
- Data preparation code for Amber 7B LLM☆93May 10, 2024Updated last year
- Open Implementations of LLM Analyses☆107Oct 8, 2024Updated last year
- A curated list of my GitHub stars☆15Mar 14, 2025Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated last year
- ☆15Oct 30, 2021Updated 4 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Sep 4, 2022Updated 3 years ago
- ☆56Jul 7, 2025Updated 8 months ago
- A very limited implementation of arXiv:1904.00759☆13Dec 2, 2019Updated 6 years ago
- ☆39Aug 27, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- ☆13Jan 14, 2025Updated last year
- This repository provides installation scripts and configuration files for deploying the CSGHub instance, includes Helm charts and Docker…☆19Updated this week
- ☆19Dec 6, 2024Updated last year
- A collection of CLI LLM tools that I built and use daily☆15Aug 7, 2024Updated last year
- ☆12Jul 25, 2023Updated 2 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 4 months ago
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Oct 1, 2024Updated last year
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- Text-2-SQL☆19Feb 21, 2025Updated last year
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings☆19Nov 24, 2021Updated 4 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated 10 months ago
- ☆20Dec 14, 2024Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆319Sep 29, 2023Updated 2 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 2 weeks ago
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service☆18Sep 18, 2023Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling☆49May 3, 2024Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- ☆11May 18, 2025Updated 10 months ago
- 🔮✍🏻 Automatically organize, analyze, and augment the quality of your obsidian.md notes with AI.☆17Aug 28, 2024Updated last year
- awesome-LLM-controlled-constrained-generation☆55Aug 16, 2024Updated last year
- Build a level 1 coding agent.☆17Jan 28, 2025Updated last year