Pre-training code for CrystalCoder 7B LLM
☆59May 10, 2024Updated last year
Alternatives and similar repositories for crystalcoder-train
Users that are interested in crystalcoder-train are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data preparation code for CrystalCoder 7B LLM☆45May 10, 2024Updated last year
- Pre-training code for Amber 7B LLM☆174May 10, 2024Updated last year
- Data preparation code for Amber 7B LLM☆94May 10, 2024Updated last year
- Open Implementations of LLM Analyses☆109Oct 8, 2024Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Apr 12, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Oct 30, 2021Updated 4 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Sep 4, 2022Updated 3 years ago
- ☆39Aug 27, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- An extensible framework for building visualization and annotation tools to enable better interaction with NLP and Artificial Intelligence…☆51Feb 4, 2023Updated 3 years ago
- ☆19Nov 10, 2024Updated last year
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- ☆15May 27, 2019Updated 6 years ago
- ☆10Apr 15, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- UM1 test programs and sample code☆10Jul 25, 2022Updated 3 years ago
- A library for semantic similarity search☆26Jan 31, 2025Updated last year
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆28Jul 31, 2025Updated 9 months ago
- ☆15Oct 2, 2024Updated last year
- Text-2-SQL☆19Feb 21, 2025Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings☆19Nov 24, 2021Updated 4 years ago
- Say hi to anyone, for humans and agents. An Inbox Zero product☆19Jul 8, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆20Dec 14, 2024Updated last year
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated 11 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated last month
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆320Sep 29, 2023Updated 2 years ago
- OOPSLA 2019 Artifact for AutoPandas. Website at https://rbavishi.github.io/autopandas☆31Nov 21, 2022Updated 3 years ago
- Hyperparameter tuning via uncertainty modeling☆51May 3, 2024Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- Schema2QA Question Answering Dataset☆19Aug 22, 2022Updated 3 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Dec 15, 2023Updated 2 years ago
- An MCP server for Raindrop.io (bookmarking service)☆20Apr 10, 2025Updated last year
- ☆14Mar 1, 2023Updated 3 years ago
- Pytorch implementation of Deep Convolutional Generative Adversarial Networks (DCGAN) for humanface datasets, which can genarate some bea…☆18Apr 24, 2018Updated 8 years ago
- A Dataset of 600k Java Source Code Changes Categorized by Diff Size http://arxiv.org/pdf/2108.04631☆22Mar 22, 2024Updated 2 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- ☆25Apr 8, 2022Updated 4 years ago