Pre-training code for CrystalCoder 7B LLM
☆58 · May 10, 2024 · Updated last year
Alternatives and similar repositories for crystalcoder-train
Users who are interested in crystalcoder-train are comparing it to the libraries listed below.
- Data preparation code for CrystalCoder 7B LLM ☆44 · May 10, 2024 · Updated last year
- Pre-training code for Amber 7B LLM ☆173 · May 10, 2024 · Updated last year
- Data preparation code for Amber 7B LLM ☆94 · May 10, 2024 · Updated last year
- Open Implementations of LLM Analyses ☆108 · Oct 8, 2024 · Updated last year
- A curated list of my GitHub stars ☆15 · Mar 14, 2025 · Updated last year
- ☆15 · Oct 30, 2021 · Updated 4 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)" ☆18 · Sep 4, 2022 · Updated 3 years ago
- ☆13 · Jan 14, 2025 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- ☆19 · Dec 6, 2024 · Updated last year
- ☆18 · Nov 10, 2024 · Updated last year
- A collection of CLI LLM tools that I built and use daily ☆15 · Aug 7, 2024 · Updated last year
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training ☆14 · Oct 25, 2024 · Updated last year
- This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep… ☆10 · Nov 16, 2023 · Updated 2 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness ☆14 · Nov 10, 2025 · Updated 5 months ago
- UM1 test programs and sample code ☆11 · Jul 25, 2022 · Updated 3 years ago
- ☆13 · Oct 11, 2024 · Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 6 months ago
- ☆25 · May 2, 2025 · Updated 11 months ago
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more! ☆11 · Oct 1, 2024 · Updated last year
- Text-2-SQL ☆19 · Feb 21, 2025 · Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models ☆19 · Mar 16, 2023 · Updated 3 years ago
- Kanban board made with TailwindCSS ☆11 · Jun 10, 2021 · Updated 4 years ago
- Creates CMM script that can be directly executed on Kaggle from easy merge script ☆14 · Mar 6, 2026 · Updated last month
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆319 · Sep 29, 2023 · Updated 2 years ago
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service ☆18 · Sep 18, 2023 · Updated 2 years ago
- OOPSLA 2019 Artifact for AutoPandas. Website at https://rbavishi.github.io/autopandas ☆31 · Nov 21, 2022 · Updated 3 years ago
- Inverse Scaling in Test-Time Compute ☆25 · Dec 3, 2025 · Updated 4 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and … ☆11 · Jun 18, 2024 · Updated last year
- Paper notes for my PhD on Machine Learning (mostly focused on Reinforcement Learning) ☆17 · Jul 22, 2019 · Updated 6 years ago
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production … ☆15 · Apr 17, 2024 · Updated last year
- 🔮✍🏻 Automatically organize, analyze, and augment the quality of your obsidian.md notes with AI. ☆17 · Aug 28, 2024 · Updated last year
- awesome-LLM-controlled-constrained-generation ☆56 · Aug 16, 2024 · Updated last year
- Schema2QA Question Answering Dataset ☆19 · Aug 22, 2022 · Updated 3 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl… ☆14 · Aug 19, 2022 · Updated 3 years ago
- vLLM for embedding tasks using Original LLMs (Qwen2, LLaMA) ☆29 · Sep 9, 2024 · Updated last year
- Explanation Optimization ☆13 · Oct 16, 2020 · Updated 5 years ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆154 · May 18, 2024 · Updated last year
- This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023). ☆14 · Jul 23, 2023 · Updated 2 years ago