Ongoing research project for code&math LLMs
☆31Jul 4, 2025Updated 11 months ago
Alternatives and similar repositories for swallow-code-math
Users who are interested in swallow-code-math are comparing it to the libraries listed below.
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)☆11Nov 28, 2022Updated 3 years ago
- A powerful text cleaner for Japanese web texts☆12Jan 20, 2024Updated 2 years ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Sep 24, 2025Updated 9 months ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark☆18Jun 21, 2026Updated last week
- A Japanese dependency parser based on BERT☆23Oct 26, 2022Updated 3 years ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 11 months ago
- ☆48Sep 15, 2025Updated 9 months ago
- ☆102Feb 11, 2026Updated 4 months ago
- ☆14Jun 24, 2024Updated 2 years ago
- Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.☆11Aug 31, 2020Updated 5 years ago
- Implementation of various generative models☆14Oct 1, 2018Updated 7 years ago
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆23Oct 29, 2025Updated 8 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year
- ☆62Jun 13, 2024Updated 2 years ago
- ☆233Oct 27, 2025Updated 8 months ago
- ☆11Mar 12, 2019Updated 7 years ago
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆16Nov 26, 2024Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated last year
- ☆33Jul 31, 2024Updated last year
- ☆25Dec 13, 2024Updated last year
- ☆106Feb 26, 2025Updated last year
- HIVE: Evaluating the Human Interpretability of Visual Explanations (ECCV 2022)☆22Jan 19, 2023Updated 3 years ago
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆47Nov 19, 2025Updated 7 months ago
- ☆23Aug 1, 2024Updated last year
- ☆33May 9, 2025Updated last year
- ☆34Oct 13, 2025Updated 8 months ago
- PyTorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"☆17Mar 25, 2024Updated 2 years ago
- This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo…☆16Dec 16, 2025Updated 6 months ago
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 4 months ago
- ☆34Nov 24, 2020Updated 5 years ago
- NeurIPS-2023: Data Pruning via Moving-one-Sample-out☆10May 21, 2026Updated last month
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68May 31, 2024Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆81May 2, 2025Updated last year
- MATLAB implementation of variable elimination in Bayesian networks☆10Oct 5, 2018Updated 7 years ago
- Debiasing Through Data Attribution☆13May 23, 2024Updated 2 years ago
- A collection of Black Box Variational Inference algorithms implemented in an object-oriented Python framework using Autograd.☆11Sep 7, 2018Updated 7 years ago
- ACL24☆11Jun 7, 2024Updated 2 years ago