Ongoing research project for code&math LLMs
☆31Jul 4, 2025Updated 11 months ago
Alternatives and similar repositories for swallow-code-math
Users that are interested in swallow-code-math are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for☆28Dec 16, 2024Updated last year
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆48Jul 29, 2025Updated 10 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Sep 24, 2025Updated 8 months ago
- Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark☆18Updated this week
- A Japanese dependency parser based on BERT☆23Oct 26, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- ☆47Sep 15, 2025Updated 8 months ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 8 months ago
- ☆14Jun 24, 2024Updated last year
- Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.☆11Aug 31, 2020Updated 5 years ago
- Implementation of various generative models☆14Oct 1, 2018Updated 7 years ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆22Oct 29, 2025Updated 7 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year
- ☆62Jun 13, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Apr 7, 2025Updated last year
- ☆229Oct 27, 2025Updated 7 months ago
- ☆11Mar 12, 2019Updated 7 years ago
- Yet another Python binding for Juman++/KNP/KWJA☆39Jun 4, 2026Updated last week
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆16Nov 26, 2024Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated 11 months ago
- ☆33Jul 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆22Jul 18, 2025Updated 10 months ago
- ☆25Dec 13, 2024Updated last year
- ☆108Feb 26, 2025Updated last year
- HIVE: Evaluating the Human Interpretability of Visual Explanations (ECCV 2022)☆22Jan 19, 2023Updated 3 years ago
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang☆44Nov 19, 2025Updated 6 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 2 months ago
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- ☆33May 9, 2025Updated last year
- ☆34Oct 13, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"☆17Mar 25, 2024Updated 2 years ago
- ☆18Mar 2, 2026Updated 3 months ago
- ☆10May 21, 2026Updated 3 weeks ago
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 3 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆81May 2, 2025Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆28Oct 14, 2025Updated 7 months ago
- ☆10Sep 29, 2024Updated last year