huggingface / latex2sympy2_extended
Parse LaTeX math expressions
☆21 · Updated 3 weeks ago
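A minimal usage sketch, assuming latex2sympy2_extended keeps the `latex2sympy` entry point of the upstream latex2sympy2 package it extends (the import path and function name here are assumptions, not verified against the repo):

```python
# Hypothetical sketch: assumes latex2sympy2_extended re-exports a
# latex2sympy() function, mirroring the upstream latex2sympy2 API.
from latex2sympy2_extended import latex2sympy

# Parse a LaTeX fraction sum into a SymPy expression.
expr = latex2sympy(r"\frac{1}{2} + \frac{1}{3}")
print(expr)          # a SymPy expression (SymPy may auto-simplify to 5/6)
print(expr.evalf())  # numeric value, ~0.8333
```

Once parsed, the result is an ordinary SymPy object, so the usual `simplify`, `evalf`, and equality checks apply.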
Alternatives and similar repositories for latex2sympy2_extended:
Users interested in latex2sympy2_extended are comparing it to the libraries listed below.
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …] ☆59 · Updated 6 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts ☆39 · Updated last year
- qwen-nsa ☆57 · Updated 2 weeks ago
- Efficient Triton implementation of Native Sparse Attention. ☆139 · Updated 2 weeks ago
- Code for the paper: [ICLR 2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference ☆94 · Updated last week
- Here we will test various linear attention designs. ☆60 · Updated last year
- Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs ☆81 · Updated 5 months ago
- Low-bit optimizers for PyTorch ☆128 · Updated last year
- Awesome Triton Resources ☆26 · Updated 3 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆55 · Updated 2 weeks ago
- Odysseus: Playground of LLM Sequence Parallelism ☆68 · Updated 10 months ago
- ZO2 (Zeroth-Order Offloading): Full-Parameter Fine-Tuning of 175B LLMs with 18GB GPU Memory ☆91 · Updated 3 weeks ago
- FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models ☆43 · Updated last week
- XAttention: Block Sparse Attention with Antidiagonal Scoring ☆140 · Updated 3 weeks ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate" ☆95 · Updated 2 weeks ago
- A paper list on efficient Mixture of Experts for LLMs ☆61 · Updated 4 months ago
- Transformers components, but in Triton ☆32 · Updated last month
- 🔥 A minimal training framework for scaling FLA models ☆111 · Updated this week
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆35 · Updated 10 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme ☆115 · Updated 2 weeks ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning ☆173 · Updated last month
- A repository for research on medium-sized language models. ☆76 · Updated 11 months ago
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at… ☆100 · Updated 10 months ago