Alibaba Cloud
Alternatives

Best cheaper alternatives to Qwen2.5 Math 72B Instruct

Looking for a cheaper or smarter replacement for Qwen2.5 Math 72B Instruct by Alibaba Cloud? We compared every LLM model with verifiable pricing on real input/output cost, context window, intelligence and modalities. 18 options are meaningfully cheaper.

Category: LLM Intelligence 58 4K context
Google
Gemini 1.5 Flash-8B92% CHEAPER
Google
Better valueLarger contextMore modalitiesSame category
Meta
Llama 3.2 11B Vision94% CHEAPER
Meta
Better valueLarger contextMore modalitiesSame category
Google
Gemini 1.5 Flash85% CHEAPER
Google
Better valueLarger contextMore modalitiesSame category
OpenAI
o364% CHEAPER
OpenAI
Better valueSmarter (78 vs 58)Larger contextMore modalitiesSame category
Mistral AI
Ministral 3 8B83% CHEAPER
Mistral AI
Better valueLarger contextMore modalitiesSame category
DeepSeek
DeepSeek-V3 (Chat)86% CHEAPER
DeepSeek
Better valueSmarter (63 vs 58)Larger contextSame category
Meta
Llama 3.1 8B94% CHEAPER
Meta
Better valueLarger contextSame category
Alibaba Cloud
Qwen Turbo90% CHEAPER
Alibaba Cloud
Better valueLarger contextSame category
Google
Gemini 2.0 Flash80% CHEAPER
Google
Better valueLarger contextMore modalitiesSame category
DeepSeek
DeepSeek-R1 (Reasoner)72% CHEAPER
DeepSeek
Better valueSmarter (73 vs 58)Larger contextSame category
OpenAI
o4 Mini64% CHEAPER
OpenAI
Better valueSmarter (70 vs 58)Larger contextMore modalitiesSame category
Mistral AI
Mistral Small83% CHEAPER
Mistral AI
Better valueLarger contextSame category
Meta
Llama 3.2 90B Vision73% CHEAPER
Meta
Better valueLarger contextMore modalitiesSame category
OpenAI
o164% CHEAPER
OpenAI
Better valueSmarter (71 vs 58)Larger contextSame category
Meta
Llama 3.1 70B73% CHEAPER
Meta
Better valueLarger contextSame category
Mistral AI
Ministral 3 3B88% CHEAPER
Mistral AI
Larger contextMore modalitiesSame category
OpenAI
GPT-4o64% CHEAPER
OpenAI
Better valueLarger contextMore modalitiesSame category
OpenAI
GPT-4o Mini64% CHEAPER
OpenAI
Better valueLarger contextMore modalitiesSame category

Costs are normalised to a blended 1M-token workload (or per-unit price for non-token models) using each model's latest verified pricing. Models with pricing under review are excluded from recommendations.