Alternatives
Best cheaper alternatives to Palmyra Fin
Looking for a cheaper or smarter replacement for Palmyra Fin by Writer? We compared every LLM model with verifiable pricing on real input/output cost, context window, intelligence and modalities. 18 options are meaningfully cheaper.
Category: LLM
Llama 3.1 8B Instant 128k99% CHEAPER
Groq
Same category
$0.05 in · $0.08 out / 1M Tokens 128K ctx 98%
Gemini 1.5 Flash-8B99% CHEAPER
Google
Same category
$0.0375 in · $0.15 out / 1M Tokens 36 1.0M ctx 98%
Ministral 3 3B99% CHEAPER
Mistral AI
Same category
$0.1 in · $0.1 out / 1M Tokens 128K ctx 98%
Qwen3.5 9B98% CHEAPER
Together AI
Same category
$0.1 in · $0.15 out / 1M Tokens 262K ctx 98%
OpenAI GPT OSS 20B98% CHEAPER
Fireworks AI
Same category
$0.07 in · $0.3 out / 1M Tokens 128K ctx 98%
Gemini 1.5 Flash98% CHEAPER
Google
Same category
$0.075 in · $0.3 out / 1M Tokens 48 1.0M ctx 98%
GPT OSS 20B 128k98% CHEAPER
Groq
Same category
$0.075 in · $0.3 out / 1M Tokens 128K ctx 98%
Ministral 3 8B98% CHEAPER
Mistral AI
Same category
$0.15 in · $0.15 out / 1M Tokens 37 128K ctx 98%
Llama 4 Scout 17Bx16E 128k98% CHEAPER
Groq
Same category
$0.11 in · $0.34 out / 1M Tokens 128K ctx 98%
DeepSeek V4 Flash97% CHEAPER
DeepSeek
Same category
$0.14 in · $0.28 out / 1M Tokens 128K ctx 98%
Ministral 3 14B97% CHEAPER
Mistral AI
Same category
$0.2 in · $0.2 out / 1M Tokens 128K ctx 98%
Jamba Mini96% CHEAPER
AI21 Labs
Same category
$0.2 in · $0.4 out / 1M Tokens 256K ctx 98%
GPT OSS 120B 128k96% CHEAPER
Groq
Same category
$0.15 in · $0.6 out / 1M Tokens 128K ctx 98%
OpenAI GPT OSS 120B96% CHEAPER
Fireworks AI
Same category
$0.15 in · $0.6 out / 1M Tokens 128K ctx 98%
Google Gemma 4 31B IT96% CHEAPER
Together AI
Same category
$0.2 in · $0.5 out / 1M Tokens 98%
Mistral Nemo96% CHEAPER
Mistral AI
Same category
$0.3 in · $0.3 out / 1M Tokens 39 128K ctx 98%
Qwen3 32B 131k95% CHEAPER
Groq
Same category
$0.29 in · $0.59 out / 1M Tokens 41K ctx 98%
GPT-5.4 Nano93% CHEAPER
OpenAI
Same category
$0.2 in · $1.25 out / 1M Tokens 400K ctx 98%
Costs are normalised to a blended 1M-token workload (or per-unit price for non-token models) using each model's latest verified pricing. Models with pricing under review are excluded from recommendations.