AIAIFindr.app
Ll

Llama

by Meta

8.3
Quality score / 10
Visit LlamaFree tierPay per token

Overview

Llama is Meta's family of open-weight large language models that became the de-facto foundation for the open-source AI ecosystem. Available in sizes from a few billion to hundreds of billions of parameters, Llama models can be fine-tuned and self-hosted with a permissive licence. They power countless products via providers like Groq, Together and Fireworks.

Speed
Fast
Context window
128K tokens
Free tier
Yes
Updated
2026-06-01

Key features

  • Open weights with commercial licence
  • Multiple model sizes
  • Fine-tuning friendly
  • Huge community and tooling
  • Available on many inference providers
  • Multimodal variants

Pricing

Pay per token
Free and open weights
Free tierPay per token
API input
$0.2 / 1M tokens
API output
$0.2 / 1M tokens
查看 Llama 完整 API 價格 → llmprice.app

Pros

  • +Fully self-hostable and customizable
  • +Massive ecosystem and tooling
  • +Very low inference cost

Cons

  • Requires infrastructure to self-host
  • Raw models need tuning for production

Best for

Llama alternatives

See all alternatives →
Mi

Mistral

Mistral AI

8.5

Mistral AI is a European lab offering both open-weight and commercial models that punch well above their size. Le Chat is its consumer assistant, while models like Mistral Large and the Mixtral mixture-of-experts series power developers via API. Mistral is popular for on-premise, privacy-sensitive and cost-conscious deployments thanks to its permissively licensed open models.

Chatbots & LLMsCoding
Free tierPay per tokenView details →
Qw

Qwen

Alibaba Cloud

8.6

Qwen is Alibaba's series of open-weight models spanning chat, coding, vision and math. The lineup is among the strongest open releases, with excellent multilingual ability and competitive benchmark scores. Qwen models are widely used across Asia and increasingly worldwide for self-hosted and API-based deployments.

Chatbots & LLMsCoding
Free tierPay per tokenView details →
De

DeepSeek

DeepSeek

8.9

DeepSeek is a Chinese AI lab that stunned the industry with frontier-level reasoning models at a fraction of typical costs. Its R-series reasoning models and V-series chat models are open-weight and extremely cheap via API. DeepSeek is the go-to choice for developers who need strong math, coding and reasoning performance on a tight budget.

Chatbots & LLMsCoding
Free tierPay per tokenView details →
Ge

Gemini

Google DeepMind

9.2

Gemini is Google's natively multimodal model family, deeply integrated across Search, Workspace, Android and the Pixel line. Its standout feature is an enormous context window of up to one to two million tokens, ideal for analysing long videos, codebases and document sets. Gemini blends Google's real-time knowledge with strong reasoning and is available free in many products.

Chatbots & LLMsCodingWriting
Free tier$20/moView details →

Ready to try Llama?

Start with the free tier and upgrade as you grow.

Visit Llama