Claude 3.5 Sonnet vs GPT-4o (Which is Better in 2024?)

CompareAnthropic logotoOpenAI logo
Comparative Analysis: Claude 3.5 Sonnet vs. GPT-4o

Overview

GPT-4o was released 1 month before Claude 3.5 Sonnet.
Claude 3.5 SonnetClaude 3.5 Sonnet
GPT-4oGPT-4o
Model Provider
The organization behind this AI's development
Anthropic logoAnthropic
OpenAI logoOpenAI
Input Context Window
Maximum input tokens this model can process at once
200.0K
tokens
128.0K
tokens
Output Token Limit
Maximum output tokens this model can generate at once
4096
tokens
2048
tokens
Release Date
When this model first became publicly available
June 20th, 2024
May 13th, 2024

Pricing

Claude 3.5 Sonnet is roughly 0.6x less expensive compared to GPT-4o for input tokens and roughly 1.0x less expensive for output tokens.
Claude 3.5 SonnetClaude 3.5 Sonnet
GPT-4oGPT-4o
Input Token Cost
Cost per million tokens fed into the model
$3.00
per million tokens
$5.00
per million tokens
Output Token Cost
Cost per million tokens generated by the model
$15.00
per million tokens
$15.00
per million tokens

Benchmarks

Compare relevant benchmarks between Claude 3.5 Sonnet and GPT-4o.
Claude 3.5 SonnetClaude 3.5 Sonnet
GPT-4oGPT-4o
MMLU
Measures model's ability to answer questions across various domains
90.4
(5-shot CoT)
88.7
(5-shot)
MMMU
Evaluates model's performance across diverse tasks and data types
68.3
(0-shot CoT)
69.1
HellaSwag
Assesses the model's ability to understand everyday scenarios
Benchmark not available.
Benchmark not available.
Anthropic logoClaude 3.5 Sonnet, developed by Anthropic, features a context window (the maximum amount of text the model can consider at once) of 200.0K tokens (individual units of text or subwords). The model costs $3.00 per million tokens for input (text fed into the model) and $15.00 per million tokens for output (text generated by the model). It was made publicly available on June 20th, 2024. It has achieved impressive scores in benchmarks (standardized tests for AI models) like MMLU (Massive Multitask Language Understanding, a test of general knowledge) with a score of 90.4 in a 5-shot CoT scenario (a specific testing condition).
GPT-4o, developed by OpenAI, features a context window (the maximum amount of text the model can consider at once) of 128.0K tokens (individual units of text or subwords). The model costs $5.00 per million tokens for input (text fed into the model) and $15.00 per million tokens for output (text generated by the model). It was made publicly available on May 13th, 2024. It has achieved impressive scores in benchmarks (standardized tests for AI models) like MMLU (Massive Multitask Language Understanding, a test of general knowledge) with a score of 88.7 in a 5-shot scenario (a specific testing condition).OpenAI logo

Compare more models