Claude 3.5 Sonnet vs GPT-4 32K (Which is Better in 2024?)

Compare

Comparative Analysis: Claude 3.5 Sonnet vs. GPT-4 32K

Interested in trying out these models in one collaborative workspace?Try out Flamingo.ai for free

Overview

GPT-4 32K was released 1 year before Claude 3.5 Sonnet.

	Claude 3.5 Sonnet	GPT-4 32K
Model Provider The organization behind this AI's development	Anthropic	OpenAI
Input Context Window Maximum input tokens this model can process at once	200.0K tokens	32.8K tokens
Output Token Limit Maximum output tokens this model can generate at once	4096 tokens	Not specified tokens
Release Date When this model first became publicly available	June 20, 2024 4 months ago June 20th, 2024	March 14, 2023 1 year ago March 14th, 2023

Pricing

Claude 3.5 Sonnet is roughly 0.05x less expensive compared to GPT-4 32K for input tokens and roughly 0.1x less expensive for output tokens.

	Claude 3.5 Sonnet	GPT-4 32K
Input Token Cost Cost per million tokens fed into the model	$3.00 per million tokens	$60.00 per million tokens
Output Token Cost Cost per million tokens generated by the model	$15.00 per million tokens	$120.00 per million tokens

Benchmarks

Compare relevant benchmarks between Claude 3.5 Sonnet and GPT-4 32K.

	Claude 3.5 Sonnet	GPT-4 32K
MMLU Measures model's ability to answer questions across various domains	90.4 (5-shot CoT) Source	Benchmark not available.
MMMU Evaluates model's performance across diverse tasks and data types	68.3 (0-shot CoT) Source	Benchmark not available.
HellaSwag Assesses the model's ability to understand everyday scenarios	Benchmark not available.	Benchmark not available.

Claude 3.5 Sonnet, developed by Anthropic, features a context window (the maximum amount of text the model can consider at once) of 200.0K tokens (individual units of text or subwords). The model costs $3.00 per million tokens for input (text fed into the model) and $15.00 per million tokens for output (text generated by the model). It was made publicly available on June 20th, 2024. It has achieved impressive scores in benchmarks (standardized tests for AI models) like MMLU (Massive Multitask Language Understanding, a test of general knowledge) with a score of 90.4 in a 5-shot CoT scenario (a specific testing condition).

GPT-4 32K, developed by OpenAI, features a context window (the maximum amount of text the model can consider at once) of 32.8K tokens (individual units of text or subwords). The model costs $60.00 per million tokens for input (text fed into the model) and $120.00 per million tokens for output (text generated by the model). It was made publicly available on March 14th, 2023. OpenAI logo