GPT-4O Mini

By OpenAI

GPT-4O Mini is a compact and efficient variant of OpenAI's flagship GPT-4 Optimized (GPT-4o) model. It brings omni-modal capabilities (text, audio, vision) to a smaller footprint, optimized for speed, cost, and deployment in resource-sensitive environments. While scaled down, it aims to provide a balance of GPT-4o's advanced features with greater accessibility and efficiency.

Technical Specifications

  • Input Context Window

    128,000 tokens

  • Maximum Output

    16,384 tokens

  • Open Source

    No

  • Release Date

    July 2024

  • Knowledge Cut-off

    October 2023

Availability & Features

  • API Providers

    OpenAI API (details typically on provider website)

  • Modalities

    Text
    Code
    Vision

  • Key Features

    Omni-modal Capabilities (scaled down)
    Compact & Efficient
    Optimized for Speed & Cost
    Balanced performance/size
    Code Generation

Benchmark Performance

BenchmarkScore
Speed t/s
95.7%
GPQA Diamond
80.1%
Code SWE
75.3%

Pricing

  • Input Processing
    $0.15 / M tokensPer million tokens processed
  • Output Generation
    $0.60 / M tokensPer million tokens generated