In October 2024, we launched the Vision Fine tuning enabling developers to fine tune GPT-4o model with both images and text. Vision fine tuning costs $25 per million training tokens, $3.75 per million input tokens, and $15 per million output tokens.
Fine-tuning is available for GPT-4o and GPT-4o mini to developers on all paid usage tiers (Usage Tiers 1- 5). You can start fine-tuning these models for free by visiting your fine-tuning dashboard, clicking “create”, and selecting ‘gpt-4o-2024-08-06’ or ‘gpt-4o-mini-2024-07-18’ from the base model drop-down.
GPT-4o fine-tuning training costs $25 per million tokens, and inference is $3.75 per million input tokens and $15 per million output tokens. For GPT-4o mini, training cost is $3 per million tokens, and inference is $0.30 per million input tokens and $1.20 per million output tokens.