GPT-4 Turbo

Our latest model

Michael Schade avatar
Written by Michael Schade
Updated over a week ago

What is it?

GPT-4 Turbo is our latest generation model. It’s more capable, has an updated knowledge cutoff of April 2023 and introduces a 128k context window (the equivalent of 300 pages of text in a single prompt). The model is also 3X cheaper for input tokens and 2X cheaper for output tokens compared to the original GPT-4 model. The maximum number of output tokens for this model is 4096.

How can I get access to it?

Anyone with an OpenAI API account and existing GPT-4 access can use this model. The model can be accessed by passing gpt-4-1106-preview as the model name in the API.

What are the rate limits? Can I get an increase?

Rate limits are dependent on your usage tier. You can find which usage tier you are on your Limits settings page. Since this model is a preview, we won’t be accommodating rate limit increases on GPT-4 Turbo at this time. We plan to release the stable production-ready model in the coming weeks.

Did this answer your question?