101 on Portkey's Gateway Configs
You are likely familiar with how to make an API call to GPT4 for chat completions. However, did you know you can set up automatic retries for requests that might fail on OpenAI’s end using Portkey?
The Portkey AI gateway provides several useful features that you can use to enhance your requests. In this cookbook, we will start by making an API call to LLM and explore how Gateway Configs can be utilized to optimize these API calls.
1. API calls to LLMs with Portkey
Consider a typical API call to GPT4 to get chat completions using OpenAI SDK. It takes messages
and model
arguments to get us a response. If you have tried one before, the following code snippet should look familiar. That’s because Portkey Client SDK follows the same signature as OpenAI’s.
Along with Portkey API Key (get one), you might’ve noticed a new parameter while instantiating the portkey
variable — virtualKey
. Portkey securely stores API keys of LLM providers in a vault and substitutes them at runtime in your requests. These unique identifiers to your API keys are called Virtual Keys. For more information, see the docs.
With basics out of our way, let’s jump into applying what we set out to do in the first place with the AI gateway — To automatically retry our request when we hit rate-limits (429 status codes).
2. Apply Gateway Configs
The AI gateway requires instructions to automatically retry requests. This involves providing Gateway Configs, which are essentially JSON objects that orchestrate the AI gateway. In our current scenario, we are targeting GPT4 with requests that have automatic retries on 429 status codes.
We now have our Gateway Configs sorted. But how do we instruct our AI gateway?
You guessed it, on the request headers. The next section will explore two ways to create and reference Gateway Configs.
a. Reference Gateway Configs from the UI
Just as the title says — you create them on the UI and use an ID to have Portkey automatically apply in the request headers to instruct the AI gateway. UI builder features lint suggestions, makes it easy to reference (through config ID), eliminates manual management, and allows you to view version history.
To create Gateway Configs,
Go to portkey.ai and
Click on Configs
Select Create
Choose any name (such as request_retries)
Write the configs in the playground and click Save Config:
See the saved configs in the list along with the ID
:
Try it out now!
The Configs saved will appear as a row item on the Configs page. The ID
is important as it is referenced in our calls through the AI gateway.
Portkey SDK
The Portkey SDK accepts the config parameter that takes the created config ID as it’s argument. To ensure all requests have automatic retries enabled on them, pass the config ID as argument when portkey
is instantiated.
That’s right! One line of code, and all the request from your apps now inherit Gateway Configs and demonstrate automatic retries.
Let’s take a look at the code snippet:
Axios
In the cases, where you are not able to use an SDK, you can pass the same configs as headers with the key x-portkey-config
.
OpenAI SDK
Portkey can be used with OpenAI SDK.
To send a request with using OpenAI SDK client and apply gateway configs to the request pass a baseURL
and necessary headers as follows:
The approach to declare the Gateway Configs in the UI and reference them in the code is recommended since it keeps the Configs atomic and decoupled from the business logic and can be upgraded to add more features. What if you want to enable caching for all your thousands of requests? Just update the Configs from the UI. No commits. No redeploys.
b. Reference Gateway Configs in the Code
Depending on the dynamics of your app, you might want to construct the Gateway Configs at the runtime. All you need to do is to pass the Gateway Configs directly to the config
parameter as an argument.
Portkey SDK
Axios
OpenAI SDK
Those are three ways to use Gateway Configs in your requests.
In the cases where you want to specifically add a config for a specific request instead of all, Portkey allows you to pass config
argument as seperate objects right at the time of chat completions call instead of Portkey({..})
instantiation.
Applying retry super power to your requests is that easy!
Next Steps: Dive into features of AI gateway
Great job on implementing the retry behavior for your LLM calls to OpenAI!
Gateway Configs is a tool that can help you manage fallbacks, request timeouts, load balancing, caching, and more. With Portkey's support for over 100+ LLMs, it is a powerful tool for managing complex use cases that involve multiple target configurations. A Gateway Config that encompasses such complexity may look like:
For complete reference, refer to the Config Object.
It's exciting to see all the AI gateway features available for your requests. Feel free to experiment and make the most of them. Keep up the great work!
Last updated