KPIs
Overview
Key Performance Indicators (KPIs) in Pay-i allow you to define and track custom business metrics associated with your GenAI use cases. Pay-i already provides many standard metrics for use cases out of the box (such as cost, usage, failure rates, and performance metrics), but KPIs let you add your own business-specific measurements. As you’re iterating on your use cases, you want to understand not just how its performing from a cost, latency, and failure perspective, but also whether it is meeting the goals of your business.
There are two steps to working with KPIs in Pay-i, defining them, and then providing the data (called "scores"). Once defined, Pay-i will correlate them with the versions of your Use Case so that you can see how implementation changes (such as new models) are impacting your business.
Further, Pay-i lets you define Value Policies that use KPI data to measure quantifiable ROI.
When to Use KPIs
KPIs are valuable when you need to track metrics beyond what Pay-i automatically provides, such as:
- Success/Failure Rates for Specific Outcomes: Track metrics like Deflection Rate in chatbots to measure how often the AI successfully handles queries without human intervention
- User Satisfaction Ratings from Feedback Systems: Customer Satisfaction scores, such as a Likert scale (1-5), to measure user sentiment about AI interactions
- Business outcomes like conversion rates or revenue generated: Track metrics like Clickthrough Rate, Close Rate, and Conversion Value of AI outreach to directly measure business impact
Defining KPIs
Pay-i makes defining KPIs very straightforward. There are four values for each KPI definition. Not all fields can be edited once the KPI is created, but you can define any number of KPIs for your use cases and remove them at any time.
| # | Value | Description | Editable | Unique per Use Case | Required |
|---|---|---|---|---|---|
| 1. | Name | This is used to identify the KPI both in code and in the Pay-i UI. | ❌ | ✅ | ✅ |
| 2. | Description | An optional text description used for display purposes only. | ✅ | ❌ | ❌ |
| 3. | Type | Pay-i supports several types of KPIs, described below. | ❌ | ❌ | ✅ |
| 4. | Goal | The KPI target score, defined differently for each KPI Type. | ✅ | ❌ | ✅ |
KPI Types
Pay-i supports several KPI types to accommodate different measurement approaches. If there is a KPI Type that you feel is missing, please contact [email protected].
| KPI Type | Description | Example Use |
|---|---|---|
boolean | Binary measurements (true/false) | Pass/fail assessments, success indicators |
number | Numeric measurements with no specific scale | Revenue generated |
percentage | Measurements on a 0-100% scale | Accuracy rates, completion rates |
likert5 | Measurements on a 5-point Likert scale | User satisfaction ratings (1-5) |
likert7 | Measurements on a 7-point Likert scale | More nuanced satisfaction scales (1-7) |
likert10 | Measurements on a 10-point Likert scale | Detailed rating systems (1-10) |
Defining Goals
Apart from being able to track whether your Use Cases are meeting their targets, goals for KPIs are used when creating Value Policies. For each KPI Type, goals are defined slightly differently.
| KPI Type | Goal Description | Legal Goal Scores |
|---|---|---|
boolean | The percentage of “True” responses to aim for. | 0-100 |
number | The average number to aim for. | Any number |
percentage | The average percentage to aim for. | 0-100 |
likert5 | The average Likert response to aim for. | 1-5 |
likert7 | The average Likert response to aim for. | 1-7 |
likert10 | The average Likert response to aim for. | 1-10 |
Providing KPI Scores
Once defined, the next step is to provide Pay-i with the KPI data. Details for how this is done technically are described in the development section, but conceptually this is very straightforward.
KPIs are always defined for a Use Case. Every time that use case is used (e.g., every time an agent is executed), an Instance is created with a unique identifier. From there, a single API call lets you set the KPI scores for that Instance.
KPI Score Example
Let's take a chatbot as example use case. After each chat, the user is given a thumbs up/thumbs down button to rate their experience as a whole.
To model this, the chatbot use case has a single KPI defined in Pay-i. For this example, we will name the KPI User Satisfaction. This KPI's type is Boolean, to model the thumbs up as True and thumbs down as False.
Each chat with this chatbot is an Instance of the chatbot use case, and has a unique ID which can be provided either by your service or auto-generated by Pay-i. Let's imagine that the ID for our example chat is chat1 and the user gave it a thumbs up.
Setting the score for the User Satisfaction KPI for a given chat is as simple as calling Pay-i and providing the Instance ID, chat1 in this example, the name of the KPI User Satisfaction and the score True. You can call this Pay-i API any number of times, setting and resetting the scores as needed for any number of KPIs on the chatbot use case.
Pay-i knows which version of your use case the Instance belongs to, and it knows which KPIs have been defined for that use case. From there, Pay-i handles the rest, aggregating across all Instances of that use case and integrating the data with Value Policies to calculate ROI.
Real-World KPI Examples
Here are example KPIs based on real world production implementations.
| KPI Name | KPI Type | Typical Goal | Description | Common Application |
|---|---|---|---|---|
| Clickthrough Rate | Boolean | 1.00% True | Tracks when users click on AI-generated content | Content generation, ads |
| Close Rate | Boolean | 1.00% True | Measures conversion after engaging with AI content | E-commerce, sales |
| Deflection Rate | Boolean | 25.00% True | Tracks when AI resolves issues without human intervention | Chat bots, customer support |
| Conversion Value | number | $10.00 | Measures monetary value of purchases resulting from AI interactions | Ad generation, product recommendations |
| Customer Satisfaction | likert5 | 3.50 / 5.00 | Captures user sentiment about AI interaction quality | Chat bots, content generation |
Updated about 1 month ago
