LLM Actions
An LLM Action enables you to use LLMs to perform reasoning, generate text and feed that information back into your scenario in order to influence the next steps of the conversation.
Using LLM Actions you have more fine-grained control over when and how to involve an LLM in your conversation since actions can be performed after a user intent has been interpreted (by an OpenDialog interpreter) or before a bot message was sent to the user. This enables you to influence the context of the conversation as best required by your scenario.
This section will help you to understand what an LLM action is, and how to configure and use it within your conversation.
The basics
LLM actions provide conversation designers with the ability to use LLMs in several different ways, leading to a more flexible and fluent conversational application, complete with safeguarding and moderation tools to provide better protected, dynamic content generation and reasoning.
As a conversation designer, LLM actions provide a suite of tools and benefits:
Build conversations easier, quicker, and more efficiently
Easily configure how the LLM should respond
Analyse incoming user utterances without the need for additional language service or interpretation
Generate response text without the need for multiple messages or conditions
Provide more personalised and dynamic 'welcome' and 'no match' messages
Use contextual data gathered within a conversation to provide more bespoke respones
Provide support, guidance and recommendations such as 'tell me about a subject'
Keep customers on track and repair the conversation when it diverges
Moderate LLM behaviour with safeguarding tools to deliver a safer conversation
In action
Create tailored responses
The LLM can better understand how best to respond by providing direction within a system prompt and reduce the need to include additional language services and interpreters in your conversation design.
Moderated and safer responses
Apply safeguarding features to mitigate risk, and contribute to a safer, more pleasant conversation experience.
Test and iterate
Amend and configure the behaviour of your LLM action using the test panel, ensuring it's ready for use within your scenario.
Where to find
LLM actions are an additional integration to the conversation designer and can be found under 'Integrate' within the sidebar menu.
To access LLM actions:
Go to your workspace overview (Scenarios)
Select the scenario where you would like to create an LLM action
Use the left-hand menu and expand the 'Integrate' section using the down-ward arrow
Select 'LLM actions'
Click 'Create an LLM action'
How to use
Structure
LLM Actions have two main components:
Settings - This is where to configure your LLM, give it a name and description, choose your engine, configure your prompts, advanced settings and choose any safeguarding settings you require.
Test panel - Here you can test your LLM action using the 'Input' and 'Output' areas to view what has been sent to the LLM, and what the LLM returns as a response.
Settings
Within the settings area, you can configure your engine, enter your prompt, apply any advanced settings and enter any safeguarding considerations.
Name and description
When creating your LLM action, it's important to give it a unique name and description that helps you identify it should you have more than one.
Engine configuration
You can choose from the following LLM engines when you create an LLM action:
Custom configuration
Each requires a slightly different configuration.
When using OpenAI
• Selecting 'OpenDialog managed' means you will use a service directly managed by OpenDialog, and will not need to provide any credentials. You will be able to select a model from a pre-defined dropdown list.
• Unselecting a managed service means you will need to provide your OpenAI API Key, OpenAI Organisation and specify a model name.
When using Azure OpenAI
• OpenDialog does not provide a managed service, so you will need to provide your Azure OpenAI API Key, Azure OpenAI Resource Name and Azure OpenAI Deployment Name.
When using a custom configuration
If you choose the 'Custom' engine, you will need to manually configure your action using the JSON input field
Prompt configuration
The prompt configuration area is where you can provide instructions for the LLM on how best to assist the user with a response.
Within prompt configuration you will see two main areas:
System prompt and temperature - perfect for novice users
Advanced settings - suitable for power users
System prompt
Use the system prompt to specify how the LLM should respond. When creating a prompt, always ensure it is clear, specific, and concise. Provide context, avoid ambiguity, and define the desired output format.
Let's look at an example:
"You are a helpful car insurance assistant. The user will provide questions or statements related to car insurance claims, and you will categorise their input. Output whether the user is asking to make a claim (make_claim), asking about the status of a claim (claim_status) or an FAQ about a claim (claim_faq). If none of the above, return an empty string."
It is also possible to include attributes within your system, for example:
"You are a helpful car insurance assistant. The user will provide questions or statements related to car insurance claims, and you will categorise their input. The user's vehicle is a {vehicle_type}. Output whether the user is asking to make a claim (make_claim), asking about the status of a claim (claim_status) or an FAQ about a claim (claim_faq). If none of the above, return an empty string."
You can add or create an attribute using the curly brace {
symbol. Read more about entering attributes. Attributes added within the system prompt are automatically added to the test panel.
Attributes require a value. When using the testing panel, where possible, OpenDialog will add a default value to any attributes you create. In cases where this is not possible, ensure you add a value to any attributes within the test panel. This will not occur when using the action in a deployed scenario, and all attribute values will need to be provided via user input, actions, interpreters or the embed snippet.
Using Attributes in prompts
When you refer to an attribute in a prompt, you are telling OpenDialog to substitute the value of that attribute into the prompt at the point it is submitted to the LLM. Due to this potential variation in the content of the attribute, it is not possible to know the full token usage of a prompt until the attribute values have been substituted. To determine the total token usage (and consequently the cost) of a specific attribute value, please see the "Inspect" section of the testing panel.
Using RAG service strings
To use your RAG service from within a system prompt you should use the special syntax %% [service].[topic] %%
to reference the data you want to use. For example if you have a knowledge service called 'countries_ks' and you want to reference the 'capital_cities' topic data you would use %% countries_ks.capital_cities %%
.
Optionally you can also add a query to only return the topic data that is useful to the user's question. To do this, the syntax is
%% [service].[topic] ? [query] %%
An example RAG service string:
"You are a chatbot answering user questions. You may only find your answer from within the following data: %% countries_ks.capital_cities ? France %%:
Using output attributes
When you check the ‘Use custom output attributes’ checkbox, and enter your own additional attributes for the LLM to populate, you may incur additional cost. This is due to additional processing that is required by the LLM in order to place the correct content in each attribute, meaning your token usage may increase.
Read more on 'Using custom output attributes'
Temperature
Temperature is a parameter that influences the language model’s output. Setting a lower temperature will yield more deterministic results, however, it might also result in less creative or diverse text, and feel slightly robotic. A low temperature also does not guarantee that the LLM action will respond with the same values each time it is used.
Setting a higher temperature will provide less probable words, leading to more varied and sometimes more creative outputs. However, this can also result in more errors or nonsensical responses. When creating an LLM action, use the testing panel to sense-check your settings and find the right temperature.
Advanced prompt configuration
This area allows you to configure specific aspects of your LLM’s behaviour. You can find these within the advanced expandable section under 'Prompt configuration'. OpenDialog automatically provides a default setting for most options, however you can adjust them to your preference, and see immediate cause and effect using the testing panel.
User prompt
The user prompt field allows you to provide a custom user prompt to the LLM. By default this field will contain {utterance_text}
which means that the LLM will receive whatever text the user enters into Webchat. A custom user prompt allows you to constrain what is received by the LLM. You may choose to use attributes to provide a bespoke input, such as "I work {distance}km away from my house". These attributes can be retrieved from form messages, button messages, interpreters or other actions. You can also choose to remove the user prompt completely. This means that no user input is sent to the LLM (besides any attributes in the system prompt). This can be useful if you want to generate text based solely on pre-defined instructions in your system prompt.
When a user prompt is specified in the advanced section, the user utterance within the test panel is replaced with the user prompt you entered.
Use custom output attributes
By defining output attributes, the LLM will be prompted to return data for each attribute. This allows you to retrieve granular data alongside standard response text. Each output attribute can also be coupled with an optional instruction. This instruction specifies how the LLM should attempt to populate the attribute. Any populated output attributes will be displayed in the testing panel after running a test of the action.
Output attributes are incredibly useful, and can be used to affect the conversational flow via conditions.
With these output attributes created and their values defined, we can now configure messages based on the circumstances of the users application. If they can be insured, we can redirect to {can_insure_string}. If we need more information, we can redirect to the {follow_up_question} to gather more information.
Frequency penalty
A positive frequency penalty value discourages the recurrence of words in the generated text by applying penalties to tokens based on their frequency of occurrence. The higher the frequency of a token in the text, the lower the likelihood that the LLM will use it again.
Max tokens
Sets the limit on how many tokens it can use in one go. If you exceed this limit, the model won't be able to process the sentence properly.
Presence penalty
A positive presence penalty imposes a penalty on a token if it is present at all, distinct from the frequency penalty, where the penalty remains consistent for all instances of repetition.
Top P
Nucleus sampling (a sampling method with temperature) lets you regulate the model's determinism. Lower values favour precise, factual responses, while higher values encourage diversity.
Seed
In an LLM, the 'seed' is a number that increases the likelihood of a deterministic response when generating text, but it is no guaranteed.
Stop
Using keywords or phrases, you can list specific tokens in the stop parameter to make the LLM stop generating text when it reaches any of them. This is useful for stopping the generation of text after a specific word or punctuation.
Safeguarding
LLMs can sometimes be unpredictable without careful consideration. We highly recommend using the test panel to gain a better perspective on the kind of responses your LLM might return.
At OpenDialog, the safety of both our customers and your users is paramount; therefore, our LLM actions provide safeguarding tools to help meet both the criteria of your business and the expectations of your customers.
Failure response
If the model returns this text exactly, then the action will fail. If you want to use this feature, you must prompt the model to return it. For example, if your failure response value is unknown_input, then you would need to update your prompt to include 'Otherwise if the user's input isn't valid, respond with unknown_input'.
Content moderation
This setting allow you to further control the behaviour and content of your LLM action by moderating offensive/inappropriate content. This is enabled by default. If content is moderated the action will fail and some output attributes contain granular detail of the moderation will be provided.
We strongly recommend that this settings be used with care as disabling content moderation could have undesired affects.
User utterance/response exclusion list
If terms are provided in the user utterance exclusion list, when an utterance including any of the terms is sent to the LLM (such as 'Alcohol'), the LLM action will fail.
Similarly to the utterance exclusion list, action failure will occur if the LLM's response contains any words or phrases you have specified in the response exclusion list.
Please note that the utterances and words added to the exclusion lists are considered as exact matches and will not be interpreted.
Test panel
When creating your LLM action, the test panel will allow you to try the action, and see both the Inputs and the Outputs within a test. It's a great indicator of what you can expect from your action and provides a safe space to change and iterate the behaviour of your action. It's also worth noting that testing response text and attributes are not guaranteed to be the same when used in your scenario due to the inherent non-determinism of LLMs.
→ Inputs
This area shows the information sent to the LLM:
Attributes and values (gathered from the system or user prompt)
User utterance (if a custom user prompt is not set in the advanced section)
Attributes and values
When an attribute is entered in the system prompt, it will automatically appear in the test panel and where possible, be given a default value by OpenDialog. This is shown with an 'i' (information) alert tooltip within the field.
When a default value is set by OpenDialog, the field will appear yellow with an 'i' tooltip that states 'Example value is in use'
If no default value can be set by OpenDialog, the field will appear red with an 'i' tooltip that states 'This attribute has no value'
Setting attribute values
By clicking an attribute in the test panel you can easily set its value within the popup value menu.
Any attributes that have had a manual value entered will no longer appear with any form of 'information' alert tooltip.
Outputs →
This area shows the information received from the LLM based on the input data.
Success/Failure
Indicates whether the LLM worked successfully, or in the case of failure, will indicate what the cause of failure was with some guidance on how best to make changes to your LLM action.
Attributes
Displays any attributes output by the LLM within the test. You may also see additional output attributes that you did not specify during configuration, for example, in the case of success, error or failure.
Read more about output attributes
Response text
Shows the response text from the LLM.
Inspect
An expandable JSON tool that allows you to expand and inspect the raw data of the LLM action exchange. You can also see both the compiled prompts which show the system and user prompts with their attributes replaced.
Tokens
Prompts are sent to an LLM as 'Tokens'.
Tokens are the basic building blocks of text that Large Language Models (LLMs) use to process and generate language. A token can be a whole word, part of a word, or a single character. The more complex the prompt sent to the LLM, the more tokens will be used, and therefore the higher the cost.
The cost of using an LLM depends on the number of tokens involved in each query. This includes both the tokens in the text you submit to the LLM (input tokens) and the tokens in the text the LLM generates within its response (output tokens). More tokens mean a higher cost for each query.
To see how many tokens a request could cost, use the inspect area within the test panel to review the JSON token data.
While OpenDialog provides token data when using LLM actions, the exact number of tokens your provider may bill you for cannot be guaranteed.
Token calculation varies based on the LLM you use and the specific model selected within that LLM.
Frequently asked questions
Do I need my own LLM to create an action?
No. When creating a LLM action, use OpenAI and toggle 'OpenDialog managed'. You will be able to use one of the OpenDialog LLM configurations (this option only appears within OpenAI).
For Azure OpenAI or a custom configuration, you will need your own LLM credentials to use it.
Do I have to configure the advanced settings section?
No, all settings in this area are optional, however, we would recommend using the test panel to better understand the cause and effect of the settings you choose.
My LLM action is failing when I run a test. What do I do?
There will be an indication of what is causing the failure within the test panel, with some instructions on what could be causing it.
Can I re-use an LLM action elsewhere in my conversation?
Yes. You can add the same LLM action anywhere within your conversation by adding it to the intent. When this intent is matched during a conversation, the action will be run.
How do I add my LLM action to an intent in the conversation designer?
• Navigate to the correct intent within the designer.
• Within the sidebar, click 'Add conditions, actions & attributes'.
• To add your action, click '+ Add new action' and select your action from the drop-down.
• Within a following text message, add the attribute {llm_response}
Last updated