AI assistant

Written by

Updated at November 6, 2025

General questions about AI assistants
Tokens and billing
Working with indexes
Working with functions and tools

General questions about AI assistants

How do I write effective instructions for an AI assistant?

When creating an AI assistant, provide a detailed plain-language description of what your assistant should do in the instruction field. For example:

You are a support specialist. Your task is to answer questions on the topic. 
Stick strictly to the context.
If the answer is not in the context, simply state that, without making assumptions.
Keep your responses short but informative.

Does the assistant wait for the model to complete its response before proceeding to the next event?

Yes. A single assistant run triggers a single model call. The model receives your prompt, invokes the required tools, waits for the results, and then generates a response.

Can I cancel a request and interrupt an ongoing model response?

No, you cannot cancel a submitted request. To limit the model's response length, use the maxTokens parameter.

Can I send multiple requests to the same thread from different assistants concurrently?

Yes, there are no restrictions on concurrent use.

How do I prevent the transfer of context from the previous request?

You can specify the number of thread messages to send to the model with each new request using the thread_num_messages parameter. This way you can control how much context to use for the response, including messages from both the user and the model.

You can also create a new thread for each request and delete the old one.

Can the assistant detect images in text files?

We plan to add support for image input.

Does AI Studio have native monitoring and quality assessment tools?

We are actively developing AI Studio and plan to introduce monitoring and quality assessment tools.

Which languages does the AI assistant support?

The main YandexGPT Pro languages are Russian and English. Soon, we will release new large open-source models that will work well with other languages.

What are the differences between the RC and Latest versions?

We are continuously enhancing the response quality and features of our models. Once internal metrics show the new model version is ready, we publish the updates to the RC branch for testing and notify users in our release notes.

Learn more about the model lifecycle here.

Tokens and billing

How do I spend tokens when using the AI assistant?

Just like with generative models, you pay for both request and model response tokens. This also includes context from the knowledge base and chat history. You can use the maxTokens parameter to limit the number of tokens in the model's response.

How can I estimate the number of tokens in text files beforehand?

To estimate the text size in tokens, use a tokenizer. Learn more in Estimating prompt size in tokens.

Do I get charged for the model's response to a request that triggered the ethics filter?

No, you do not get charged for such responses.

Do I get charged for creating search indexes, uploading files through the Files API, or storing files and indexes?

Note

We do not recommend using AI Assistant API in new projects. To create AI agents, use the Responses API.

Creating or storing files, threads, or indexes is free of charge.

Working with indexes

How do I set up a knowledge base for my AI assistant?

To set up a knowledge base for your AI assistant:

Create your documents as Markdown files.
Keep as much information in plain text as possible.
Remove any footnotes and comments to make the content concise and clear.
If your documents contain tables, convert them to Markdown format.
Make sure each text chunk is big enough to include the largest tables from your documents.

Add usage examples to the function descriptions.
Set clear conditions for function calls in the instructions for your AI assistant.
Expand parameter descriptions by specifying typical values.

How do I manage function selection and calling conditions in the assistant?

Implement fallback logic to call a function when it is not explicitly selected but the input contains relevant keywords. Use an external intent analyzer to improve recognition quality.

AI assistant

General questions about AI assistants

How do I write effective instructions for an AI assistant?

Does the assistant wait for the model to complete its response before proceeding to the next event?

Can I cancel a request and interrupt an ongoing model response?

Can I send multiple requests to the same thread from different assistants concurrently?

How do I prevent the transfer of context from the previous request?

Can the assistant detect images in text files?

Does AI Studio have native monitoring and quality assessment tools?

Which languages does the AI assistant support?

What are the differences between the RC and Latest versions?

Tokens and billing

How do I spend tokens when using the AI assistant?

How can I estimate the number of tokens in text files beforehand?

Do I get charged for the model's response to a request that triggered the ethics filter?

Do I get charged for creating search indexes, uploading files through the Files API, or storing files and indexes?

Working with indexes

How do I set up a knowledge base for my AI assistant?

How can I build an assistant that can search documents, use them to generate responses, and provide source links?

How do I add, update, or delete documents from the index?

How do I upload files directly from an object storage without saving them to my computer?

What method should I use to upload files from RAM?

What happens if I delete a file used in the index?

Can I set up my AI assistant to provide no links when the index has no relevant content?

Can I connect multiple indexes to the same assistant?

Can I select a specific file for my AI assistant to search for an answer?

Working with functions and tools

Can the model work with a toolchain?

How can I improve the quality of function calls?

How do I manage function selection and calling conditions in the assistant?

Was the article helpful?

AI assistant

General questions about AI assistantsGeneral questions about AI assistants

How do I write effective instructions for an AI assistant?How do I write effective instructions for an AI assistant?

Does the assistant wait for the model to complete its response before proceeding to the next event?Does the assistant wait for the model to complete its response before proceeding to the next event?

Can I cancel a request and interrupt an ongoing model response?Can I cancel a request and interrupt an ongoing model response?

Can I send multiple requests to the same thread from different assistants concurrently?Can I send multiple requests to the same thread from different assistants concurrently?

How do I prevent the transfer of context from the previous request?How do I prevent the transfer of context from the previous request?

Can the assistant detect images in text files?Can the assistant detect images in text files?

Does AI Studio have native monitoring and quality assessment tools?Does AI Studio have native monitoring and quality assessment tools?

Which languages does the AI assistant support?Which languages does the AI assistant support?

What are the differences between the RC and Latest versions?What are the differences between the RC and Latest versions?

Tokens and billingTokens and billing

How do I spend tokens when using the AI assistant?How do I spend tokens when using the AI assistant?

How can I estimate the number of tokens in text files beforehand?How can I estimate the number of tokens in text files beforehand?

Do I get charged for the model's response to a request that triggered the ethics filter?Do I get charged for the model's response to a request that triggered the ethics filter?

Do I get charged for creating search indexes, uploading files through the Files API, or storing files and indexes?Do I get charged for creating search indexes, uploading files through the Files API, or storing files and indexes?

Working with indexesWorking with indexes

How do I set up a knowledge base for my AI assistant?How do I set up a knowledge base for my AI assistant?

How can I build an assistant that can search documents, use them to generate responses, and provide source links?How can I build an assistant that can search documents, use them to generate responses, and provide source links?

How do I add, update, or delete documents from the index?How do I add, update, or delete documents from the index?

How do I upload files directly from an object storage without saving them to my computer?How do I upload files directly from an object storage without saving them to my computer?

What method should I use to upload files from RAM?What method should I use to upload files from RAM?

What happens if I delete a file used in the index?What happens if I delete a file used in the index?

Can I set up my AI assistant to provide no links when the index has no relevant content?Can I set up my AI assistant to provide no links when the index has no relevant content?

Can I connect multiple indexes to the same assistant?Can I connect multiple indexes to the same assistant?

Can I select a specific file for my AI assistant to search for an answer?Can I select a specific file for my AI assistant to search for an answer?

Working with functions and toolsWorking with functions and tools

Can the model work with a toolchain?Can the model work with a toolchain?

How can I improve the quality of function calls?How can I improve the quality of function calls?

How do I manage function selection and calling conditions in the assistant?How do I manage function selection and calling conditions in the assistant?

Was the article helpful?

General questions about AI assistants

How do I write effective instructions for an AI assistant?

Does the assistant wait for the model to complete its response before proceeding to the next event?

Can I cancel a request and interrupt an ongoing model response?

Can I send multiple requests to the same thread from different assistants concurrently?

How do I prevent the transfer of context from the previous request?

Can the assistant detect images in text files?

Does AI Studio have native monitoring and quality assessment tools?

Which languages does the AI assistant support?

What are the differences between the RC and Latest versions?

Tokens and billing

How do I spend tokens when using the AI assistant?

How can I estimate the number of tokens in text files beforehand?

Do I get charged for the model's response to a request that triggered the ethics filter?

Do I get charged for creating search indexes, uploading files through the Files API, or storing files and indexes?

Working with indexes

How do I set up a knowledge base for my AI assistant?

How can I build an assistant that can search documents, use them to generate responses, and provide source links?

How do I add, update, or delete documents from the index?

How do I upload files directly from an object storage without saving them to my computer?

What method should I use to upload files from RAM?

What happens if I delete a file used in the index?

Can I set up my AI assistant to provide no links when the index has no relevant content?

Can I connect multiple indexes to the same assistant?

Can I select a specific file for my AI assistant to search for an answer?

Working with functions and tools

Can the model work with a toolchain?

How can I improve the quality of function calls?

How do I manage function selection and calling conditions in the assistant?