Creating a trigger for Data Streams that invokes a container from Serverless Containers
Create a trigger for Data Streams that invokes a container from Serverless Containers when data is sent to a stream.
Note
The trigger for Data Streams receives and sends messages in JSON format.
Getting started
To create a trigger, you will need:
- A container for the trigger to invoke. If you do not have a container, create one.
- Optionally, a dead-letter queue for unprocessed messages from the container. If you do not have a queue, create one.
- Service accounts with the following permissions:
  - To invoke a container.
  - To read from the stream that will fire the trigger when data is sent to it.
  - Optionally, to write to a dead-letter queue.

  You can use the same service account or different ones. If you do not have a service account, create one.
- A stream that will fire the trigger when data is sent to it. If you do not have a stream, create one.
Creating a trigger
Note
The trigger is initiated within five minutes after it is created.
- In the management console, select the folder where you want to create a trigger.
- Navigate to Serverless Containers.
- In the left-hand panel, select Triggers.
- Click Create trigger.
- Under Basic settings:
  - Enter a name and description for the trigger.
  - In the Type field, select Data Streams.
  - In the Launched resource field, select Container.
- Under Data Streams settings, select a data stream and a service account with read and write permissions for that stream.
- Under Batch message settings, specify:
  - Waiting time, s. The values may range from 1 to 60 seconds. The default value is 1 second.
  - Batch size, B. The values may range from 1 B to 64 KB. The default value is 1 B.

  The trigger groups messages within the specified wait time period and sends them to the container. The total amount of data transmitted to a container may exceed the specified batch size if the data is transmitted as a single message. In all other cases, the amount of data does not exceed the batch size.
- Under Container settings, select a container and specify a service account to invoke it under.
- Optionally, under Repeat request settings:
  - In the Interval field, specify the time interval to retry invoking the container if the current attempt fails. The values may range from 10 to 60 seconds. The default value is 10 seconds.
  - In the Number of attempts field, specify the number of invocation retries before the trigger moves a message to the dead-letter queue. The values may range from 1 to 5. The default value is 1.
- Optionally, under Dead Letter Queue settings, select a dead-letter queue and a service account with write permissions for that queue.
- Click Create trigger.
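The batch size rule described under Batch message settings can be illustrated with a short Python sketch. It is not the service's implementation: the function name group_by_size is hypothetical, and the sketch covers only the size rule, ignoring the waiting time that also closes a batch.

```python
from typing import Iterable, List


def group_by_size(messages: Iterable[bytes], batch_size: int) -> List[List[bytes]]:
    """Pack messages into batches whose total size stays within batch_size bytes.

    A message larger than batch_size ends up in a batch of its own, which is
    the only case where a batch exceeds the limit.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1 byte")

    batches: List[List[bytes]] = []
    current: List[bytes] = []
    current_size = 0

    for message in messages:
        size = len(message)
        # Close the current batch if adding this message would exceed the limit.
        if current and current_size + size > batch_size:
            batches.append(current)
            current, current_size = [], 0
        current.append(message)
        current_size += size

    if current:
        batches.append(current)
    return batches
```

With a limit of 8 bytes, two 4-byte messages share a batch, while a 20-byte message is delivered alone even though it exceeds the limit.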
If you do not have the Yandex Cloud CLI yet, install and initialize it.
The folder used by default is the one specified when creating the CLI profile. To change the default folder, use the yc config set folder-id <folder_ID> command. You can also specify a different folder for any command using --folder-name or --folder-id. If you access a resource by its name, the search will be limited to the default folder. If you access a resource by its ID, the search will be global, i.e., through all folders based on access permissions.
To create a trigger that invokes a container, run this command:
yc serverless trigger create yds \
--name <trigger_name> \
--database <database_location> \
--stream <data_stream_name> \
--batch-size <message_batch_size> \
--batch-cutoff <maximum_wait_time> \
--stream-service-account-id <service_account_ID> \
--invoke-container-id <container_ID> \
--invoke-container-service-account-id <service_account_ID> \
--retry-attempts <number_of_retry_attempts> \
--retry-interval <interval_between_retry_attempts> \
--dlq-queue-id <dead-letter_queue_ID> \
--dlq-service-account-id <service_account_ID>
Where:
- --name: Trigger name.
- --database: Location of the YDB database associated with the stream in Data Streams.

  To find out where the database is located, run the yc ydb database list command. The database location is specified in the ENDPOINT column, in the database property, e.g., /ru-central1/b1gia87mbah2********/etn7hehf6gh3********.
- --stream: Stream name.
- --batch-size: Message batch size. This is an optional setting. The values may range from 1 B to 64 KB. The default value is 1 B.
- --batch-cutoff: Maximum wait time. This is an optional setting. The values may range from 1 to 60 seconds. The default value is 1 second. The trigger groups messages within the batch-cutoff period and sends them to the container. The total amount of data transmitted to a container may exceed batch-size if the data is transmitted as a single message. In all other cases, the amount of data does not exceed batch-size.
- --stream-service-account-id: ID of the service account with read and write permissions for the stream.
- --invoke-container-id: Container ID.
- --invoke-container-service-account-id: ID of the service account with permissions to invoke the container.
- --retry-attempts: Number of invocation retries before the trigger moves a message to the dead-letter queue. This is an optional parameter. The values may range from 1 to 5. The default value is 1.
- --retry-interval: Time to retry invoking the container if the current attempt fails. This is an optional parameter. The values may range from 10 to 60 seconds. The default value is 10 seconds.
- --dlq-queue-id: Dead-letter queue ID. This is an optional parameter.
- --dlq-service-account-id: ID of the service account with write permissions to the dead-letter queue. This is an optional parameter.
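If you script the setup, the value for --database can be pulled out of the endpoint string. The Python sketch below assumes that the ENDPOINT value is a URL carrying the location in a database query parameter. The host name in the example is a placeholder, and database_from_endpoint is a hypothetical helper, not part of the CLI.

```python
from urllib.parse import parse_qs, urlsplit


def database_from_endpoint(endpoint: str) -> str:
    """Return the value of the `database` query parameter of an endpoint URL."""
    query = parse_qs(urlsplit(endpoint).query)
    values = query.get("database")
    if not values:
        raise ValueError("endpoint has no 'database' property: " + endpoint)
    return values[0]


if __name__ == "__main__":
    # Placeholder endpoint; substitute the value printed by `yc ydb database list`.
    example = "grpcs://ydb.example.net:2135/?database=/ru-central1/<cloud_ID>/<database_ID>"
    print(database_from_endpoint(example))
```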
Result:
id: a1s5msktijh2********
folder_id: b1gmit33hgh2********
created_at: "2022-10-24T14:07:04.693126923Z"
name: data-streams-trigger
rule:
  data_stream:
    database: /ru-central1/b1gia87mbah2********/etn7hehh2********
    stream: streams-name
    service_account_id: ajep8qm0kh2********
    batch_settings:
      size: "1"
      cutoff: 1s
    invoke_container:
      container_id: bba5jb38o8h2********
      service_account_id: aje03adgd2h2********
      retry_settings:
        retry_attempts: "1"
        interval: 10s
      dead_letter_queue:
        queue-id: yrn:yc:ymq:ru-central1:b1gmit33ngh2********:dlq
        service-account-id: aje3lebfemh2********
status: ACTIVE
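When the command is generated by a script, the documented value ranges can be checked before the CLI is called. The Python sketch below assembles the argument list using only the flags listed above. The helper build_trigger_command is hypothetical, and two details are assumptions: that 64 KB means 64 * 1024 bytes, and that values are written with unit suffixes (for example 1b and 10s).

```python
from typing import List, Optional


def _check(name: str, value: int, low: int, high: int) -> None:
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}, got {value}")


def build_trigger_command(
    name: str,
    database: str,
    stream: str,
    stream_sa_id: str,
    container_id: str,
    container_sa_id: str,
    batch_size_bytes: int = 1,
    batch_cutoff_s: int = 1,
    retry_attempts: int = 1,
    retry_interval_s: int = 10,
    dlq_queue_id: Optional[str] = None,
    dlq_sa_id: Optional[str] = None,
) -> List[str]:
    """Validate the documented ranges and return the CLI argument list."""
    _check("batch size", batch_size_bytes, 1, 64 * 1024)  # assumes 1 KB = 1024 B
    _check("batch cutoff", batch_cutoff_s, 1, 60)
    _check("retry attempts", retry_attempts, 1, 5)
    _check("retry interval", retry_interval_s, 10, 60)

    cmd = [
        "yc", "serverless", "trigger", "create", "yds",
        "--name", name,
        "--database", database,
        "--stream", stream,
        "--batch-size", f"{batch_size_bytes}b",  # unit suffix is an assumption
        "--batch-cutoff", f"{batch_cutoff_s}s",
        "--stream-service-account-id", stream_sa_id,
        "--invoke-container-id", container_id,
        "--invoke-container-service-account-id", container_sa_id,
        "--retry-attempts", str(retry_attempts),
        "--retry-interval", f"{retry_interval_s}s",
    ]
    # The dead-letter queue flags are optional, so add them only when provided.
    if dlq_queue_id is not None:
        cmd += ["--dlq-queue-id", dlq_queue_id]
    if dlq_sa_id is not None:
        cmd += ["--dlq-service-account-id", dlq_sa_id]
    return cmd
```

The returned list can be passed to subprocess.run(cmd, check=True) on a machine where the CLI is installed and initialized.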
With Terraform
Terraform is distributed under the Business Source License.
For more information about the provider resources, see the relevant Terraform documentation.
If you do not have Terraform yet, install it and configure the Yandex Cloud provider.
To manage infrastructure using Terraform under a service account or user accounts (a Yandex account, a federated account, or a local user), authenticate using the appropriate method.
To create a trigger for Data Streams:
- Describe the trigger in the configuration file:

  resource "yandex_function_trigger" "my_trigger" {
    name = "<trigger_name>"
    container {
      id                 = "<container_ID>"
      service_account_id = "<service_account_ID>"
      retry_attempts     = "<number_of_retry_attempts>"
      retry_interval     = "<time_between_retry_attempts>"
    }
    data_streams {
      stream_name        = "<data_stream_name>"
      database           = "<database_location>"
      service_account_id = "<service_account_ID>"
      batch_cutoff       = "<maximum_wait_time>"
      batch_size         = "<message_batch_size>"
    }
    dlq {
      queue_id           = "<dead-letter_queue_ID>"
      service_account_id = "<service_account_ID>"
    }
  }

  Where:
  - name: Trigger name. Follow these naming requirements:
    - Length: between 3 and 63 characters.
    - It can only contain lowercase Latin letters, numbers, and hyphens.
    - It must start with a letter and cannot end with a hyphen.
  - container: Container settings:
    - id: Container ID.
    - service_account_id: ID of the service account with permissions to invoke the container.
    - retry_attempts: Number of invocation retries before the trigger moves a message to the dead-letter queue. This is an optional parameter. The values may range from 1 to 5. The default value is 1.
    - retry_interval: Time to retry invoking the container if the current attempt fails. This is an optional parameter. The values may range from 10 to 60 seconds. The default value is 10 seconds.
  - data_streams: Trigger settings:
    - stream_name: Data stream name.
    - database: Location of the YDB database associated with the stream in Data Streams.

      To find out where the database is located, run the yc ydb database list command. The database location is specified in the ENDPOINT column, in the database property, e.g., /ru-central1/b1gia87mba**********/etn7hehf6g*******.
    - service_account_id: ID of the service account with read and write permissions for the stream.
    - batch_cutoff: Maximum wait time. The values may range from 1 to 60 seconds. The default value is 1 second. The trigger groups messages within the batch_cutoff period and sends them to the container. The total amount of data transmitted to a container may exceed batch_size if the data is transmitted as a single message. In all other cases, the amount of data does not exceed batch_size.
    - batch_size: Message batch size. This is an optional setting. The values may range from 1 B to 64 KB. The default value is 1 B.
  - dlq: Dead-letter queue settings:
    - queue_id: Dead-letter queue ID.
    - service_account_id: ID of the service account with write permissions for the dead-letter queue.

  For more information about yandex_function_trigger properties, see this provider guide.
- Create the resources:
  - In the terminal, navigate to the configuration file directory.
  - Make sure the configuration is correct using this command:

    terraform validate

    If the configuration is valid, you will get this message:

    Success! The configuration is valid.
  - Run this command:

    terraform plan

    You will see a list of resources and their properties. No changes will be made at this step. Terraform will show any errors in the configuration.
  - Apply the configuration changes:

    terraform apply
  - Type yes and press Enter to confirm the changes.

  Terraform will create all the required resources. You can check the new resources using the management console or this CLI command:

  yc serverless trigger list
To create a trigger for Data Streams, use the create REST API method for the Trigger resource or the TriggerService/Create gRPC API call.
Checking the result
Make sure the trigger is working properly. To do this, view container logs that show information about invocations.
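For invocations to show up in the logs, the container has to write something on each request. The Python sketch below is one minimal way to do that. It rests on assumptions that are not stated on this page: that the trigger delivers each batch as an HTTP POST with a JSON body, that the batch is stored under a messages key, and that the listening port is passed in the PORT environment variable. The names summarize_batch, TriggerHandler, and serve are hypothetical.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Any, Dict


def summarize_batch(body: bytes) -> Dict[str, Any]:
    """Parse the JSON body and count the messages it carries."""
    payload = json.loads(body)
    # The "messages" key is an assumption about the payload layout.
    messages = payload.get("messages", []) if isinstance(payload, dict) else []
    if not isinstance(messages, list):
        messages = []
    return {"message_count": len(messages)}


class TriggerHandler(BaseHTTPRequestHandler):
    def do_POST(self) -> None:
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        try:
            summary = summarize_batch(body)
        except json.JSONDecodeError:
            # A non-2xx status reports the invocation as failed.
            self.send_response(400)
            self.end_headers()
            return
        # Anything printed to stdout is expected to appear in the container logs.
        print(json.dumps(summary), flush=True)
        self.send_response(200)
        self.end_headers()


def serve() -> None:
    # PORT is assumed to be provided by the runtime; 8080 is a local fallback.
    port = int(os.environ.get("PORT", "8080"))
    HTTPServer(("", port), TriggerHandler).serve_forever()
```

Calling serve() as the container entry point starts the listener. After data is sent to the stream, each delivered batch should produce one log line with the message count.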