Creating a schema registry to deliver data in Debezium CDC format from Apache Kafka®
You can create a schema registry to track schema evolution for an Apache Kafka® or Yandex Managed Service for Apache Kafka® source using Yandex Data Transfer. This tutorial tracks the evolution of schemas for a Managed Service for Apache Kafka® source.
To track schema changes for a Managed Service for Apache Kafka® source:
If you no longer need the resources you created, delete them.
Getting started
Sign up for Yandex Cloud and create a billing account:
- Navigate to the management console
and log in to Yandex Cloud or create a new account. - On the Yandex Cloud Billing
page, make sure you have a billing account linked and it has theACTIVEorTRIAL_ACTIVEstatus. If you do not have a billing account, create one and link a cloud to it.
If you have an active billing account, you can create or select a folder for your infrastructure on the cloud page
Learn more about clouds and folders here.
Required paid resources
- Managed Service for Apache Kafka® cluster, which includes the use of computing resources allocated to hosts, storage and backup size (see Managed Service for Apache Kafka® pricing).
- Each transfer: use of computing resources and the number of transferred data rows (see Data Transfer pricing).
Set up your infrastructure
Create a Managed Service for Apache Kafka® cluster of any suitable configuration. If you intend to connect to broker hosts from the internet, enable public access. In this case, you can only connect to them using SSL.
Create a schema registry
- In the management console
, select the folder you are going to use to work with your schema registry. - Navigate to Yandex MetaData Hub.
- In the left-hand panel, select
Schema Registry / Namespace. - Click Create namespace.
- Specify namespace settings. After saving a namespace, you can use its ID to get the Schema Registry URL:
{namespace_id}.schema-registry.yandexcloud.net:443.
Prepare and run your transfer
-
Open the management console
. -
Navigate to Yandex Data Transfer.
-
Create an Apache Kafka® source endpoint with the following settings:
-
Database type: Apache Kafka®.
-
Connection type: Manual setup.
-
Installation type: Managed Service for Apache Kafka® cluster.
-
Under Managed Service for Apache Kafka® cluster:
- Managed Service for Apache Kafka® cluster: Select the Managed Service for Apache Kafka® cluster you created previously.
- Authentication: SASL.
- Username and Password: User credentials in the Managed Service for Apache Kafka® cluster.
-
Under Advanced settings → Conversion rules:
- Conversion rules: Debezium CDC parser.
- Schema Registry connection settings: On Premise Schema Registry.
- Schema Registry URL: Schema Registry URL you got earlier.
-
-
Create and run a data transfer.
Delete the resources you created
Note
Before deleting the resources, deactivate the transfer.
Some resources are not free of charge. Delete the resources you no longer need to avoid paying for them: