© 2025 Direct Cursus Technology L.L.C.
Yandex MetaData Hub

Updating a data ingestion

Written by
Yandex Cloud
Updated on October 15, 2025

Note

This feature is in the Preview stage.

Management console
  1. In the management console, select the resource folder where you created the metadata catalog.

  2. Select Yandex MetaData Hub.

  3. In the left-hand panel, select Data Catalog.

  4. In the list that opens, select the metadata catalog that contains the ingestion you want to update.

  5. In the left-hand panel, select Ingestions.

  6. In the list of ingestions, click the options icon in the row of the ingestion you want to update and select Edit.

  7. Edit its settings:

    • In the Name field, specify a new unique name for the ingestion.

    • Optionally, edit the ingestion description.

    • Select a new data source.

    • Under PostgreSQL Ingestion Configuration:

      • Update the ingestion schedule:

        • Monthly: Select the dates and the ingestion start and end time.

        • Weekly: Select the days of the week and the ingestion start and end time.

          Note

          With a Monthly or Weekly schedule, the ingestion starts at the specified time and stops as soon as the new data has been ingested. If errors occur during ingestion, it restarts until the data has been ingested or the specified time window ends.

        • Daily: Select time intervals for ingestion.

        • Manually: The ingestion runs only when you start it manually.

      • Optionally, under Data Filters, use regular expressions to specify which databases and database objects to include in or exclude from the ingestion.

      • Under Metadata Types, select the metadata types to extract from the source.
      • Optionally, under Data Profiling:
        • Select Enable Profiling to perform data profiling, i.e., analysis and collection of statistics on the data being extracted.
        • Select Profile Table Level Only to skip column-level profiling. With this option on, data characteristics are collected only for the table as a whole.
        • In the Max Workers field, specify the number of computing threads for profiling.
        • In the Sample Size field, specify the number of rows to sample for column profiling. This setting applies only when the Use Sampling option is enabled.
        • In the Table Size Limit (GB) field, specify the size in GB above which a table is excluded from profiling.
        • In the Table Row Count Limit field, specify the number of rows above which a table is excluded from profiling.
        • Specify which data characteristics to extract from the source:
          • include_field_null_count: Number of NULL values per table or column.
          • include_field_distinct_count: Number of distinct values per table or column.
          • include_field_min_value: Minimum value per table or column.
          • include_field_max_value: Maximum value per table or column.
          • include_field_mean_value: Mean (average) value per table or column.
          • include_field_median_value: Median value per table or column.
          • include_field_stddev_value: Standard deviation per table or column.
          • include_field_sample_values: Sample values, i.e., several consecutive values from each column.
      • Under Metadata Processing, select the image for metadata processing.
        • Enable Use File Cache to improve ingestion performance.
  8. Click Apply.
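The Data Filters setting in step 7 relies on regular expressions to include or exclude databases and database objects. Before saving the ingestion, it can help to check locally which names a set of patterns would select. The following is a minimal sketch, not the service's implementation: the function name filter_names, the sample table names, and the matching rules (an empty include list means "include everything"; a pattern must match the whole name) are all assumptions made for illustration.

```python
import re
from typing import Iterable, List


def filter_names(names: Iterable[str], include: List[str], exclude: List[str]) -> List[str]:
    """Keep names that match an include pattern and no exclude pattern.

    Assumptions (not confirmed by the service documentation):
    - an empty include list means "include everything";
    - a pattern must match the whole name (re.fullmatch).
    """
    include_re = [re.compile(p) for p in include]
    exclude_re = [re.compile(p) for p in exclude]
    kept = []
    for name in names:
        included = not include_re or any(r.fullmatch(name) for r in include_re)
        excluded = any(r.fullmatch(name) for r in exclude_re)
        if included and not excluded:
            kept.append(name)
    return kept


# Hypothetical object names in the form database.schema.table.
tables = ["sales.public.orders", "sales.public.orders_tmp", "hr.public.staff"]
selected = filter_names(tables, include=[r"sales\..*"], exclude=[r".*_tmp"])
print(selected)  # ['sales.public.orders']
```

In this sketch, exclusion takes priority over inclusion, so a temporary table in an included database is still dropped. The service may resolve such conflicts differently, so treat the result as a rough preview only.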

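To make the profiling characteristics from step 7 more concrete, the sketch below computes the same kinds of statistics for a single column in plain Python. It only illustrates what each characteristic means. The function name profile_column, the result keys, the use of the sample (rather than population) standard deviation, and taking the first few values as the sample are assumptions and may not match what the service does.

```python
import statistics
from typing import Any, Dict, List, Optional


def profile_column(values: List[Optional[float]], sample_size: int = 3) -> Dict[str, Any]:
    """Compute illustrative column statistics; None stands in for NULL."""
    non_null = [v for v in values if v is not None]
    has_data = bool(non_null)
    return {
        "null_count": len(values) - len(non_null),
        "distinct_count": len(set(non_null)),
        "min_value": min(non_null) if has_data else None,
        "max_value": max(non_null) if has_data else None,
        "mean_value": statistics.mean(non_null) if has_data else None,
        "median_value": statistics.median(non_null) if has_data else None,
        # Assumption: sample standard deviation, which needs at least two values.
        "stddev_value": statistics.stdev(non_null) if len(non_null) >= 2 else None,
        # Assumption: the first few consecutive values serve as the sample.
        "sample_values": values[:sample_size],
    }


# Hypothetical column with one NULL value.
profile = profile_column([10, 20, 20, None, 50])
for key, value in profile.items():
    print(f"{key}: {value}")
```

Each key corresponds to one of the include_field_* options with the prefix dropped. Leaving an option unselected would simply mean that statistic is not collected.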