Yandex Cloud
Search
Discuss with expertTry it for free
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
  • Marketplace
    • Featured
    • Infrastructure & Network
    • Data Platform
    • AI for business
    • Security
    • DevOps tools
    • Serverless
    • Monitoring & Resources
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
    • Price calculator
    • Pricing plans
  • Customer Stories
  • Documentation
  • Blog
© 2026 Direct Cursus Technology L.L.C.
Yandex MPP Analytics for PostgreSQL
  • Getting started
    • All guides
      • Overview
      • Creating an external table
      • Editing PXF settings
    • Connecting to an external file server (gpfdist)
    • Auxiliary utilities
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • Release notes

In this article:

  • Getting started
  • Get started with external tables using PXF
  1. Step-by-step guides
  2. Working with PXF
  3. Overview

Working with PXF

Written by
Yandex Cloud
Updated at June 3, 2026
  • Getting started
  • Get started with external tables using PXF

The Greenplum® Platform Extension Framework (PXF) protocol is used to access data in external databases.

Let's say there is a table with sales data over several years. It contains three data types:

  • Hot data over the last few months stored in MySQL®.
  • Warm data over the last few years stored in Yandex MPP Analytics for PostgreSQL.
  • Cold data over an earlier period stored in S3.

The colder the data, the less often it is accessed.

To distribute data across multiple DBMSs and enable access to them, PXF is used to create external tables, i.e., special DBMS objects that reference tables, buckets, or files from external sources. This section provides guidelines on how to create external tables that reference external DBMSs.

For such tables, you can specify external data source settings in the SQL query. Alternatively, you can create a source in Yandex MPP Analytics for PostgreSQL with the settings you need and provide that source in the SQL query.

In Yandex MPP Analytics for PostgreSQL clusters with Apache Cloudberry™, in addition to external tables, you can also create foreign tables via FDW. Such tables provide access to the same external DBMSs as external ones. However, foreign tables support both read and write operations within a single table, unlike external tables which are strictly read-only or write-only.

Getting startedGetting started

  1. In the Yandex MPP Analytics for PostgreSQL cluster's subnet, set up a NAT gateway and link a routing table.
  2. In the Yandex MPP Analytics for PostgreSQL cluster network, create a security group allowing all incoming and outgoing traffic from all addresses.

Get started with external tables using PXFGet started with external tables using PXF

  1. Add a data source to Yandex MPP Analytics for PostgreSQL. The steps for adding a source depend on the source connection type:

    • S3
    • JDBC
    • HDFS
    • Hive
  2. Create an external table using PXF.

  3. Optionally, update the default PXF settings.

Greenplum® and Greenplum Database® are registered trademarks or trademarks of Broadcom Inc. in the United States and/or other countries.

Was the article helpful?

Previous
Managing client processes and user sessions
Next
S3
© 2026 Direct Cursus Technology L.L.C.