Vision OCR API, REST: TextRecognitionAsync.GetRecognition

Written by

Updated at August 8, 2025

HTTP request
Query parameters
Response
TextAnnotation
Block
Polygon
Vertex
Line
Word
TextSegments
DetectedLanguage
Entity
Table
TableCell
Picture

To get recognition results.

HTTP request

GET https://ocr.api.cloud.yandex.net/ocr/v1/getRecognition

Query parameters

Field

Description

operationId

string

Required field. Operation ID of async recognition request.

Response

HTTP Code: 200 - OK

{
  "textAnnotation": {
    "width": "string",
    "height": "string",
    "blocks": [
      {
        "boundingBox": {
          "vertices": [
            {
              "x": "string",
              "y": "string"
            }
          ]
        },
        "lines": [
          {
            "boundingBox": {
              "vertices": [
                {
                  "x": "string",
                  "y": "string"
                }
              ]
            },
            "text": "string",
            "words": [
              {
                "boundingBox": {
                  "vertices": [
                    {
                      "x": "string",
                      "y": "string"
                    }
                  ]
                },
                "text": "string",
                "entityIndex": "string",
                "textSegments": [
                  {
                    "startIndex": "string",
                    "length": "string"
                  }
                ]
              }
            ],
            "textSegments": [
              {
                "startIndex": "string",
                "length": "string"
              }
            ],
            "orientation": "string"
          }
        ],
        "languages": [
          {
            "languageCode": "string"
          }
        ],
        "textSegments": [
          {
            "startIndex": "string",
            "length": "string"
          }
        ],
        "layoutType": "string"
      }
    ],
    "entities": [
      {
        "name": "string",
        "text": "string"
      }
    ],
    "tables": [
      {
        "boundingBox": {
          "vertices": [
            {
              "x": "string",
              "y": "string"
            }
          ]
        },
        "rowCount": "string",
        "columnCount": "string",
        "cells": [
          {
            "boundingBox": {
              "vertices": [
                {
                  "x": "string",
                  "y": "string"
                }
              ]
            },
            "rowIndex": "string",
            "columnIndex": "string",
            "columnSpan": "string",
            "rowSpan": "string",
            "text": "string",
            "textSegments": [
              {
                "startIndex": "string",
                "length": "string"
              }
            ]
          }
        ]
      }
    ],
    "fullText": "string",
    "rotate": "string",
    "markdown": "string",
    "pictures": [
      {
        "boundingBox": {
          "vertices": [
            {
              "x": "string",
              "y": "string"
            }
          ]
        },
        "score": "string"
      }
    ]
  },
  "page": "string"
}

Field

Description

textAnnotation

TextAnnotation

Recognized text blocks in page or text from entities.

page

string (int64)

Page number in PDF file.

TextAnnotation

Field	Description
width	string (int64) Page width in pixels.
height	string (int64) Page height in pixels.
blocks[]	Block Recognized text blocks in this page.
entities[]	Entity Recognized entities.
tables[]	Table
fullText	string Full text recognized from image.
rotate	enum (Angle) Angle of image rotation. `ANGLE_UNSPECIFIED` `ANGLE_0` `ANGLE_90` `ANGLE_180` `ANGLE_270`
markdown	string Full markdown (without pictures inside) from image. Available only in markdown and math-markdown models.
pictures[]	Picture List of pictures locations from image.

Block

Field	Description
boundingBox	Polygon Area on the page where the text block is located.
lines[]	Line Recognized lines in this block.
languages[]	DetectedLanguage A list of detected languages
textSegments[]	TextSegments Block position from full_text string.
layoutType	enum (LayoutType) Block layout type. `LAYOUT_TYPE_UNSPECIFIED` `LAYOUT_TYPE_UNKNOWN` `LAYOUT_TYPE_TEXT` `LAYOUT_TYPE_HEADER` `LAYOUT_TYPE_SECTION_HEADER` `LAYOUT_TYPE_FOOTER` `LAYOUT_TYPE_FOOTNOTE` `LAYOUT_TYPE_PICTURE` `LAYOUT_TYPE_CAPTION` `LAYOUT_TYPE_TITLE` `LAYOUT_TYPE_LIST`

Polygon

Field

Description

vertices[]

Vertex

The bounding polygon vertices.

Vertex

Field

Description

string (int64)

X coordinate in pixels.

string (int64)

Y coordinate in pixels.

Line

Field	Description
boundingBox	Polygon Area on the page where the line is located.
text	string Recognized text.
words[]	Word Recognized words.
textSegments[]	TextSegments Line position from full_text string.
orientation	enum (Angle) Angle of line rotation. `ANGLE_UNSPECIFIED` `ANGLE_0` `ANGLE_90` `ANGLE_180` `ANGLE_270`

Word

Field	Description
boundingBox	Polygon Area on the page where the word is located.
text	string Recognized word value.
entityIndex	string (int64) ID of the recognized word in entities array.
textSegments[]	TextSegments Word position from full_text string.

TextSegments

Field

Description

startIndex

string (int64)

Start character position from full_text string.

length

string (int64)

Text segment length.

DetectedLanguage

Field

Description

languageCode

string

Detected language code.

Entity

Field

Description

name

string

Entity name.

text

string

Recognized entity text.

Table

Field	Description
boundingBox	Polygon Area on the page where the table is located.
rowCount	string (int64) Number of rows in table.
columnCount	string (int64) Number of columns in table.
cells[]	TableCell Table cells.

TableCell

Field	Description
boundingBox	Polygon Area on the page where the table cell is located.
rowIndex	string (int64) Row index.
columnIndex	string (int64) Column index.
columnSpan	string (int64) Column span.
rowSpan	string (int64) Row span.
text	string Text in cell.
textSegments[]	TextSegments Table cell position from full_text string.

Picture

Field

Description

boundingBox

Polygon

Area on the page where the picture is located.

score

string

Confidence score of picture location.

Vision OCR API, REST: TextRecognitionAsync.GetRecognition

HTTP requestHTTP request

Query parametersQuery parameters

ResponseResponse

TextAnnotationTextAnnotation

BlockBlock

PolygonPolygon

VertexVertex

LineLine

WordWord

TextSegmentsTextSegments

DetectedLanguageDetectedLanguage

EntityEntity

TableTable

TableCellTableCell

PicturePicture

Was the article helpful?

HTTP request

Query parameters

Response

TextAnnotation

Block

Polygon

Vertex

Line

Word

TextSegments

DetectedLanguage

Entity

Table

TableCell

Picture