Bemærk
Adgang til denne side kræver godkendelse. Du kan prøve at logge på eller ændre mapper.
Adgang til denne side kræver godkendelse. Du kan prøve at ændre mapper.
Parses a column containing binary data (blob) and returns a VariantType. Throws an exception if the input blob is invalid.
Syntax
from pyspark.databricks.sql import functions as dbf
dbf.ai_parse_document(col=<col>, options=<options>)
Parameters
| Parameter | Type | Description |
|---|---|---|
col |
pyspark.sql.Column or binary data |
A column or column name containing the binary blobs to parse. |
options |
dict, optional |
A dictionary of options to control the parsing behavior. |
Returns
pyspark.sql.Column: A new column of VariantType containing the document parse result. The result includes a document object with pages and elements arrays, along with error_status and metadata. Each element represents a discrete unit of content within the parsed document, such as a text paragraph, table, figure, or layout marker. For the full output schema and details about elements, see ai_parse_document.