The Data Scan Async API’s PUT method starts an asynchronous scan of column values in a specified data source. This document describes how to call the API, the required input parameters, and the expected output.
Note: To obtain the <auth_token>, refer to the Step-by-Step Guide to Obtain Your Auth Token.
# Import the ProtectoVault class from the protecto_ai module
from protecto_ai import ProtectoVault
# Create an instance of ProtectoVault with your authentication token
obj = ProtectoVault("<auth_token>")
HTTP Method: PUT
Endpoint: /data-scan-async
Code Snippet
[
  {
    "data_source_name": "Datasource1",
    "object_name": ["db1", "schema1", "table1"],
    "data_samples": [
      {
        "column_name": "column_1",
        "samples": ["s1", "s2", "s3", "s4", "s5"]
      }
    ]
  }
]
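The payload above can also be assembled programmatically. The following is a minimal sketch using only the Python standard library. The helper names `build_scan_payload` and `prepare_scan_request`, the base URL, and the `Bearer` authorization scheme are assumptions made for illustration and are not taken from the Protecto documentation. The request is prepared but not sent.

```python
import json
import urllib.request

# Hypothetical base URL; substitute the host for your deployment.
BASE_URL = "https://protecto.example.invalid"


def build_scan_payload(data_source_name, object_name, columns):
    """Build the request body: a list containing one scan entry.

    `columns` maps each column name to a list of sample values.
    """
    return [
        {
            "data_source_name": data_source_name,
            "object_name": list(object_name),
            "data_samples": [
                {"column_name": name, "samples": list(samples)}
                for name, samples in columns.items()
            ],
        }
    ]


def prepare_scan_request(payload, auth_token):
    """Prepare (but do not send) a PUT request to /data-scan-async."""
    return urllib.request.Request(
        url=BASE_URL + "/data-scan-async",
        data=json.dumps(payload).encode("utf-8"),
        method="PUT",
        headers={
            "Content-Type": "application/json",
            # Assumption: the token is passed as a bearer token.
            "Authorization": "Bearer " + auth_token,
        },
    )


payload = build_scan_payload(
    "Datasource1",
    ["db1", "schema1", "table1"],
    {"column_1": ["s1", "s2", "s3", "s4", "s5"]},
)
request = prepare_scan_request(payload, "<auth_token>")
```

Passing `request` to `urllib.request.urlopen` would send it, once the placeholder URL and token are replaced with real values.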
Input Parameters:
- data_source_name:
  - Description: A user-defined identifier for the data source. Any name can be used.
  - Type: String
  - Example:
    "Datasource1"
- object_name:
  - Description: The fully qualified name of the table or object to scan, given as a list of its parts (for example, database, schema, and table).
  - Type: List of strings
  - Example:
    ["db1", "schema1", "table1"]
- data_samples:
  - Description: The list of columns to scan and their sample values. Each item contains a column_name (string) and samples (a list of sample values from that column).
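Before sending a request, it can help to check an entry against the parameter descriptions above. A minimal sketch follows. `validate_scan_entry` is a hypothetical helper, and the rules it enforces are inferred from the sample payload in this document rather than from a published schema.

```python
def validate_scan_entry(entry):
    """Return a list of problems found in one scan entry (empty if none)."""
    problems = []

    if not isinstance(entry.get("data_source_name"), str):
        problems.append("data_source_name must be a string")

    object_name = entry.get("object_name")
    if not (isinstance(object_name, list)
            and all(isinstance(part, str) for part in object_name)):
        problems.append("object_name must be a list of strings")

    data_samples = entry.get("data_samples")
    if not isinstance(data_samples, list) or not data_samples:
        problems.append("data_samples must be a non-empty list")
    else:
        for i, column in enumerate(data_samples):
            if not isinstance(column, dict):
                problems.append(f"data_samples[{i}] must be an object")
                continue
            if not isinstance(column.get("column_name"), str):
                problems.append(f"data_samples[{i}].column_name must be a string")
            if not isinstance(column.get("samples"), list):
                problems.append(f"data_samples[{i}].samples must be a list")

    return problems


good_entry = {
    "data_source_name": "Datasource1",
    "object_name": ["db1", "schema1", "table1"],
    "data_samples": [{"column_name": "column_1", "samples": ["s1", "s2"]}],
}
# object_name given as a dotted string instead of a list of parts.
bad_entry = dict(good_entry, object_name="schema1.table1")

good_problems = validate_scan_entry(good_entry)
bad_problems = validate_scan_entry(bad_entry)
```

Returning a list of problems rather than raising on the first one lets a caller report every issue in an entry at once.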
Output:
When the scan is initiated successfully, the API responds with a tracking ID that uniquely identifies the scan.
{
  "data": {
    "tracking_id": "47882682-9f38-4f45-afec-daadaa1b230b"
  },
  "success": "true",
  "error": {
    "message": ""
  }
}
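A client will typically want to extract the tracking ID from this response and detect failures. The following is a minimal sketch, assuming the response body has exactly the shape shown above. `extract_tracking_id` is a hypothetical helper, and accepting either the string "true" or a boolean true for `success` is a defensive assumption, since the sample shows the value as a string.

```python
import json


def extract_tracking_id(response_text):
    """Return the tracking ID from a response body, or raise on failure."""
    body = json.loads(response_text)

    # The sample response encodes success as the string "true";
    # str() also lets a boolean True pass this check.
    if str(body.get("success")).lower() != "true":
        message = (body.get("error") or {}).get("message", "")
        raise RuntimeError(
            "Data scan request failed: " + (message or "unknown error")
        )

    return body["data"]["tracking_id"]


sample_response = """
{
  "data": {"tracking_id": "47882682-9f38-4f45-afec-daadaa1b230b"},
  "success": "true",
  "error": {"message": ""}
}
"""
tracking_id = extract_tracking_id(sample_response)
```

The returned ID can then be stored so the status of the scan can be looked up later.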