curl --request POST \
--url https://api.example.com/api/data/pii-scan \
--header 'Content-Type: application/json' \
--data '
{
"upload_id": "<string>",
"scan_id": "<string>"
}
'{
"findings": [
{}
],
"summary": "<string>",
"pii_detected": true,
"error": "<string>",
"message": "<string>"
}Run PII detection on uploaded dataset to identify sensitive data exposure
curl --request POST \
--url https://api.example.com/api/data/pii-scan \
--header 'Content-Type: application/json' \
--data '
{
"upload_id": "<string>",
"scan_id": "<string>"
}
'{
"findings": [
{}
],
"summary": "<string>",
"pii_detected": true,
"error": "<string>",
"message": "<string>"
}This endpoint analyzes the uploaded dataset for Personally Identifiable Information (PII) exposure. It uses AI-powered analysis combined with regex-based detection to identify sensitive data across all columns and rows.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/Basit-Ali0/Yggdrasil/llms.txt
Use this file to discover all available pages before exploring further.
/api/data/upload.upload_id only.curl -X POST https://your-domain.com/api/data/pii-scan \
-H "Content-Type: application/json" \
-d '{
"upload_id": "a3f12b45-8c7d-4e9f-b1a2-3c4d5e6f7g8h",
"scan_id": "f7e6d5c4-b3a2-1098-7654-321fedcba098"
}'
column_name (string) - Name of the column containing PIIpii_type (string) - Type of PII detectedseverity (string) - Risk severity levelconfidence (number) - AI confidence score (0-100)match_count (number) - Number of rows with PII matchestotal_rows (number) - Total rows analyzedmasked_samples (array) - Sample values with PII maskeddetection_regex (string) - Regex pattern used for detectionviolation_text (string) - Description of the privacy risksuggestion (string) - Recommended remediation actionemail - Email addressesphone - Phone numbersssn - Social Security Numbersname - Personal namesaddress - Physical addressesdate_of_birth - Birth datescredit_card - Credit card numbersip_address - IP addressespassport - Passport numbersnational_id - National ID numbersbank_account - Bank account numbersother - Other PII typesCRITICAL - SSN, credit card, passport (immediate risk)HIGH - Email, phone, bank account (high risk)MEDIUM - Name, address, date of birth, IP address (moderate risk){
"findings": [
{
"column_name": "customer_email",
"pii_type": "email",
"severity": "HIGH",
"confidence": 95,
"match_count": 1247,
"total_rows": 1250,
"masked_samples": [
"j***@example.com",
"s***@company.org",
"a***@domain.net"
],
"detection_regex": "[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\\.[a-zA-Z]{2,}",
"violation_text": "Column contains email addresses which are considered PII under GDPR and CCPA. Exposure risk: HIGH. This data could be used to identify individuals and may require consent for processing.",
"suggestion": "Hash email addresses using SHA-256 or remove column if not required for analysis. Consider implementing email masking for non-production environments."
},
{
"column_name": "account_holder",
"pii_type": "name",
"severity": "MEDIUM",
"confidence": 87,
"match_count": 1250,
"total_rows": 1250,
"masked_samples": [
"J*** D***",
"S*** W***",
"A*** M***"
],
"detection_regex": "^[A-Z][a-z]+ [A-Z][a-z]+$",
"violation_text": "Column contains personal names. Under GDPR Article 4, names are directly identifiable PII. This creates a moderate privacy risk.",
"suggestion": "Replace names with pseudonymized identifiers (e.g., USER_001) or use tokenization to preserve referential integrity while protecting identity."
},
{
"column_name": "ssn",
"pii_type": "ssn",
"severity": "CRITICAL",
"confidence": 98,
"match_count": 423,
"total_rows": 1250,
"masked_samples": [
"***-**-1234",
"***-**-5678",
"***-**-9012"
],
"detection_regex": "\\b\\d{3}-\\d{2}-\\d{4}\\b",
"violation_text": "CRITICAL: Column contains Social Security Numbers. This is highly sensitive PII that poses severe identity theft risk if exposed. Violates multiple regulations including GLBA and state privacy laws.",
"suggestion": "IMMEDIATE ACTION REQUIRED: Encrypt or remove SSN column. If retention is legally required, use field-level encryption with key management system. Never store SSNs in plain text."
}
],
"summary": "PII analysis complete. Found 3 columns with PII exposure across 1,250 rows. Severity breakdown: 1 CRITICAL, 1 HIGH, 1 MEDIUM. Immediate remediation required for SSN column.",
"pii_detected": true
}
Bad Request - Missing or invalid upload_idNot Found - Upload not found or expiredInternal Server Error - Unexpected server error{
"error": "Not Found",
"message": "Upload not found — may have expired"
}
pii_findings tablescan_id (if provided) and upload_idopen for review workflowpii_detected to false