Process a ∀Doc

POST /api/v8/partner/any-documents

Veryfi's Process a ∀Doc endpoint lets you submit unstructured documents, such as contracts, articles of incorporation and other documents, and extract their data as JSON. It accepts the following file formats: .pdf, .jpg, .jpeg, .png, .tiff, .tif, .bmp, .gif. The maximum file size is 20 MB and the minimum file size is 0.25 KB. The rate limit is 60 requests per second.
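The limits above can be checked on the client before a document is sent. The following is a minimal sketch, not part of the Veryfi API: the function name check_upload is hypothetical, and treating "MB" and "KB" as 1024-based units is an assumption, since the description does not say which is meant.

```python
from pathlib import Path

# Limits quoted in the endpoint description. Using 1024-based units is an
# assumption; the description does not state which convention applies.
MAX_BYTES = 20 * 1024 * 1024
MIN_BYTES = 256  # 0.25 KB
SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png", ".tiff", ".tif", ".bmp", ".gif"}


def check_upload(file_name, size_bytes):
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    if Path(file_name).suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {file_name}")
    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 20 MB")
    if size_bytes < MIN_BYTES:
        problems.append("file is smaller than 0.25 KB")
    return problems
```

This check only covers single documents in the listed formats; zipped uploads would need their own rule.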

Request

application/json
multipart/form-data

Body (application/json)

external_id (string | null)

Possible values: non-empty

A custom identification value. Use this if you would like to assign your own ID to documents. This parameter is useful when mapping this document to a service or resource outside Veryfi.

meta.tags string[]

Possible values: non-empty

Default value: ``

Tags you want to associate with the document.

package_path (string | null)

Possible values: non-empty

A path to a file in an S3 bucket, e.g. 'some/receipt.jpg'.

bucket (string | null)

Possible values: non-empty

An S3 bucket for 'package_path', e.g. 'documents'.

file_data (string | null)

Possible values: non-empty

Uploads a document as a base64-encoded string, either raw or in the data URI scheme. This is the least effective way to upload a document for processing; see file_urls or uploading zip files.
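Both accepted forms of file_data can be produced with the Python standard library. This is a sketch only; the helper name encode_file_data is hypothetical, and the fallback MIME type is an illustrative choice.

```python
import base64
import mimetypes


def encode_file_data(content, file_name, as_data_uri=False):
    """Encode raw bytes as a base64 string, optionally wrapped as a data URI."""
    encoded = base64.b64encode(content).decode("ascii")
    if not as_data_uri:
        return encoded
    # Guess the MIME type from the file name; fall back to a generic type.
    mime_type, _ = mimetypes.guess_type(file_name)
    return f"data:{mime_type or 'application/octet-stream'};base64,{encoded}"
```

For example, encode_file_data(b"hello", "x.png") returns "aGVsbG8=", and with as_data_uri=True it returns "data:image/png;base64,aGVsbG8=".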

file_url (string | null)

Possible values: non-empty

A URL to a publicly accessible document to be sent to Veryfi for processing.

file_urls string[]

Possible values: non-empty

An array of URLs to publicly accessible documents to be sent to Veryfi for processing.

file_name (string | null)

Possible values: non-empty

An optional filename. Useful to determine file type.

meta.device_data.uuid object

Device unique identifier

anyOf

MOD1
MOD2

string

meta.device_data.user_uuid object

User unique identifier, like a digital fingerprint (hashed login) used to access the app where they upload their documents. Used in fraud detection.

anyOf

MOD1
MOD2

string

max_pages_to_process (integer | null)

Possible values: >= 1 and <= 50

Default value: 20

The number of pages to process for the document. The limit is 50 pages per document.

blueprint_name (string | null)

Possible values: non-empty

The name of the extraction blueprint. Default blueprints include [auto_insurance_card, bill_of_lading, flight_itinerary, goods_received_note, incorporation_document, incorporation_document_latam, indian_passport, latam_passport, prescription_medication_label, product_nutrition_facts, restaurant_menu, shipping_label, uk_drivers_license, us_driver_license, us_health_insurance_card, us_passport, vehicle_registration, vendor_statement, work_order]

template_name (string | null) deprecated

Possible values: non-empty

Deprecated. The blueprint name which was used to extract the data. Same as blueprint_name.
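A JSON request using the parameters above might be assembled as follows. This is a sketch under stated assumptions: the base URL https://api.veryfi.com is not given in this section, authentication headers are not described here so the caller passes them in, and build_request is a hypothetical helper. The request is only constructed, not sent.

```python
import json
import urllib.request

BASE_URL = "https://api.veryfi.com"  # assumption: not stated in this section
ENDPOINT = "/api/v8/partner/any-documents"


def build_request(file_url, blueprint_name, auth_headers,
                  max_pages_to_process=None, external_id=None):
    """Build (but do not send) a POST request with an application/json body."""
    payload = {"file_url": file_url, "blueprint_name": blueprint_name}
    if max_pages_to_process is not None:
        # The documented range is 1 to 50 pages.
        if not 1 <= max_pages_to_process <= 50:
            raise ValueError("max_pages_to_process must be between 1 and 50")
        payload["max_pages_to_process"] = max_pages_to_process
    if external_id:
        payload["external_id"] = external_id
    headers = {"Content-Type": "application/json", "Accept": "application/json"}
    headers.update(auth_headers)  # authentication is outside this section's scope
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would send it; optional fields are left out of the payload rather than sent as null.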

Body (multipart/form-data)

meta.external_id (string | null)

Possible values: non-empty

External ID you want to associate with the document.

meta.tags string[]

Possible values: non-empty

Default value: ``

Tags you want to associate with the document.

external_id (string | null)

Possible values: non-empty

A custom identification value. Use this if you would like to assign your own ID to documents. This parameter is useful when mapping this document to a service or resource outside Veryfi.

package_path (string | null)

Possible values: non-empty

A path to a file in an S3 bucket, e.g. 'some/receipt.jpg'.

bucket (string | null)

Possible values: non-empty

An S3 bucket for 'package_path', e.g. 'documents'.

file_data (string | null)

Possible values: non-empty

Uploads a document as a base64-encoded string, either raw or in the data URI scheme. This is the least effective way to upload a document for processing; see file_urls or uploading zip files.

file_url (string | null)

Possible values: non-empty

A URL to a publicly accessible document to be sent to Veryfi for processing.

file_urls string[]

Possible values: non-empty

An array of URLs to publicly accessible documents to be sent to Veryfi for processing.

file_name (string | null)

Possible values: non-empty

An optional filename. Useful to determine file type.

file binary

A binary file. Submitting zipped documents through this parameter is the fastest way to process any document.
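Since zipped uploads are described as the fastest option, documents can be packed into an archive in memory before being sent as the file part. The sketch below uses only the standard library; zip_documents is a hypothetical helper, and how the archive's contents are handled server-side is not covered in this section.

```python
import io
import zipfile


def zip_documents(documents):
    """Pack a {file_name: bytes} mapping into a zip archive held in memory."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for name, content in documents.items():
            archive.writestr(name, content)
    return buffer.getvalue()
```

The returned bytes can then be supplied as the file field of a multipart/form-data request.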

meta.device_data.uuid object

Device unique identifier

anyOf

MOD1
MOD2

string

meta.device_data.user_uuid object

User unique identifier, like a digital fingerprint (hashed login) used to access the app where they upload their documents. Used in fraud detection.

anyOf

MOD1
MOD2

string

max_pages_to_process (integer | null)

Possible values: >= 1 and <= 50

Default value: 20

The number of pages to process for the document. The limit is 50 pages per document.

blueprint_name (string | null)

Possible values: non-empty

template_name (string | null) deprecated

Possible values: non-empty

Deprecated. The blueprint name which was used to extract the data. Same as blueprint_name.

Responses

Returns a processed ∀Doc.

application/json

Schema

anyOf

API_V8_PARTNER_ANYDOCUMENTS_ANYDOCUMENTRESPONSE

external_id (string | null)

Possible values: non-empty

A custom identification value. Use this if you would like to assign your own ID to documents. This parameter is useful when mapping this document to a service or resource outside Veryfi.

meta object

external_id (string | null)

Possible values: non-empty

A custom identification value. Use this if you would like to assign your own ID to documents. This parameter is useful when mapping this document to a service or resource outside Veryfi.

pages object[]

Array [

ocr_score (number | null)

Possible values: <= 1

The average OCR score of the page.

width (integer | null)

The width of the page.

height (integer | null)

The height of the page.

]

tags string[]

Possible values: non-empty

Default value: ``

Tags associated with the document.

ocr_score (number | null)

Possible values: <= 1

The average OCR score of the whole document.

model (string | null)

Possible values: non-empty

The version of the model used to process the document.

device_data ResponsesDeviceData

device data containing uuid

uuid object

Device unique identifier

anyOf

MOD1
MOD2

string

user_uuid object

User unique identifier, like a digital fingerprint (hashed login) used to access the app where they upload their documents. Used in fraud detection.

anyOf

MOD1
MOD2

string

duplicates object[]

An array of duplicate documents found in the system.

Array [

id integer required

The id of the duplicate document.

url uri required

Possible values: non-empty and <= 2083 characters

The url of the duplicate document.

score number required

Possible values: <= 1

How close the match is.

]
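The duplicates array can be filtered by score to surface only close matches. This is an illustrative sketch: strong_duplicates is a hypothetical helper, the 0.9 threshold is an arbitrary example value, and the function takes the duplicates list directly rather than assuming where it sits in the response.

```python
def strong_duplicates(duplicates, min_score=0.9):
    """Return duplicate entries scoring at least min_score, best match first."""
    matches = [d for d in (duplicates or []) if d["score"] >= min_score]
    return sorted(matches, key=lambda d: d["score"], reverse=True)
```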

fraud AnydocFraud

An object that contains additional information to help check for fraud.

score (number | null)

Possible values: <= 1

Confidence of the Fraud Detector in its prediction.

color (string | null)

Possible values: [green, yellow, red]

Color from Fraud Detector: green means legitimate, yellow means review needed and red means fraud

types string[]

Possible values: [LCD photo, screenshot, fraudulent pdf, not a document, generated document, ai generated, duplicate, multiple profiles or devices]

List of attributions which marked the document as fraud

pages object[]

An array containing fraud info about each extracted page

Array [

is_lcd ClassBoolField

score (number | null)

Possible values: <= 1

The score shows how confident the model is that the predicted value belongs to the field. See confidence scores explained for more information.

value boolean required

The extracted value.

flags object[]

List of flags which marked the document as fraud

Array [

score (number | null)

Possible values: <= 1

The score shows how confident the model is that the predicted value belongs to the field. See confidence scores explained for more information.

value string required

Possible values: non-empty

]

handwriting object[]

An array containing handwriting info about each extracted page

Array [

bounding_region number[] required

Possible values: exactly 8 items

Bounding region of the artifact in [x1,y1,x2,y2,x3,y3,x4,y4] format

type string required

Possible values: non-empty

Type of the artifact

score number

]
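The [x1,y1,x2,y2,x3,y3,x4,y4] format used by bounding_region can be unpacked into corner points, and from there into an axis-aligned box. A minimal sketch with hypothetical helper names; the coordinate units and origin are not specified in this section, so the code makes no assumption about them.

```python
def region_to_points(bounding_region):
    """Split a flat list of 8 numbers into four (x, y) corner points."""
    if len(bounding_region) != 8:
        raise ValueError("bounding_region must contain exactly 8 numbers")
    return [(bounding_region[i], bounding_region[i + 1]) for i in range(0, 8, 2)]


def enclosing_box(bounding_region):
    """Return (min_x, min_y, max_x, max_y) covering all four corners."""
    points = region_to_points(bounding_region)
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return min(xs), min(ys), max(xs), max(ys)
```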

digital_tampering object[]

An array containing digital tampering info about each extracted page

Array [

bounding_region number[] required

Possible values: exactly 8 items

Bounding region of the artifact in [x1,y1,x2,y2,x3,y3,x4,y4] format

type string required

Possible values: non-empty

Type of the artifact

score number

]
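The fraud object's color, score and types fields lend themselves to a simple triage step. The sketch below is one possible mapping, not Veryfi's recommendation: the action names and the choice to send unknown or missing colors to review are assumptions, and triage_fraud is a hypothetical helper that takes the fraud object directly.

```python
from typing import Optional

# green means legitimate, yellow means review needed, red means fraud.
ACTIONS = {"green": "accept", "yellow": "review", "red": "reject"}


def triage_fraud(fraud: Optional[dict]) -> dict:
    """Map a fraud object to an action, keeping the score and reasons."""
    fraud = fraud or {}
    return {
        # Unknown or missing colors fall back to manual review (an assumption).
        "action": ACTIONS.get(fraud.get("color"), "review"),
        "score": fraud.get("score"),
        "reasons": list(fraud.get("types") or []),
    }
```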

pdf_url uri

Possible values: non-empty and <= 2083 characters

A signed URL to access the auto-generated PDF created from the submitted document. This URL expires 15 minutes after the response object is returned and is re-signed during every GET request.
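Because signed URLs expire 15 minutes after the response is returned, a client that stores them may want to check their age before use. A minimal sketch, assuming the client records when it received the response; signed_url_is_fresh and the one-minute safety margin are hypothetical.

```python
from datetime import datetime, timedelta, timezone

SIGNED_URL_LIFETIME = timedelta(minutes=15)


def signed_url_is_fresh(received_at, now=None, safety_margin=timedelta(minutes=1)):
    """Return True if a signed URL received at received_at should still work."""
    now = now or datetime.now(timezone.utc)
    # Subtract a margin so the URL is not used right at its expiry time.
    return now - received_at < SIGNED_URL_LIFETIME - safety_margin
```

When this returns False, fetching the document again should yield a re-signed URL, as described above.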

id integer required

The unique number created to identify the document.

created_date date-time required

updated_date date-time required

img_thumbnail_url uri

Possible values: non-empty and <= 2083 characters

A signed URL to access the auto-generated thumbnail created for the submitted document. This URL expires 15 minutes after the response object is returned and is re-signed during every GET request.

text (string | null)

The text returned from converting the document into a machine-readable text format.

blueprint_name string required

Possible values: non-empty

The blueprint name which was used to extract the data. Sample blueprints: [ "auto_insurance_card", "bill_of_lading", "flight_itinerary", "goods_received_note", "incorporation_document", "incorporation_document_latam", "indian_passport", "latam_passport", "prescription_medication_label", "product_nutrition_facts", "restaurant_menu", "shipping_label", "uk_drivers_license", "us_driver_license", "us_health_insurance_card", "us_passport", "vehicle_registration", "vendor_statement", "work_order"]

template_name string deprecated

Possible values: non-empty

Deprecated. The blueprint name which was used to extract the data. Same as blueprint_name.

User error

application/json

Schema

oneOf

FAILED_TO_PROCESS_DOCUMENTS
FAILED_TO_EXTRACT

status string

Default value: fail

error string

Default value: Failed to process documents

status string

Default value: fail

error string

Default value: Failed to extract

User error

application/json

Schema

oneOf

AUTHENTICATION_CREDENTIALS_WERE_NOT_PROVIDED

status string

Default value: fail

error string

Default value: Authentication credentials were not provided.

Not found

application/json

Schema

oneOf

NOT_FOUND
DOCUMENT_NOT_FOUND

status string

Default value: fail

error string

Default value: Not found.

status string

Default value: fail

error string

Default value: Document Not Found

Request body too large

application/json

Schema

oneOf

REQUEST_BODY_TOO_LARGE

status string

Default value: fail

error string

Default value: Request body too large

Rate limit

application/json

Schema

oneOf

YOU_HAVE_BEEN_RATE_LIMITED

status string

Default value: fail

error string

Default value: You have been rate limited

details object[]
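A client can recognize the rate-limit response by its documented error message and retry with increasing delays. This is a sketch of exponential backoff, not a Veryfi-prescribed policy: call_with_backoff is a hypothetical helper, send stands for any caller-supplied function that performs the request and returns the parsed JSON body, and the delay values are arbitrary.

```python
import time

RATE_LIMIT_MESSAGE = "You have been rate limited"


def call_with_backoff(send, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call send() until it is not rate limited or attempts run out."""
    body = None
    for attempt in range(max_attempts):
        body = send()
        limited = body.get("status") == "fail" and body.get("error") == RATE_LIMIT_MESSAGE
        if not limited:
            return body
        if attempt < max_attempts - 1:
            # Double the wait after each rate-limited attempt: 1s, 2s, 4s, ...
            sleep(base_delay * 2 ** attempt)
    return body  # still rate limited after the final attempt
```

The sleep parameter is injectable so the retry logic can be exercised without real waiting.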

User error

application/json

Schema

oneOf

CLIENT_CLOSED_REQUEST_OR_LOST_CONNECTION

status string

Default value: fail

error string

Default value: Client closed request or lost connection

Service is temporarily unavailable

application/json

Schema

oneOf

SERVICE_IS_TEMPORARILY_UNAVAILABLE_PLEASE_TRY_AGAIN_LATER

status string

Default value: fail

error string

Default value: Service is temporarily unavailable. Please try again later

Gateway timeout. Returned if request takes more than 100 seconds. The request might finish successfully later.

application/json

Schema

oneOf

GATEWAY_TIMEOUT

status string

Default value: fail

error string

Default value: Gateway timeout

General error response

application/json

Schema
Example (from schema)

Schema

status string

Default value: fail

error string required

details object[]

{
  "status": "fail",
  "error": "string"
}
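All error responses above share the status and error fields, so a single check can turn them into an exception. A minimal sketch; AnyDocError and raise_for_failure are hypothetical names, not part of any Veryfi SDK.

```python
class AnyDocError(Exception):
    """Raised when a response body reports status "fail"."""

    def __init__(self, error, details=None):
        super().__init__(error)
        self.error = error
        self.details = details or []


def raise_for_failure(body):
    """Return the body unchanged, or raise AnyDocError for a failed response."""
    if body.get("status") == "fail":
        raise AnyDocError(body.get("error", "unknown error"), body.get("details"))
    return body
```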
