UnstructuredClient
object’s workflows.list_workflows
function (for the Python SDK) or
the GET
method to call the /workflows
endpoint (for curl
or Postman). Learn more.UnstructuredClient
object’s workflows.get_workflow
function (for the Python SDK) or
the GET
method to call the /workflows/<workflow-id>
endpoint (for curl
or Postman)use the GET
method to call the /workflows/<workflow-id>
endpoint. Learn more.UnstructuredClient
object’s workflows.create_workflow
function (for the Python SDK) or
the POST
method to call the /workflows
endpoint (for curl
or Postman). Learn more.UnstructuredClient
object’s workflows.run_workflow
function (for the Python SDK) or
the POST
method to call the /workflows/<workflow-id>/run
endpoint (for curl
or Postman). Learn more.UnstructuredClient
object’s workflows.update_workflow
function (for the Python SDK) or
the PUT
method to call the /workflows/<workflow-id>
endpoint (for curl
or Postman). Learn more.UnstructuredClient
object’s workflows.delete_workflow
function (for the Python SDK) or
the DELETE
method to call the /workflows/<workflow-id>
endpoint (for curl
or Postman). Learn more.UnstructuredClient
object’s workflows.create_workflow
function (for the Python SDK) or
the POST
method to call the /workflows
endpoint (for curl
or Postman).
In the CreateWorkflow
object (for the Python SDK) or
the request body (for curl
or Postman),
specify the settings for the workflow, as follows:
Python SDK (remote source and remote destination)
Python SDK (local source and local destination)
source_id
or destination_id
value. Also, the workflow_type
must be set to CUSTOM
.curl
or Postman. Learn how.Python SDK (local source and remote destination)
destination_id
value, and do not specify a source_id
value. Also, the workflow_type
must be set to CUSTOM
.curl
or Postman. Learn how.Python SDK (async) (remote source and remote destination)
Python SDK (async) (local source and local destination)
source_id
or destination_id
value. Also, the workflow_type
must be set to CUSTOM
.curl
or Postman. Learn how.Python SDK (async) (local source and remote destination)
destination_id
value, and do not specify a source_id
value. Also, the workflow_type
must be set to CUSTOM
.curl
or Postman. Learn how.curl (remote source and remote destination)
curl (local source and local destination)
source_id
or destination_id
value. Also, the workflow_type
must be set to custom
.curl
(or Postman). Learn how.curl (local source and remote destination)
destination_id
value, and do not specify a source_id
value. Also, the workflow_type
must be set to custom
.curl
(or Postman). Learn how.Postman (remote source and remote destination)
unstructured-api-key
, Value: {{UNSTRUCTURED_API_KEY}}
accept
, Value: application/json
Postman (local source and local destination)
source_id
or destination_id
value. Also, the workflow_type
must be set to custom
.curl
). Learn how.unstructured-api-key
, Value: {{UNSTRUCTURED_API_KEY}}
accept
, Value: application/json
Postman (local source and remote destination)
destination_id
value, and do not specify a source_id
value. Also, the workflow_type
must be set to custom
.curl
). Learn how.unstructured-api-key
, Value: {{UNSTRUCTURED_API_KEY}}
accept
, Value: application/json
<name>
(required) - A unique name for this workflow.
<source-connector-id>
(required) - The ID of the target source connector. To get the ID,
use the UnstructuredClient
object’s sources.list_sources
function (for the Python SDK) or
the GET
method to call the /sources
endpoint (for curl
or Postman). Learn more.
<destination-connector-id>
(required) - The ID of the target destination connector. To get the ID,
use the UnstructuredClient
object’s destinations.list_destinations
function (for the Python SDK) or
the GET
method to call the /destinations
endpoint (for curl
or Postman). Learn more.
<TYPE>
(for the Python SDK) or <type>
(for curl
or Postman) (required) - The workflow type. Available values include CUSTOM
(for the Python SDK) and custom
(for curl
or Postman).
If <TYPE>
is set to CUSTOM
(for the Python SDK), or if <type>
is set to custom
(for curl
or Postman), you must add a workflow_nodes
array. For instructions, see Custom workflow DAG nodes.
ADVANCED
, BASIC
, and PLATINUM
(for the Python SDK) and
advanced
, basic
, and platinum
(for curl
or Postman) are non-operational and planned to be fully removed in a future release.The ability to create an automatic workflow type is currently not available but is planned to be added in a future release.<schedule-timeframe>
- The repeating automatic run schedule, specified as a predefined phrase. The available predefined phrases are:
every 15 minutes
(for curl
or Postman): Every 15 minutes (cron expression: */15 * * * *
).every hour
: At the first minute of every hour (cron expression: 0 * * * *
).every 2 hours
: At the first minute of every second hour (cron expression: 0 */2 * * *
).every 4 hours
: At the first minute of every fourth hour (cron expression: 0 */4 * * *
).every 6 hours
: At the first minute of every sixth hour (cron expression: 0 */6 * * *
).every 8 hours
: At the first minute of every eighth hour (cron expression: 0 */8 * * *
).every 10 hours
: At the first minute of every tenth hour (cron expression: 0 */10 * * *
).every 12 hours
: At the first minute of every twelfth hour (cron expression: 0 */12 * * *
).daily
: At the first minute of every day (cron expression: 0 0 * * *
).weekly
: At the first minute of every Sunday (cron expression: 0 0 * * 0
).monthly
: At the first minute of the first day of every month (cron expression: 0 0 1 * *
).schedule
is not specified, the workflow does not automatically run on a repeating schedule.
Workflows with a local source cannot be set to run on a repeating schedule.
UnstructuredClient
object’s workflows.update_workflow
function (for the Python SDK) or
the PUT
method to call the /workflows/<workflow-id>
endpoint (for curl
or Postman), replacing
<workflow-id>
with the workflow’s unique ID. To get this ID, see List workflows.
In the request body, specify the settings for the workflow. For the specific settings to include, see
Create a workflow.
Python SDK
Python SDK (async)
curl
Postman
unstructured-api-key
, Value: {{UNSTRUCTURED_API_KEY}}
accept
, Value: application/json
WorkflowType
is set to CUSTOM
(for the Python SDK), or if workflow_type
is set to custom
(for curl
or Postman), you must also specify the settings for the workflow’s
directed acyclic graph (DAG) nodes. These nodes’ settings are specified in the workflow_nodes
array.
source_id
value outside of theworkflow_nodes
array.destination_id
value outside of the
workflow_nodes
array.workflow_nodes
array will be the same order that these nodes appear in the DAG,
with the first node in the array added directly after the Source node. The Destination node
follows the last node in the array.type
of partition
.
Learn about the available partitioning strategies.
Python SDK
curl, Postman
Python SDK
curl, Postman
provider
and model
include:
"provider": "anthropic"
"model": "claude-3-5-sonnet-20241022"
"provider": "openai"
"model": "gpt-4o"
"provider": "bedrock"
"model": "us.anthropic.claude-3-5-sonnet-20241022-v2:0"
"model": "us.anthropic.claude-3-opus-20240229-v1:0"
"model": "us.anthropic.claude-3-haiku-20240307-v1:0"
"model": "us.anthropic.claude-3-sonnet-20240229-v1:0"
"model": "us.amazon.nova-pro-v1:0"
"model": "us.amazon.nova-lite-v1:0"
"model": "us.meta.llama3-2-90b-instruct-v1:0"
"model": "us.meta.llama3-2-11b-instruct-v1:0"
"provider": "vertexai"
"model": "gemini-2.0-flash-001"
Python SDK
curl, Postman
Python SDK
curl, Postman
type
of chunk
.
Learn about the available chunking strategies.
Python SDK
curl, Postman
Python SDK
curl, Postman
Python SDK
curl, Postman
Python SDK
curl, Postman
type
of prompter
.
Learn about the available enrichments.
Python SDK
curl, Postman
<subtype>
include:
openai_image_description
anthropic_image_description
bedrock_image_description
vertexai_image_description
Python SDK
curl, Postman
<subtype>
include:
openai_table_description
anthropic_table_description
bedrock_table_description
vertexai_table_description
Python SDK
curl, Postman
Python SDK
curl, Postman
type
of embed
.
Learn about the available embedding providers and models.
Python SDK
curl, Postman
subtype
and model_name
include:
"subtype": "azure_openai"
"model_name": "text-embedding-3-small"
"model_name": "text-embedding-3-large"
"model_name": "text-embedding-ada-002"
"subtype": "bedrock"
"model_name": "amazon.titan-embed-text-v2:0"
"model_name": "amazon.titan-embed-text-v1"
"model_name": "amazon.titan-embed-image-v1"
"model_name": "cohere.embed-english-v3"
"model_name": "cohere.embed-multilingual-v3"
"subtype": "togetherai"
"model_name": "togethercomputer/m2-bert-80M-2k-retrieval"
"model_name": "togethercomputer/m2-bert-80M-8k-retrieval"
"model_name": "togethercomputer/m2-bert-80M-32k-retrieval"