Parse Async - Reducto

import requests url = "https://platform.reducto.ai/parse_async" payload = { "input": "<string>", "async": { "priority": False }, "enhance": { "agentic": [], "summarize_figures": True, "intelligent_ordering": False }, "retrieval": { "chunking": { "chunk_mode": "disabled", "chunk_overlap": 0 }, "filter_blocks": [], "embedding_optimized": False }, "formatting": { "add_page_markers": False, "table_output_format": "dynamic", "merge_tables": False, "include": [] }, "spreadsheet": { "split_large_tables": { "enabled": True, "size": 50 }, "include": [], "clustering": "accurate", "exclude": [] }, "settings": { "ocr_system": "standard", "extraction_mode": "hybrid", "force_url_result": False, "return_ocr_data": False, "return_images": [], "embed_pdf_metadata": False, "embed_pdf_metadata_dpi": 100, "persist_results": False }, "queue_priority": "auto" } headers = { "Authorization": "Bearer <token>", "Content-Type": "application/json" } response = requests.post(url, json=payload, headers=headers) print(response.text)

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Headers

user-id

string | null

Body

application/json

input

required

For parse/split/extract pipelines, the URL of the document to be processed. You can provide one of the following: 1. A publicly available URL 2. A presigned S3 URL 3. A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document 4. A jobid:// prefixed URL obtained from a previous /parse invocation 5. A list of URLs (for multi-document pipelines, V3 API only)

For edit pipelines, this should be a string containing the edit instructions

async

AsyncConfig · object

The configuration options for asynchronous processing (default synchronous).

Show child attributes

enhance

Enhance · object

Show child attributes

retrieval

Retrieval · object

Show child attributes

formatting

Formatting · object

Show child attributes

spreadsheet

Spreadsheet · object

Show child attributes

settings

Settings · object

Show child attributes

queue_priority

enum<string>

default:auto

Queue priority. 'batch' for non-urgent work that processes when spare GPU capacity is available.

Available options:

auto,

batch

Response

Successful Response

job_id

string

required

Documentation Index

Authorizations

Headers

Body

Response