If you only need a markdown representation of a given document, you can disable chunking altogether and read response['result']['chunks'][0]['content'].
Attribute breakdown
type: Either “full” or “url”. Defines how the chunks are delivered (inline or via URL).
chunks: Present only when type is “full”. An array of chunk objects.
  content: The content of the chunk extracted from the document.
  embed: Chunk content optimized for embedding and retrieval. For example, it differs from content when figure/table summaries are enabled.
  enriched: The enriched content of the chunk extracted from the document.
  blocks: Array of block-level structures, such as Text, Table, or Figure blocks.
    type: Block type indicating the content category, e.g. “Text”, “Table”, “Figure”.
    bbox: Bounding box information for the block, using normalized or spreadsheet-based coordinates.
      left: Normalized horizontal start coordinate (0–1 range), or column index in spreadsheets.
      top: Normalized vertical start coordinate (0–1 range), or row index in spreadsheets.
      width: Normalized width relative to page size (0–1 range), or column width count in spreadsheets.
      height: Normalized height relative to page size (0–1 range), or row height count in spreadsheets.
      page: The parsed page number (1-indexed). This is the page index, or the sheet index for spreadsheets.
      original_page: The original page number from the source document, useful when filtering or slicing page ranges.
    content: The actual content of the block: text, table HTML, or figure caption/data.
    image_url: Presigned URL (if enabled) to download the block’s figure or table image; may expire (~24 h).
    confidence: Either “low” or “high”. A coarse confidence label for the block.
    logprobs_confidence: Numeric confidence score based on logprobs and OCR confidence.
URL: Present only when type is “url”. The endpoint from which to fetch the chunk JSON remotely.
Code snippet example
The Parse output has one of two types: full or url. The full type contains all the content and block information inline. Reducto switches to type=url when the response size is close to exceeding the maximum allowed HTTP response size (6 MB on our current infrastructure).
Set force_url_result to true to always return a URL.
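Because a response may arrive as either type, client code usually needs one code path that handles both. The following is a minimal Python sketch of that idea, not an official client. It assumes the response has already been decoded into a dict, that the link in a type=url result is stored under a key named "url", and that the remote JSON is either the chunk list itself or an object wrapping it under "chunks". The helper names resolve_chunks, document_markdown, and fetch_json are hypothetical.

```python
import json
from typing import Any, Callable
from urllib.request import urlopen


def fetch_json(url: str) -> Any:
    """Default fetcher: download a URL and decode its body as JSON."""
    with urlopen(url) as resp:
        return json.load(resp)


def resolve_chunks(result: dict, fetch: Callable[[str], Any] = fetch_json) -> list:
    """Return the chunk list for a result of type "full" or "url"."""
    if result["type"] == "full":
        return result["chunks"]
    if result["type"] == "url":
        # ASSUMPTION: the download link lives under the key "url".
        payload = fetch(result["url"])
        # ASSUMPTION: the remote JSON is either a bare list of chunks
        # or an object that carries the list under "chunks".
        return payload["chunks"] if isinstance(payload, dict) else payload
    raise ValueError(f"unexpected result type: {result['type']!r}")


def document_markdown(response: dict, fetch: Callable[[str], Any] = fetch_json) -> str:
    """Join the content of all chunks; with chunking disabled there is one chunk."""
    chunks = resolve_chunks(response["result"], fetch)
    return "\n\n".join(chunk["content"] for chunk in chunks)
```

Passing the fetcher as a parameter keeps the sketch testable without network access; a caller can substitute any HTTP client that returns decoded JSON.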
{
  "job_id": "abcdef123456",
  "duration": 5,
  "pdf_url": null,
  "studio_link": "https://studio.reducto.ai/job/xyz",
  "usage": {
    "num_pages": 1,
    "credits": 0.5
  },
  "result": {
    "type": "full",
    "chunks": [
      {
        "content": "Detailed content for full LLM usage.",
        "embed": "Summarized content optimized for embeddings.",
        "enriched": "Additional semantic enrichment details.",
        "enrichment_success": true,
        "blocks": [
          {
            "type": "Header",
            "bbox": {
              // left, top, width, and height are normalized to the [0,1] range
              "left": 0.1, // Distance from left edge
              "top": 0.05, // Distance from top edge
              "width": 0.8, // Width as a fraction of page width
              "height": 0.1, // Height as a fraction of page height
              "page": 1, // Current page number (1-indexed)
              "original_page": 10 // Original doc page number
            },
            "content": "Text content",
            "image_url": null, // presigned url to download figure/table image
            "confidence": "High",
            "logprobs_confidence": 0.92
          } ...
        ] ...
      } ...
    ]
  }
}
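To show how the block fields in the example above might be consumed, here is a short Python sketch that scales a normalized bbox to pixel coordinates and keeps only high-confidence blocks. The function names and the page dimensions are hypothetical, and the pixel conversion applies only to page-based documents (for spreadsheets the same fields hold row and column indices). The confidence label is compared case-insensitively, since the example shows "High" while the attribute description lists “low” and “high”.

```python
from typing import Iterator, Tuple


def bbox_to_pixels(bbox: dict, page_width: int, page_height: int) -> Tuple[int, int, int, int]:
    """Scale a normalized bbox to (left, top, right, bottom) in pixels.

    Only meaningful for page-based documents, where left/top/width/height
    are fractions of the page size in the 0-1 range.
    """
    left = bbox["left"] * page_width
    top = bbox["top"] * page_height
    right = left + bbox["width"] * page_width
    bottom = top + bbox["height"] * page_height
    return (round(left), round(top), round(right), round(bottom))


def high_confidence_blocks(chunks: list) -> Iterator[dict]:
    """Yield every block whose confidence label is "high" (any letter case)."""
    for chunk in chunks:
        for block in chunk.get("blocks", []):
            if str(block.get("confidence", "")).lower() == "high":
                yield block


# Usage with the bbox from the example, on a hypothetical 1000 x 2000 pixel page.
example_bbox = {"left": 0.1, "top": 0.05, "width": 0.8, "height": 0.1,
                "page": 1, "original_page": 10}
print(bbox_to_pixels(example_bbox, 1000, 2000))  # (100, 100, 900, 300)
```

The page size must come from wherever the page image is rendered, since the response shown here carries only normalized coordinates.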