Batch Service - Pinkfish - AI Agents & Workflows for Getting Work Done

The Batch Service allows you to break down large jobs into manageable pieces and process them through queues for parallel execution.

How Batch Works

Break down large jobs into manageable pieces and process them through queues.

Create batch with unique batchId containing work items
Assign workflow to process each item
Items placed in queue for parallel processing
Track completion and trigger actions when batch finishes

Batch Features

Unique Identification: Each batch gets a unique batchId
Flexible Item Structure: Define custom data for each item
Completion Actions: Configure triggers, webhooks, or no action when batch completes
Real-time Monitoring: Track batch progress and individual item status

Large CSV uploads

Batches can also be created from a CSV file. Three ingest paths are supported, depending on the size of the file:

Path	When to use	Limits
Inline `csvData`	Small batches submitted directly with the API request	10 MB request body
`csvUrl`	The file is already hosted at a fetchable URL	10 MB after fetch
`csvKey`	Browser-direct upload via a presigned S3 URL (the right choice for files larger than the inline cap)	≤ 1 GiB file size · ≤ 500,000 rows per batch

Browser-direct CSV upload flow

To use the csvKey path:

Request a presigned upload URL. The request body declares the filename and exact byte size of the file you’re about to upload; the platform validates against the 1 GiB cap and pre-signs an S3 PUT URL bound to that exact Content-Length. POST /api/queues/{queueId}/items/csv/upload-url
{ "filename": "leads.csv", "sizeBytes": 47185920 }
Response:
{ "uploadUrl": "https://...amazonaws.com/...?X-Amz-Signature=...", "csvKey": "{orgId}/{queueId}/{uuid}.csv", "expiresAt": "2026-05-14T03:20:00Z" }
The csvKey is server-controlled (the platform generates the UUID; the customer never supplies any part of it). The presigned URL expires in 5 minutes.
Upload the file directly to S3 with an HTTP PUT to uploadUrl. The presigned URL is bound to the exact sizeBytes you declared and to Content-Type: text/csv, so the request must set both headers:
PUT <uploadUrl> Content-Type: text/csv Content-Length: <sizeBytes from step 1>
Mismatched headers cause an S3 signature failure. The browser’s XMLHttpRequest.upload.onprogress event can drive an upload progress bar; no traffic flows through the API.
Create the batch by posting to the existing CSV-ingest endpoint with the returned csvKey and the workflow trigger to run for each row: POST /api/queues/{queueId}/items/csv
{ "csvKey": "<from step 1>", "triggerId": "trg_..." }
triggerId is required — it identifies the workflow trigger the batch’s per-row Runs will fire. Additional optional fields include name (display label), onDoneTriggerId (run a follow-up trigger when the batch completes), and onDoneWebhookUrl (POST to a URL on completion). The platform’s batch-processor then validates the CSV header, chunks the file into ~1,000-row segments, and creates one workflow Run per row on the customer’s queue.

What happens during ingest

A CSV-driven batch moves through three statuses on its way to running:

validating — the batch-processor reads just the head of the CSV and confirms the column shape. Fast (sub-second on typical files).
pending — the chunker streams the full file to count rows, enforces the 500,000-row cap, and publishes per-chunk row-creation jobs to an internal SQS queue. A very large CSV can sit in this state while counting. A batch that exceeds the row cap is canceled here with reason row_count_exceeded.
running — chunks are being processed in parallel; the customer’s queue starts dispatching Runs as soon as each chunk’s rows land.

If a batch is canceled (either by the user or by a queue delete) before row-creation finishes, any chunks still in flight abort cleanly. Pending rows that were already created are lazy-flipped out of the queue without dispatching.

Per-row errors

Rows that fail validation are recorded as individual BatchRowError records rather than failing the entire batch. The two failure modes shipped in v1 are:

column_count — the row has more or fewer columns than the CSV header
unterminated_quote — the CSV parser hit an unterminated quoted-field run while reading this row

Retrieve them with: GET /api/queues/{queueId}/batches/{batchId}/row-errors?limit=&cursor= The response paginates with an opaque cursor; pass it back as the next call’s cursor parameter to walk through all errors. Counts of valid runs created and rows skipped are also surfaced on the Batch record itself (runsRequested, runsSkipped).

Queue Integration

Batch service integrates with Queue service:

Batch items are placed into queues for processing
Queue workers process individual items
Queue tracking monitors batch completion
Completion triggers fire when batches finish

Example: Producer-consumer pattern showing how batches break down big jobs into manageable queue items for parallel processing.

For practical examples of using batches to handle complex workflows, see Breaking Up Big Jobs.

Questions? Reach out on Discord https://discord.com/invite/HaDg7R4VZG

​How Batch Works

​Batch Features

​Large CSV uploads

​Browser-direct CSV upload flow

​What happens during ingest

​Per-row errors

​Queue Integration