Formatting Output and Constraints

Listen to the full lesson

AI Narration

Quick Summary

Modern LLMs support strict format controls: JSON mode, schemas, regex constraints, and length bounds. Programmatic AI features depend on formats that machines can parse, not just humans can read.

What you will learn

·Control output format, length, and structure reliably
·Use JSON mode and structured outputs for programmatic processing
·Understand how constraints improve consistency

One of the most practically valuable prompting skills is controlling output format. LLMs are trained on diverse text and will produce diverse formats by default — sometimes prose, sometimes bullets, sometimes markdown, sometimes JSON. For production applications, you need consistent, predictable output structure that your code can reliably parse.

The simplest approach is explicit format instructions at the end of the prompt: "Return your response as a JSON object with the following fields: summary (string, max 2 sentences), sentiment (one of: positive, negative, neutral), confidence (number 0.0-1.0), key_topics (array of strings, max 5)." Modern frontier models follow detailed format instructions very reliably.

For programmatic use, the OpenAI API offers "structured outputs" — you provide a JSON schema and the model is constrained to produce valid JSON that matches exactly. Anthropic offers similar functionality. This eliminates parsing failures in production code, where a model that occasionally outputs prose instead of JSON can break an entire pipeline.

Length constraints help focus responses. "In exactly 3 bullet points" is more reliable than "briefly." "In 2-3 sentences" is better than "concisely." Word count ranges work well: "between 150 and 200 words." For documents with multiple sections, specify the length for each: "Write a 1-sentence hook, followed by a 3-paragraph body (each 60-80 words), followed by a 1-sentence call to action."

Key Insights

Explicit format instructions at the end of prompts dramatically improve consistency
JSON mode / structured outputs eliminate parsing failures in production pipelines
Length ranges work better than vague words like 'briefly' or 'concisely'
Specify format per-section for structured documents: hook (1 sentence), body (3 paragraphs)...
More constraints narrow the solution space — LLMs perform better with clearer targets

Why It Matters

The line between an AI demo and an AI product is usually format reliability. A free-text response that occasionally drops a field will crash a downstream pipeline. Investing in structured outputs upfront — and validating them — turns AI features from "works most of the time" into something you can confidently put on a critical user path.