Instructor: Structured LLM Outputs


Difficulty
Path(s)	AI/ML
Prereqs	Jinja2, Pydantic
Unlocks	DSPy Series, RLM Project, marketing-bot
Repo	pysprings/lightning-talks/instructor

Overview

The Instructor library patches the OpenAI client so that chat completion calls accept a response_model argument and return a validated Pydantic model instead of raw text. This talk demonstrates it by building a tool that reads source code and uses an LLM to generate CRC (Class-Responsibility-Collaboration) card diagrams.

Key Concepts

Structured outputs — Force LLMs to return data matching a Pydantic schema
instructor.patch() — Wraps the OpenAI client to add response_model support
Prompt templates — Jinja2 templates for dynamic prompt construction
Pydantic validation — Type-checked, validated responses from the LLM
Pipeline pattern — Source code → prompt template → LLM → Pydantic model → Graphviz diagram
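
As a rough illustration of the prompt-template step, the sketch below renders a Jinja2 template into the prompt string that is later sent to the LLM. The template text, the `render_prompt` helper, and the `filename` and `source` variables are assumptions made for this example; the talk's actual prompt lives in prompt.txt and may use different variables.

```python
from jinja2 import Template

# Hypothetical template text; the talk's real prompt is stored in prompt.txt.
PROMPT_TEMPLATE = Template(
    "Identify the classes in the file {{ filename }} and describe each one "
    "as a CRC card.\n\n"
    "Source code:\n{{ source }}\n"
)


def render_prompt(filename: str, source: str) -> str:
    """Fill the template with the file name and its source text."""
    return PROMPT_TEMPLATE.render(filename=filename, source=source)


rendered_prompt = render_prompt("example.py", "class Greeter:\n    pass\n")
print(rendered_prompt)
```

Keeping the prompt in a template means the wording can change without touching the Python code that calls the LLM.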

The Code

crc-cards.py — Main script
prompt.txt — Jinja2 prompt template

Pydantic Models

from pydantic import BaseModel

class Collaborator(BaseModel):
    name: str
    description: str

class Card(BaseModel):
    name: str
    responsibilities: str
    collaborators: list[Collaborator]

class CardStack(BaseModel):
    cards: list[Card]
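
To see what validation buys you without calling an LLM, the sketch below feeds hand-written JSON to the same models. It assumes Pydantic v2 (for `model_validate_json`) and Python 3.9+ (for `list[...]` annotations); the sample JSON strings are invented for illustration.

```python
from pydantic import BaseModel, ValidationError


class Collaborator(BaseModel):
    name: str
    description: str


class Card(BaseModel):
    name: str
    responsibilities: str
    collaborators: list[Collaborator]


class CardStack(BaseModel):
    cards: list[Card]


# Well-formed JSON parses into nested, typed model instances.
good = (
    '{"cards": [{"name": "Parser", "responsibilities": "Tokenize input", '
    '"collaborators": [{"name": "Lexer", "description": "Supplies tokens"}]}]}'
)
stack = CardStack.model_validate_json(good)
print(stack.cards[0].collaborators[0].name)

# JSON that is missing required fields is rejected with a ValidationError.
bad = '{"cards": [{"name": "Parser"}]}'
try:
    CardStack.model_validate_json(bad)
    rejected = False
except ValidationError as exc:
    rejected = True
    print(exc)
```

The error message names each missing field, which is the kind of feedback a structured-output library can pass back to the model when it asks for a corrected answer.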

Structured LLM Call

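import instructor
from openai import OpenAI

# CardStack is the model defined above; rendered_prompt is the prompt
# string produced by rendering prompt.txt with Jinja2.
# OpenAI() reads the API key from the OPENAI_API_KEY environment variable.
client = instructor.patch(OpenAI())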
response = client.chat.completions.create(
    model="gpt-4o",
    response_model=CardStack,  # <-- Forces structured output
    messages=[{"role": "user", "content": rendered_prompt}],
    temperature=0,
)
# response is a validated CardStack instance, not raw text
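
One way to keep the call above reusable and testable is to wrap it in a small function that receives the patched client as an argument. The `extract` helper below is a hypothetical sketch, not part of the talk's script. It also passes `max_retries`, on the assumption that your installed Instructor version accepts it and uses it to re-ask the model when validation fails.

```python
from typing import Any


def extract(client: Any, prompt: str, response_model: type, model: str = "gpt-4o") -> Any:
    """Send one user message and return whatever the patched client produces.

    `client` is expected to be an Instructor-patched OpenAI client, so the
    return value should be an instance of `response_model`.
    """
    return client.chat.completions.create(
        model=model,
        response_model=response_model,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
        max_retries=2,  # assumption: Instructor retries when validation fails
    )
```

With the names used in this talk, the call would look like `extract(instructor.patch(OpenAI()), rendered_prompt, CardStack)`. Because the client is injected, a stub can stand in for it in unit tests, so no API key is needed.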

The Pipeline

Source File → Jinja2 Template → Rendered Prompt → LLM → CardStack → Graphviz DOT
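
The last arrow, CardStack to Graphviz DOT, can be sketched with plain string building. This is an illustrative version under stated assumptions, not the talk's implementation: the `to_dot` and `esc` helpers are hypothetical, and the input is a plain dict shaped like the output of `CardStack.model_dump()`, so the example runs without any third-party packages.

```python
def esc(text: str) -> str:
    """Escape backslashes and double quotes so text stays inside one quoted DOT string."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_dot(stack: dict) -> str:
    """Render each card as a box node and each collaboration as a labeled edge."""
    lines = ["digraph crc {", "  node [shape=box];"]
    for card in stack["cards"]:
        name = esc(card["name"])
        # A backslash followed by "n" inside a DOT label is Graphviz's line break.
        label = name + "\\n" + esc(card["responsibilities"])
        lines.append(f'  "{name}" [label="{label}"];')
        for collab in card["collaborators"]:
            target = esc(collab["name"])
            edge_label = esc(collab["description"])
            lines.append(f'  "{name}" -> "{target}" [label="{edge_label}"];')
    lines.append("}")
    return "\n".join(lines)


# Sample data invented for illustration.
stack = {
    "cards": [
        {
            "name": "Parser",
            "responsibilities": "Tokenize input",
            "collaborators": [{"name": "Lexer", "description": "supplies tokens"}],
        }
    ]
}
dot = to_dot(stack)
print(dot)
```

The resulting text can be saved to a .dot file and rendered with the Graphviz command-line tools, or handed to the graphviz Python package.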

Try It

Clone: git clone https://github.com/pysprings/lightning-talks
Enter the talk directory: cd lightning-talks/instructor
Install deps: pip install instructor openai pydantic jinja2 graphviz
Set your OpenAI API key: export OPENAI_API_KEY=sk-...
Run: python crc-cards.py some_python_file.py
Challenge: Add a new Pydantic model for function signatures and modify the prompt to extract them too
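
If you want a starting point for the challenge, one possible shape for the new models is sketched below. The `Parameter`, `FunctionSignature`, and `SignatureList` names and their fields are hypothetical; pick whatever structure suits the prompt you write.

```python
from typing import Optional

from pydantic import BaseModel


class Parameter(BaseModel):
    name: str
    annotation: Optional[str] = None  # type hint as written in the source, if any


class FunctionSignature(BaseModel):
    name: str
    parameters: list[Parameter]
    returns: Optional[str] = None  # return annotation, if any


class SignatureList(BaseModel):
    functions: list[FunctionSignature]


# Example instance, built by hand to show the expected shape.
sig = FunctionSignature(
    name="render_prompt",
    parameters=[Parameter(name="source", annotation="str")],
    returns="str",
)
print(sig.model_dump())
```

The remaining work is to update the prompt so the LLM is asked for function signatures, and to pass the new model (or a model that combines it with CardStack) as response_model.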

Where to Go Next

DSPy Mastery Series — Systematic AI development framework that builds on these structured output concepts
RLM Project — LLM orchestration with code execution
marketing-bot — Clean architecture for AI planning systems